#Gemini

27 articles

Tech · May 11, 2026 · 10 min

Gemini API multimodal File Search as game NPC memory: metadata filters, store tiers, and a cost estimate

Gemini API File Search now indexes images alongside text in the same store. Metadata filters can isolate NPC memories by chapter and character, and a single-character prototype costs under $1/month on Flash-Lite. Notes on tier limits, pricing breakdown, and what to test first.

AI Gemini RAG API Game

Tech · May 4, 2026 · 11 min

Fine-Tuning Reignites Verbatim Memorization of Copyrighted Books in LLMs

An arXiv paper reports that fine-tuning GPT-4o, Gemini 2.5 Pro, and DeepSeek-V3.1 on summary-to-text expansion tasks increases verbatim reproduction of copyrighted books.

AI LLM Copyright OpenAI Gemini DeepSeek Fine-tuning Paper

Tech · May 2, 2026 · 14 min

Lessons from VoteWise AI on Building Election Guides with Next.js and Gemini 2.5 Flash

VoteWise AI turns election education into a multilingual chat, voice, and story-mode experience built on Next.js. Notes on designing around Gemini 2.5 Flash's safety filters in a political context.

AI Gemini Next.js Firebase Google Cloud Design

Tech · May 2, 2026 · 12 min

Kana Chat v3 and Leaning Into Blog-Specific Use

From v2 to v3 of Kana Chat, an AI agent built around official CLI wrappers. The story of stepping back from the DIY OpenClaw direction and pivoting toward a blog pipeline that quickly drafts the daily flood of AI news and papers.

AI Agent Claude Code Codex OpenClaw Gemini tmux FastAPI Tailscale Custom Tool Experiment

Tech · Apr 18, 2026 · 7 min

Google Ships Android CLI v0.7 Preview with android create and skills for Agents

Google launched Android CLI v0.7 preview on April 16, 2026. Subcommands like android create/run/skills/docs, combined with Android Knowledge Base and Android Skills, let any agent (Gemini CLI, Claude Code, Codex) drive Android development through a standardized interface.

Android CLI AI Agents Google Gemini Claude Code Codex

Tech · Apr 15, 2026 (updated) · 11 min

Five layers of LLM safety filters: where abliterated and uncensored models actually intervene

An LLM safety stack has five layers (input filter, system prompt, RLHF, Constitutional AI, output filter), and each provider blocks at different layers. A breakdown of where abliterated vs uncensored models cut, and the default censorship level baked into local LLMs.

AI LLM Local LLM Security Gemini Claude Ollama

Tech · Apr 15, 2026 (updated) · 14 min

Google DeepMind Fabula: research prototype since May 2025, demoed again at CHI 2026, still no GA

Google DeepMind's AI writing tool Fabula was demoed at CHI 2026 by Piotr Mirowski. Co-designed with 42 professional writers using convergent iteration for story structure. But the timeline shows Fabula was first demoed in May 2025 and entered early access in September 2025 — still a research prototype with no general availability.

AI Google Gemini LLM

Tech · Apr 10, 2026 · 15 min

Reverse-Engineering Gemini's SynthID Watermark via Spectral Analysis: 90% Detection, 91% Removal

A research project reverse-engineered Google DeepMind's SynthID image watermark using FFT-based spectral analysis. The V3 bypass achieves 91% phase removal while maintaining SSIM 0.997. Is removing an invisible watermark copyright infringement? Analysis from DMCA, EU AI Act, and Japanese law perspectives.

AI Gemini SynthID Security Watermarking Signal Processing

Tech · Mar 24, 2026 · 4 min

GPT-5.4 Pro solved a Ramsey hypergraph problem in FrontierMath for the first time, and also pushed Brian-Larson asymptotics

GPT-5.4 Pro became the first model to solve a researcher-level open problem in FrontierMath, a benchmark managed by Epoch AI. Claude Opus 4.6 and Gemini 3.1 Pro later solved it as well.

GPT Claude Gemini FrontierMath AI MachineLearning

Tech · Mar 23, 2026 · 11 min

Kana Chat v2 Architecture Changes

Changes from v1 to v2 of Kana Chat, an AI agent built around official CLI wrappers. Covers dual-model router, Heartbeat memory, planner mode, image input, speech transcription, PWA push notifications, and the lessons learned from a month of daily use.

AI Agent Claude Code Codex OpenClaw Gemini tmux FastAPI Tailscale Custom Tool Experiment

Tech · Mar 19, 2026 · 5 min

Google engineers release “Sashiko”, an AI code review system for Linux kernel

An AI system that automatically reviews Linux kernel patches. Detected 53.6% of known bugs that passed human review. Rust implementation supports both Gemini and Claude.

Linux AI Code Review Google Gemini Claude

Tech · Feb 24, 2026 · 6 min

The Technical Architecture of Kana Chat

Design and implementation of Kana Chat, a personal AI agent system that wraps official CLIs. Covers the tmux bridge, context isolation, and tool approval gate that make it safe to run in your own environment.

AI Agent Claude Code Codex OpenClaw Gemini tmux FastAPI Custom Tool Experiment