Changes from v1 to v2 of Kana Chat, an AI agent built around official CLI wrappers. Covers the dual-model router, Heartbeat memory, planner mode, image input, speech transcription, PWA push notifications, and lessons learned from a month of daily use.
Design and implementation of Kana Chat, a personal AI agent system that wraps official CLIs. Covers the tmux bridge, context isolation, and tool approval gate that make it safe to run in your own environment.
The Web Speech API + Gemini + VOICEVOX setup is complete — an AI character you can actually have a voice conversation with. Key implementation notes and impressions.
Testing the image generation features of Flow, now available in Google AI Pro. Findings on the He/She pronoun issue, the effectiveness of natural English prompts, and how to choose between Flow and Gem.
After I reported that Nano Banana Pro wasn't available even with a Gemini Pro subscription, it finally rolled out. Here's what character-consistent image generation actually looks like in practice.
A record of designing prompts to make Gemini's image generation correctly draw a side ponytail, and building a Gem using a full 360° set of reference images.