Tech Apr 28, 2026 6 min Sarashina2.2-TTS Is a Japanese-First Zero-Shot Voice Synthesis Model SB Intuitions released sarashina2.2-tts, an LLM-based TTS model focused on Japanese. It clones speaker voice and style from short reference audio without fine-tuning, and handles Japanese-English code-switching. AI TTS Voice Synthesis LLM Voice Cloning