Voice Synthesis articles | lilting channel

Tech | Apr 28, 2026 | 6 min

Sarashina2.2-TTS Is a Japanese-First Zero-Shot Voice Synthesis Model

SB Intuitions released sarashina2.2-tts, an LLM-based TTS model focused on Japanese. It clones a speaker's voice and style from a short reference audio clip without fine-tuning, and it handles Japanese-English code-switching.

AI, TTS, Voice Synthesis, LLM, Voice Cloning
