#Voice Cloning

5 articles

TechJul 6, 20265 min

Irodori-TTS Japanese voice clone on a 4GB RTX 3050 Ti: 3.2s gen, FFmpeg DLL fix

Tested Irodori-TTS 500M-v3 on a 4GB RTX 3050 Ti Laptop (Windows): default-voice and zero-shot clone samples, real timings, and the FFmpeg DLL fix for MP3 references.

AI TTS Speech Synthesis Voice Cloning Local AI Experiment

TechMay 13, 202611 min

VoxCPM2 and OSS TTS in 2026: Irodori-TTS, F5-TTS, and Japanese fine-tune notes

VoxCPM2 sits in the tokenizer-free corner. Mapped vs F5-TTS, CosyVoice2, Irodori-TTS, Style-Bert-VITS2; plus why Japanese TTS still leans on OpenJTalk.

AI TTS Speech Synthesis Voice Cloning Local AI Open Source Fine-tuning

TechApr 28, 20266 min

Sarashina2.2-TTS Is a Japanese-First Zero-Shot Voice Synthesis Model

SB Intuitions released sarashina2.2-tts, an LLM-based TTS model focused on Japanese. It clones speaker voice and style from short reference audio without fine-tuning, and handles Japanese-English code-switching.

AI TTS Voice Synthesis LLM Voice Cloning

TechMar 17, 20264 min

LuxTTS - lightweight ZipVoice-based voice cloning that runs in 1 GB of VRAM

An open-source TTS model distilled from the ZipVoice architecture into four inference steps, delivering voice cloning with 1 GB of VRAM and 150x real-time speed. It also compares itself with the other TTS models covered on this blog.

AI TTS Speech Synthesis OSS Voice Cloning

TechFeb 14, 20266 min

MimikaStudio - a local TTS app that unifies multiple engines in one GUI

A local-first voice cloning, TTS, and audiobook app that brings Qwen3-TTS, Chatterbox, Kokoro, and IndexTTS-2 into a single GUI. It uses a FastAPI backend, Flutter UI, and an MCP server.

AI TTS Speech Synthesis Voice Cloning Flutter