#STT

2 articles

TechApr 30, 2026updated10 min

NII's 48,000-Hour Audio Dataset Is Raw Material for TTS

NII/LLMC released CC Audio and Archive.org Audio Dataset. URL lists, metadata, and a downloader covering 48,000+ hours of Japanese audio. What it actually contains and how it fits into TTS, ASR, and audio model training.

AI Voice AI Speech Synthesis Speech Recognition TTS STT LLM Machine Learning

TechJan 10, 20266 min

Building a Voice-Chat AI (1): Voice API Survey

Aiming for a characterful AI with an avatar and voice chat, I started by comparing voice APIs.

AI Speech Synthesis Speech Recognition TTS STT Gemini OpenAI ChatGPT VOICEVOX Google Cloud