TechApr 30, 2026updated10 minNII's 48,000-Hour Audio Dataset Is Raw Material for TTSNII/LLMC released CC Audio and Archive.org Audio Dataset. URL lists, metadata, and a downloader covering 48,000+ hours of Japanese audio. What it actually contains and how it fits into TTS, ASR, and audio model training.AIVoice AISpeech SynthesisSpeech RecognitionTTSSTTLLMMachine Learning
TechJan 10, 20266 minBuilding a Voice-Chat AI (1): Voice API SurveyAiming for a characterful AI with an avatar and voice chat, I started by comparing voice APIs.AISpeech SynthesisSpeech RecognitionTTSSTTGeminiOpenAIChatGPTVOICEVOXGoogle Cloud