When building a WebRTC voice call, you can't pass a remote MediaStream directly to the SpeechRecognition API. Here are three workable approaches (remote-side recognition, server-side processing, and routing through AudioContext), plus iOS-specific implementation strategies.
The Web Speech API + Gemini + VOICEVOX setup is complete: an AI character you can actually hold a voice conversation with. Here are the key implementation notes and impressions.