VoxCPM2 sits in the tokenizer-free corner. Mapped vs F5-TTS, CosyVoice2, Irodori-TTS, Style-Bert-VITS2; plus why Japanese TTS still leans on OpenJTalk.
fspecii/ace-step-ui wraps ACE-Step 1.5's Gradio API in a React/Express/SQLite app with a library, player, editing, and stem separation. On Mac, the MLX+MPS split brings memory and LoRA constraints.
A look at ACE-Step, the 'Stable Diffusion of music,' covering its architecture, features, installation, and expected performance on Apple Silicon before trying it on an M1 Max.