Joycent: Diffusion-based Accent TTS without Accented Phone Prediction


This is a companion discussion topic for the original entry at https://arxiv.org/abs/2606.16417