In 2019, Mozilla’s Voice team developed a method to evaluate the quality of text-to-speech voices.A lot of the existing work answered the core question of “can you understand this voice?” But now that we’ve reached a stage of computerized voice quality where so many voices can pass the comprehension test with flying colours, what’s the next step?