December 1, 2023
Unsupervised speech-to-speech translation from monolingual dataOctober 26, 2023
Spoken question answering and speech continuation using a spectrogram-powered LLMOctober 19, 2023
English learners can now practice speaking on SearchJune 22, 2023
SoundStorm: Efficient parallel audio generationJune 21, 2023
Responsible AI at Google Research: AI for Social GoodJune 7, 2023
Evaluating speech synthesis in many languages with SQuIdJune 2, 2023
AVFormer: Injecting vision into frozen speech models for zero-shot AV-ASRMarch 6, 2023
Universal Speech Model (USM): State-of-the-art speech AI for 100+ languagesDecember 14, 2022
Who said what? Recorder's on-device solution for labeling speakersSeptember 18, 2022
Google at Interspeech 2022June 30, 2022
Identifying Disfluencies in Natural SpeechApril 1, 2022
Introducing CVSS: A Massively Multilingual Speech-to-Speech Translation Corpus