Google is enhancing Gemini's text-to-speech (TTS). On Tuesday at Google I/O 2025, the company previewed a new TTS feature, built on native audio output, that can "converse in more expressive ways." ...
Machine learning is bringing us closer to a Babel-fish-style universal translation device. Meta has released a new AI model that can translate speech from 101 different languages. It represents a step ...
On Tuesday, Meta announced SeamlessM4T, a multimodal AI model for speech and text translations. As a neural network that can process both text and audio, it can perform text-to-speech, speech-to-text, ...
Meta has created an AI language model that (in a refreshing change of pace) isn’t a ChatGPT clone. The company’s Massively Multilingual Speech (MMS) project can recognize over 4,000 spoken languages ...