The enterprise voice AI market is in the middle of a land grab. ElevenLabs and IBM announced a collaboration just this week to bring premium voice capabilities into IBM's watsonx Orchestrate platform.
In today’s digital world, audio content has become a crucial element of communication, learning, and entertainment. Podcasts, video narrations, online courses, and voice assistants all rely on voice ...
Google AI Edge Eloquent for iOS is now available in the App Store.
What if you could replicate any voice, yes, any voice—with just a few audio samples? In this overview, Sam Witteveen explores how the Qwen 3 TTS AI model has shattered barriers in voice cloning and ...
The Paris-based Mistral AI SAS today announced the release of Voxtral TTS, its first text-to-speech artificial intelligence model aimed at unseating the best-known and most powerful voice models on ...
Despite lots of hype, "voice AI" has so far largely been a euphemism for a request-response loop. You speak, a cloud server transcribes your words, a language model thinks, and a robotic voice reads ...
In a new paper titled Principled Coarse-Grained Acceptance for Speculative Decoding in Speech, Apple researchers detail an interesting approach to generating speech from text. While there are ...