A startup called Modulate Inc. wants to turn the world of conversational voice intelligence on its head after developing a novel artificial intelligence model architecture that it says far surpasses ...
MAI released models that can transcribe voice into text as well as generate audio and images after the group's formation six ...
Voice interfaces are entering a new phase. After years of limited adoption, the field is experiencing a resurgence, driven by real-time large language models, multimodal assistants, and a wave of new ...
Voice AI is moving faster than the tools we use to measure it. Every major AI lab — OpenAI, Google DeepMind, Anthropic, xAI — is racing to ship voice models capable of natural, real-time conversation.
Scale AI has released Voice Showdown, a platform for evaluating voice AI models, as it seeks to compete with rivals such as OpenAI, xAI, and Anthropic. Voice Showdown is the first global preference ...
For the past year, enterprise decision-makers have faced a rigid architectural trade-off in voice AI: adopt a "Native" speech-to-speech (S2S) model for speed and emotional fidelity, or stick with a ...
Robot creating audiowave. Cloning of human voices with the help of artifical intelligence concept, generative audio content concept. Vector illustration. Voice AI can sound impressive in a demo, but ...
Microsoft AI has made its in-house models for transcription, speech recognition, and image generation available on Foundry.
Some results have been hidden because they may be inaccessible to you
Show inaccessible results