Meta's Voicebox AI is a Dall-E for text-to-speech

GPT-4: Meta introduces Voicebox, a generative text-to-speech model capable of producing conversational audio clips in multiple languages. Trained on over 50,000 hours of unfiltered audio, Voicebox outperforms current text-to-speech systems in intelligibility and audio similarity while operating up to 20 times faster. Potential applications include prosthetics for vocal cord damage patients, in-game NPCs, and digital assistants. However, Meta has not released the app or source code to the public due to potential misuse risks.
Read more at Engadget…

Meta’s Voicebox AI is a Dall-E for text-to-speech | Engadget

Related

OpenAI Codex CLI: Executable AI Reasoning Hits Your Terminal

GPT-4.1, GPT-4.1 mini, and GPT-4.1 nano

DolphinGemma: Unveiling the Language of the Seas with AI

Grok 3 API Debuts with Scalable Models for Code, Data, and Enterprise Tasks

Smarter GitHub Automation with the MCP Server

China Unveils GPMI: A Single-Cable Standard for 8K Video and High Power

When Weather Apps Steal Your SSH Keys

Llama 4

Tame Your Terminal: Managing AI Coding Agents with Claude Squad