OpenAI upgrades its transcription and voice-generating AI models

OpenAI is bringing new transcription and voice-generating AI models to its API that the company claims improve upon its previous releases.

For OpenAI, the models fit into its broader “agentic” vision: building automated systems that can independently accomplish tasks on behalf of users. The definition of “agent” might be in dispute, but OpenAI Head of Product Olivier Godement described one interpretation as a chatbot that can speak with a business’s customers.

“We’re going to see more and more agents pop up in the coming months” Godement told TechCrunch during a briefing. “And so the general theme is helping customers and developers leverage agents that are useful, available, and accurate.”

OpenAI claims that its new text-to-speech model, “gpt-4o-mini-tts,” not only delivers more nuanced and realistic-sounding speech but is also more “steerable” than its previous-gen speech-synthesizing models. Developers can instruct gpt-4o-mini-tts on how to say things in natural language — for example, “speak like a mad scientist” or “use a serene voice, like a mindfulness teacher.”

Here’s a “true crime-style,” weathered voice:

OpenAI transcription results — The results from OpenAI transcription benchmarking.Image Credits:OpenAI

Source link

What's Hot

Investors trust Google more than Meta when comes to spending on AI

Paragon is not collaborating with Italian authorities probing spyware attacks, report says

Microsoft cuts OpenAI revenue share as their AI alliance loosens

OpenAI upgrades its transcription and voice-generating AI models

Paragon is not collaborating with Italian authorities probing spyware attacks, report says

Zoom teams up with World to verify humans in meetings

Hackers are abusing unpatched Windows security flaws to hack into organizations

‘Tokenmaxxing’ is making developers less productive than they think

Sources: Cursor in talks to raise $2B+ at $50B valuation as enterprise growth surges

Kevin Weil and Bill Peebles exit OpenAI as company continues to shed ‘side quests’

Investors trust Google more than Meta when comes to spending on AI

Google launches training and inference TPUs in latest shot at Nvidia

Meta tracks employee usage on Google, LinkedIn AI training project

Meta will cut 10% of workforce as company pushes deeper into AI

Malicious Chrome Extension Steal ChatGPT and DeepSeek Conversations from 900K Users

Top 10 Best Server Monitoring Tools

10 Best Cybersecurity Risk Management Tools

What's Hot

OpenAI upgrades its transcription and voice-generating AI models

Keep Reading

Subscribe to Updates