OpenAI Is Developing a New AI Model for Real-Time, Natural-Sounding Audio Generation
The model is expected to launch by the end of March and is designed to produce more natural-sounding speech while supporting smoother, real-time conversational interactions.
OpenAI Group PBC is developing a new artificial intelligence model focused on audio generation, as the company looks to improve the quality and responsiveness of speech-based AI experiences, according to a report by The Information.
Sources cited in the report said the system will outperform OpenAI’s existing audio models in handling back-and-forth dialogue, a key requirement for voice-driven applications.
The new model will reportedly be built on a fresh architecture. OpenAI’s current real-time audio system, GPT-realtime, relies on the transformer architecture, though it remains unclear whether the company will move away from transformers entirely or introduce a redesigned implementation.
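The report does not describe GPT-realtime's internals, but transformer-based audio generation systems typically work by discretizing audio into codec tokens and predicting them autoregressively, one token at a time. The sketch below illustrates that general pattern in PyTorch; the model dimensions, vocabulary size, and random stand-in tokens are placeholders for illustration, not details of OpenAI's system.

```python
import torch
import torch.nn as nn

# Illustrative only: a minimal decoder-style transformer over discretized
# audio ("codec") tokens, the common pattern in transformer audio generation.
# All sizes below are arbitrary placeholders.
class TinyAudioLM(nn.Module):
    def __init__(self, vocab=1024, dim=256, heads=4, layers=4, ctx=512):
        super().__init__()
        self.embed = nn.Embedding(vocab, dim)   # audio-token embeddings
        self.pos = nn.Embedding(ctx, dim)       # learned positions
        block = nn.TransformerEncoderLayer(d_model=dim, nhead=heads,
                                           batch_first=True)
        self.transformer = nn.TransformerEncoder(block, num_layers=layers)
        self.head = nn.Linear(dim, vocab)       # next-token logits

    def forward(self, tokens):
        b, t = tokens.shape
        x = self.embed(tokens) + self.pos(torch.arange(t, device=tokens.device))
        # Causal mask: each position attends only to earlier audio tokens.
        mask = nn.Transformer.generate_square_subsequent_mask(t).to(tokens.device)
        return self.head(self.transformer(x, mask=mask))

# One autoregressive step: feed the tokens so far, sample the next one.
model = TinyAudioLM()
tokens = torch.randint(0, 1024, (1, 16))      # stand-in codec tokens
next_logits = model(tokens)[:, -1]            # logits for the next position
next_token = torch.multinomial(next_logits.softmax(-1), 1)
```

In practice, latency in such systems comes from generating and decoding tokens fast enough to stream audio back during a conversation, which is one reason architecture choices matter for real-time use.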
OpenAI has previously used different approaches for audio processing, including its Whisper model, which converts raw audio into log-mel spectrograms before analysis.
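That spectrogram step is visible in the open-source openai-whisper package (pip install openai-whisper). The snippet below is a minimal sketch using the package's documented helpers; "speech.wav" is a placeholder file path.

```python
import whisper

# Load and normalize audio: resampled to 16 kHz mono float32.
audio = whisper.load_audio("speech.wav")
audio = whisper.pad_or_trim(audio)            # fit Whisper's 30-second window

# The model analyzes this spectrogram, not the raw waveform.
mel = whisper.log_mel_spectrogram(audio)
print(mel.shape)                              # (80, 3000): mel bands x frames

# Decode the spectrogram into text with a small pretrained checkpoint.
model = whisper.load_model("base")
result = whisper.decode(model, mel.to(model.device), whisper.DecodingOptions())
print(result.text)
```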
Beyond speech, the model could extend to music generation, a rapidly growing market. The development is also linked to OpenAI’s broader push into consumer hardware, including plans for an audio-first personal device and other AI-powered products. OpenAI is expected to explore lightweight, on-device audio models to support these ambitions.