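OpenAI recently released three new realtime voice models: GPT-Realtime-2 has reasoning close to GPT-5's, letting voice assistants think in real time during a conversation; GPT-Realtime-Translate supports real-time translation across more than 70 languages; and GPT-Realtime-Whisper provides streaming speech-to-text. Meanwhile, OpenAI hinted through a quote tweet that the long-awaited update to ChatGPT's voice feature is being actively prepared and will launch soon. This suggests ChatGPT is likely to get a new advanced voice mode in the near future, further improving its interaction experience and capabilities.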
OpenAI just dropped three new realtime voice models:
- GPT-Realtime-2 (with GPT-5-class reasoning for voice agents that can actually think mid-conversation),
- GPT-Realtime-Translate (live translation across 70+ input languages), and
- GPT-Realtime-Whisper (streaming speech-to-text as people talk).
However, their teaser probably refers to an upcoming new Voice Mode in ChatGPT (Advanced Voice Mode 2?).