Key capabilities
- Continuous stream processing: Translates live audio without segmenting or buffering, producing output that tracks the cadence of the original speech.
- Speech and text output: Produces both translated speech (audio) and a translated transcript in the target language.
- Low-latency translation: Keeps pace with real-time conversation, minimizing the delay between the original speech and the translated output.
When to use GPT Realtime Translate
Use GPT Realtime Translate for:
- Live streaming events, conferences, and broadcasts requiring real-time multilingual output.
- Cross-language customer support calls.
- Multilingual voice interfaces and applications.
- Live media localization.
- International real-time meetings and collaboration.
Example use cases
- Live multilingual events: Translate conference talks, webinars, or broadcasts in real time so audiences can listen in their preferred language. Pair with GPT Realtime Whisper to simultaneously provide source-language captions.
- Global customer support: Route inbound calls through GPT Realtime Translate to bridge language gaps between customers and agents. The translated transcript gives agents a written record in their language for follow-up.
- International voice assistants: Build once and deploy across languages. GPT Realtime Translate enables multilingual voice interactions without requiring per-language model deployments.