This feature is currently in public preview. This preview is provided without a service-level agreement, and we don’t recommend it for production workloads. Certain features might not be supported or might have constrained capabilities. For more information, see Supplemental Terms of Use for Microsoft Azure Previews.
What’s new in GPT Realtime 2
- Reasoning support with an adjustable reasoning.effort control.
- Response phases that distinguish preambles (“commentary”) from the final answer (“final_answer”).
- Longer context window (256,000 tokens).
Key concepts
Reasoning effort
Control reasoning intensity with the reasoning.effort session parameter. Valid values are minimal, low, medium, and high.
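As a rough illustration of setting this parameter, the sketch below builds a session update payload in Python. The `session.update` event type and the nesting of `effort` under `session.reasoning` are assumptions inferred from the parameter name and may not match the actual wire format. The helper `build_session_update` is hypothetical, not part of any SDK.

```python
import json

# Values listed in this article for reasoning.effort.
VALID_EFFORTS = ("minimal", "low", "medium", "high")


def build_session_update(effort: str) -> str:
    """Return a JSON string for a session update that sets reasoning effort.

    Assumption: the event type is "session.update" and the effort value
    lives at session.reasoning.effort. Check the API reference before use.
    """
    if effort not in VALID_EFFORTS:
        raise ValueError(
            f"Unsupported reasoning effort {effort!r}; expected one of {VALID_EFFORTS}"
        )
    event = {
        "type": "session.update",
        "session": {"reasoning": {"effort": effort}},
    }
    return json.dumps(event)


if __name__ == "__main__":
    # The resulting string would be sent over an already-open realtime connection.
    print(build_session_update("low"))
```

Validating the value on the client side surfaces a typo immediately instead of waiting for an error from the service.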
Preambles and response phases
Realtime responses can include multiple output items per turn. Each item has a phase that indicates its role:
| Phase | Description |
|---|---|
| commentary | A promptable preamble, often used before longer reasoning. |
| final_answer | The final answer after the model completes reasoning. |
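One way a client might use the phase is to separate preamble items from the final answer, for example to show commentary as a status message while waiting. The following Python sketch assumes each output item is a dictionary with a `phase` key. The `text` key and the helper `split_by_phase` are hypothetical and only serve the illustration; the real item shape may differ.

```python
from typing import Any

# Phase names described in the table above.
KNOWN_PHASES = ("commentary", "final_answer")


def split_by_phase(output_items: list[dict[str, Any]]) -> dict[str, list[dict[str, Any]]]:
    """Group output items by their phase, preserving the original order.

    Items with a missing or unrecognized phase are skipped.
    """
    grouped: dict[str, list[dict[str, Any]]] = {phase: [] for phase in KNOWN_PHASES}
    for item in output_items:
        phase = item.get("phase")
        if phase in grouped:
            grouped[phase].append(item)
    return grouped


if __name__ == "__main__":
    # Hypothetical items for a single turn; real payloads carry more fields.
    items = [
        {"phase": "commentary", "text": "Let me check that for you."},
        {"phase": "final_answer", "text": "Your order ships tomorrow."},
    ]
    grouped = split_by_phase(items)
    for item in grouped["commentary"]:
        print("preamble:", item["text"])
    for item in grouped["final_answer"]:
        print("answer:", item["text"])
```

Skipping unrecognized phases keeps the sketch tolerant of phase values that this article does not list.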