Speeding up agentic workflows with WebSockets in the Responses API
OpenAI optimizes agentic workflows using WebSockets in the Responses API: reduced model latency and API overhead through connection-scoped caching. Improvements documented on the Codex agent loop.