OpenAI rearchitected their WebRTC stack to improve real-time AI interactions by reducing latency and preserving standard WebRTC behavior for clients. They achieved this by introducing a relay service that routes packets to the correct transceiver, allowing for deterministic first-packet routing and a small public UDP footprint.