Voice Agents in 2026 — STT, LLM, TTS, and Latency That Doesn't Hurt

Practical voice agent architecture: streaming Deepgram/AssemblyAI → LLM → ElevenLabs/OpenAI TTS, latency budgeting, barge-in, and patterns from production calls.

May 5, 2026 · 4 min · 845 words · Manvendra Rajpoot

Design a Voice Chat System Like Discord — System Design Walkthrough

End-to-end design for voice chat: WebRTC, SFU vs MCU, signaling, presence, room state, and the operational realities of running voice at scale.

May 1, 2026 · 4 min · 656 words · Manvendra Rajpoot

Voice Agents and Realtime LLM APIs in 2026 — How They Actually Work

A practical look at building voice agents in 2026. Realtime LLM APIs (OpenAI Realtime, Anthropic, Gemini Live), end-to-end latency, ASR and TTS, interruption handling, and the production patterns from real deployments.

April 30, 2026 · 6 min · 1265 words · Manvendra Rajpoot