LiveKit Agents is the framework for building real-time voice AI agents. It provides a pipeline for speech-to-text, LLM processing, and text-to-speech with sub-second latency — enabling natural conversational voice interactions.
LiveKit Agents enables building voice AI agents that hold natural, real-time conversations. The framework handles the entire pipeline — capturing audio, transcribing, processing with an LLM, and synthesizing speech — with latency low enough for natural flow.
from livekit.agents.voice_assistant import VoiceAssistant
from livekit.plugins import openai, silero
assistant = VoiceAssistant(
vad=silero.VAD.load(),
stt=openai.STT(),
llm=openai.LLM(),
tts=openai.TTS(),
)
User speaks: "What's the weather in San Francisco?"
Pipeline: STT → LLM (+ weather tool) → TTS → Audio response
Total latency: ~800ms
AI agents that work well with LiveKit Agents.