Lokin is an open-source Python framework for building real-time voice and multimodal conversational agents. Orchestrate audio and video, AI services, web transport, and conversation pipelines.
Audio Capture: Your browser captures microphone audio and sends it via WebRTC
Share Screen : Inject your screen sharing enabling Lokin to give you direct feedback.
Voice Activity Detection: Silero VAD detects when you start and stop speaking
Speech Recognition: Deepgram converts your speech to text in real-time
Language Processing: OpenAI’s GPT model generates an intelligent response
Speech Synthesis: Cartesia converts the response text back to natural speech
Audio Playback: The generated audio streams back to your browser
-
Clone the Repository
git clone https://github.com/ZahrizhalAli/lokin.git cd lokin -
Instal uv
curl -LsSf https://astral.sh/uv/install.sh | shRefer to the uv install documentation.
-
Set up environment
cp env.example .env
-
Try the Sample App
Now you can test the local package with the sample app:
uv sync # Installs dependencies and the local package in editable mode uv run app.py
Then open http://localhost:7860 in your browser.
