ZahrizhalAli/lokin

🎙️ Lokin: Real-Time AI Interviewer

Lokin is an open-source Python framework for building real-time voice and multimodal conversational agents. It orchestrates audio and video, AI services, web transport, and conversation pipelines.

Real-Time Pipeline

Audio Capture: Your browser captures microphone audio and sends it via WebRTC

Screen Sharing: Share your screen so that Lokin can give you direct feedback on what it sees

Voice Activity Detection: Silero VAD detects when you start and stop speaking

Speech Recognition: Deepgram converts your speech to text in real-time

Language Processing: OpenAI’s GPT model generates an intelligent response

Speech Synthesis: Cartesia converts the response text back to natural speech

Audio Playback: The generated audio streams back to your browser
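
The order of the stages above can be illustrated with a small, self-contained sketch. This is not Lokin's API: `Frame`, `make_stage`, `PIPELINE`, and `run_pipeline` are hypothetical names, and the stages are stubs that only relabel their input instead of calling Silero VAD, Deepgram, OpenAI, or Cartesia. Screen sharing and WebRTC transport are left out. The sketch only shows how audio becomes text and then audio again as it passes through the chain.

```python
from dataclasses import dataclass, field
from typing import Callable, List


@dataclass
class Frame:
    """A unit of data moving through the pipeline (hypothetical type)."""
    kind: str                      # "audio" or "text"
    payload: str                   # placeholder for real bytes or text
    trace: List[str] = field(default_factory=list)  # stages visited so far


Stage = Callable[[Frame], Frame]


def make_stage(name: str, expects: str, produces: str) -> Stage:
    """Build a stub stage that checks its input kind and emits the output kind."""
    def stage(frame: Frame) -> Frame:
        if frame.kind != expects:
            raise ValueError(f"{name} expected {expects!r}, got {frame.kind!r}")
        return Frame(produces, f"{name}({frame.payload})", frame.trace + [name])
    return stage


# Stub stages in the order the README lists them. Real stages would call the
# external services; these only wrap the payload so the flow is visible.
PIPELINE: List[Stage] = [
    make_stage("vad", "audio", "audio"),   # voice activity detection
    make_stage("stt", "audio", "text"),    # speech recognition
    make_stage("llm", "text", "text"),     # language processing
    make_stage("tts", "text", "audio"),    # speech synthesis
]


def run_pipeline(frame: Frame, stages: List[Stage] = PIPELINE) -> Frame:
    """Pass one frame through every stage in order."""
    for stage in stages:
        frame = stage(frame)
    return frame


if __name__ == "__main__":
    result = run_pipeline(Frame("audio", "mic"))
    print(result.trace)    # ['vad', 'stt', 'llm', 'tts']
    print(result.payload)  # tts(llm(stt(vad(mic))))
```

Each stub rejects input of the wrong kind, which mirrors the constraint that speech synthesis can only run after the language model has produced text.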

🔧 Getting Started

  1. Clone the Repository

    git clone https://github.com/ZahrizhalAli/lokin.git
    cd lokin
  2. Install uv

    curl -LsSf https://astral.sh/uv/install.sh | sh

    For other installation methods, refer to the uv installation documentation.

  3. Set up environment

    cp env.example .env
  4. Try the Sample App

    Now you can test the local package with the sample app:

    uv sync  # Installs dependencies and the local package in editable mode
    uv run app.py

Then open http://localhost:7860 in your browser.
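
If the app fails to start, one thing worth checking is whether every setting named in `env.example` was given a value in `.env` after step 3. The sketch below is a hypothetical helper, not part of Lokin, and it assumes both files use plain `KEY=VALUE` lines with `#` comments. It reads the key names from `env.example` itself, so it makes no assumption about which keys the project expects.

```python
from pathlib import Path
from typing import Dict, List


def parse_env(text: str) -> Dict[str, str]:
    """Parse simple KEY=VALUE lines, skipping blanks and # comments."""
    values: Dict[str, str] = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        # Drop surrounding quotes so KEY="abc" and KEY=abc compare equal.
        values[key.strip()] = value.strip().strip('"').strip("'")
    return values


def missing_keys(example_text: str, env_text: str) -> List[str]:
    """Return keys named in the example file that are absent or empty in .env."""
    example = parse_env(example_text)
    actual = parse_env(env_text)
    return [key for key in example if not actual.get(key)]


if __name__ == "__main__":
    example_path, env_path = Path("env.example"), Path(".env")
    if example_path.exists() and env_path.exists():
        missing = missing_keys(example_path.read_text(), env_path.read_text())
        print("Missing or empty keys:", missing or "none")
    else:
        print("Run this from the repository root after step 3.")
```

Saved under a name of your choice in the repository root, the script can be run with `uv run` in the same way as `app.py`.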
