
Voicing Silent Speech

This repository contains code for synthesizing speech audio from silently mouthed words captured with electromyography (EMG). This is an updated fork supporting the TinyMyo EMG foundation model.

Quick Start

  1. Install Dependencies:

    uv sync
    

  2. Download & Prepare Data:

    # Download data from Zenodo
    python download_data.py
    # Build HDF5 for fast loading
    python build_hdf5.py
    

  3. Train Transduction (EMG to Audio):

    python transduction_model.py
    

  4. Train Recognition (EMG to Text):

    # Set up KenLM and the lexicon
    python get_lexicon.py
    # Start training
    python recognition_model.py
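
The data preparation and training steps above could also be chained from a single Python driver. The following is a minimal sketch, not part of the repository: the script names come from the steps above, while the stage groupings and the helpers `build_commands` and `run_all` are hypothetical names introduced for illustration. It assumes the scripts take no required arguments and that the driver runs with the interpreter from the environment created by `uv sync`.

```python
import subprocess
import sys

# Script names are taken from the Quick Start steps. Grouping them into
# stages is an illustrative assumption, not something the repository defines.
PREPARE = ["download_data.py", "build_hdf5.py"]
TRANSDUCTION = ["transduction_model.py"]
RECOGNITION = ["get_lexicon.py", "recognition_model.py"]


def build_commands(scripts, python=sys.executable):
    """Return one argv list per script, using the given interpreter."""
    return [[python, script] for script in scripts]


def run_all(commands, runner=subprocess.run):
    """Run each command in order, stopping at the first failure.

    check=True makes subprocess.run raise CalledProcessError on a
    non-zero exit code, so a failed download is not followed by training.
    """
    for cmd in commands:
        runner(cmd, check=True)
```

For example, `run_all(build_commands(PREPARE + TRANSDUCTION))` would download the data, build the HDF5 file, and then train the transduction model, in that order. The `runner` parameter exists only so the sequence can be dry-run or tested without launching the scripts.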
    

Sections