Voicing Silent Speech¶

This repository contains code for synthesizing speech audio from silently mouthed words captured with electromyography (EMG). This is an updated fork supporting the TinyMyo EMG foundation model.

Quick Start¶

Install Dependencies:
```
uv sync
```

Download & Prepare Data:

# Download data from Zenodo
python download_data.py
# Build HDF5 for fast loading
python build_hdf5.py

Train Transduction (EMG to Audio):
```
python transduction_model.py
```

Train Recognition (EMG to Text):

# Setup KenLM and Lexicon
python get_lexicon.py
# Start training
python recognition_model.py

Sections¶

Silent Speech Synthesis

Transduction and automated speech recognition models for silent speech synthesis. From EMG to audio features (MFCCs). Open Silent Speech Synthesis docs
Silent Speech Recognition

Transcription and decoding of silent speech signals directly to text using CTC. Open Silent Speech Recognition docs