feat: add GigaAM v3 for Russian speech recognition by pantafive · Pull Request #913 · cjpais/Handy

pantafive · 2026-02-28T00:57:53Z

Adds GigaAM v3 e2e_ctc engine — Russian speech recognition with punctuation, Latin characters and digits. Uses int8 quantized ONNX model (225 MB), BPE tokenizer with 257 subword tokens.

The model is currently downloaded from HuggingFace (istupakov/gigaam-v3-onnx). It needs to be mirrored to blob.handy.computer to be consistent with other models.

cjpais · 2026-02-28T05:42:46Z

Can you add gigaam to transcribe rs first and then I will pull that in

pantafive · 2026-02-28T12:49:34Z

Added GigaAM as a proper engine in transcribe-rs: cjpais/transcribe-rs#45

Once that's merged and published, I'll update this PR to use the crate feature instead of the standalone module.

cjpais · 2026-03-01T04:09:42Z

Thank you so much @pantafive, amazing to see how much support we have for this in just one day! I've released transcribe-rs 0.2.7 with support :) I also have uploaded to handy blob site at https://blob.handy.computer/giga-am-v3.int8.onnx

Only comment I have is maybe in the description section just making sure it's very clear it's for Russian. Also if you have opinion on testing, would be helpful to have your opinion there too. Is it the best for Russian speech you've tested?

pantafive · 2026-03-01T10:03:14Z

Thanks for the release and the CDN upload!

Updated the PR — now uses transcribe-rs 0.2.7 with the gigaam feature, the standalone module is removed. Model URL points to blob.handy.computer. Tested locally, everything works.

Regarding the description — could you clarify which description you'd like updated? The model description in the app already says "Russian speech recognition", but happy to adjust wherever you think it needs to be clearer.

As for testing — GigaAM v3 is the best Russian speech model I've tested. It outperforms Whisper-large-v3 on Russian benchmarks (9.2% vs 25.1% avg WER) and handles punctuation natively.

Add GigaAM v3 e2e_ctc as a new transcription engine using transcribe-rs 0.2.7 gigaam feature. Russian speech recognition with punctuation, Latin characters and digit support. Co-Authored-By: Claude Opus 4.6 <[email protected]>

cjpais · 2026-03-01T10:17:24Z

Thank you! Mainly I was thinking something a bit stronger for the description, like "Best model for Russian speakers or similar"

pantafive · 2026-03-01T10:29:34Z

I think "best" might be risky in a description — it's subjective, and things move fast in this space, so it could become misleading quickly. "Russian speech recognition. Fast and accurate." states what it does without overpromising. But it's your project — happy to go with whatever you think works best!

eboyko · 2026-03-01T11:10:15Z

"Best model for Russian speakers. Great for bilingual Russian/English use — especially developers mixing both languages."

cjpais · 2026-03-01T11:28:37Z

I'm good with whatever and will defer to you since I don't speak Russian haha. Things do sure move fast

Co-Authored-By: Claude Opus 4.6 <[email protected]>

blob website.

gtubolcev · 2026-03-01T18:11:26Z

Not a streaming model or am I wrong?
Great work anyway, thank you!
It would be great to have some real streaming model. Like the last vosk 0.54 https://huggingface.co/alphacep/vosk-model-ru

* feat: add GigaAM v3 model for Russian speech recognition Add GigaAM v3 e2e_ctc as a new transcription engine using transcribe-rs 0.2.7 gigaam feature. Russian speech recognition with punctuation, Latin characters and digit support. Co-Authored-By: Claude Opus 4.6 <[email protected]> * fix: cargo fmt formatting Co-Authored-By: Claude Opus 4.6 <[email protected]> * Keep the file name of the model download the same as the file on the blob website. --------- Co-authored-by: Claude Opus 4.6 <[email protected]> Co-authored-by: CJ Pais <[email protected]> (cherry picked from commit ff86122) # Conflicts: # src-tauri/Cargo.lock # src-tauri/Cargo.toml # src/bindings.ts # src/i18n/locales/ar/translation.json # src/i18n/locales/cs/translation.json # src/i18n/locales/de/translation.json # src/i18n/locales/es/translation.json # src/i18n/locales/fr/translation.json # src/i18n/locales/it/translation.json # src/i18n/locales/ja/translation.json # src/i18n/locales/ko/translation.json # src/i18n/locales/pl/translation.json # src/i18n/locales/pt/translation.json # src/i18n/locales/ru/translation.json # src/i18n/locales/tr/translation.json # src/i18n/locales/uk/translation.json # src/i18n/locales/vi/translation.json # src/i18n/locales/zh-TW/translation.json # src/i18n/locales/zh/translation.json

pantafive mentioned this pull request Feb 28, 2026

feat: add GigaAM v3 engine for Russian speech recognition cjpais/transcribe-rs#45

Merged

pantafive force-pushed the feat/gigaam branch from 05e4f38 to 561eab9 Compare March 1, 2026 09:49

feat: add GigaAM v3 model for Russian speech recognition

4a52e2b

Add GigaAM v3 e2e_ctc as a new transcription engine using transcribe-rs 0.2.7 gigaam feature. Russian speech recognition with punctuation, Latin characters and digit support. Co-Authored-By: Claude Opus 4.6 <[email protected]>

pantafive force-pushed the feat/gigaam branch from 561eab9 to 4a52e2b Compare March 1, 2026 10:12

pantafive and others added 2 commits March 1, 2026 13:47

fix: cargo fmt formatting

adc5e7a

Co-Authored-By: Claude Opus 4.6 <[email protected]>

Keep the file name of the model download the same as the file on the

e8429a5

blob website.

cjpais merged commit ff86122 into cjpais:main Mar 1, 2026

zsky01 mentioned this pull request Mar 17, 2026

[BUG] GigaAM v3 model fails to load on Windows - downloaded as a flat file instead of a directory #1075

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat: add GigaAM v3 for Russian speech recognition#913

feat: add GigaAM v3 for Russian speech recognition#913
cjpais merged 3 commits intocjpais:mainfrom
pantafive:feat/gigaam

pantafive commented Feb 28, 2026

Uh oh!

cjpais commented Feb 28, 2026

Uh oh!

pantafive commented Feb 28, 2026

Uh oh!

cjpais commented Mar 1, 2026 •

edited

Loading

Uh oh!

pantafive commented Mar 1, 2026

Uh oh!

cjpais commented Mar 1, 2026

Uh oh!

pantafive commented Mar 1, 2026

Uh oh!

eboyko commented Mar 1, 2026

Uh oh!

cjpais commented Mar 1, 2026

Uh oh!

gtubolcev commented Mar 1, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Uh oh!

Conversation

pantafive commented Feb 28, 2026

Uh oh!

cjpais commented Feb 28, 2026

Uh oh!

pantafive commented Feb 28, 2026

Uh oh!

cjpais commented Mar 1, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pantafive commented Mar 1, 2026

Uh oh!

cjpais commented Mar 1, 2026

Uh oh!

pantafive commented Mar 1, 2026

Uh oh!

eboyko commented Mar 1, 2026

Uh oh!

cjpais commented Mar 1, 2026

Uh oh!

gtubolcev commented Mar 1, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

cjpais commented Mar 1, 2026 •

edited

Loading