feat: add Canary 1B v2 ONNX engine by intech · Pull Request #55 · cjpais/transcribe-rs

intech · 2026-03-12T21:26:58Z

Summary

Closes #54

Adds NVIDIA Canary 1B v2 as a new ONNX speech model in src/onnx/canary/
Supports 27 languages with transcription and translation to English
Three ONNX sessions: preprocessor (mel), encoder, decoder (autoregressive with KV-cache)
Follows all v0.3.0 patterns: SpeechModel trait, Quantization, shared session.rs utilities

What's included

File	Description
`src/onnx/canary/mod.rs`	`CanaryModel`, `CanaryParams`, `SpeechModel` impl, `CAPABILITIES`
`src/onnx/canary/decoder.rs`	Autoregressive KV-cache decode loop, greedy argmax
`src/onnx/canary/vocab.rs`	Vocabulary loading, 9-token prompt building, SentencePiece decoding
`src/onnx/mod.rs`	Added `pub mod canary`
`tests/canary.rs`	Integration tests (SpeechModel trait + transcribe_with)
`examples/canary.rs`	CLI example with timing, quantization, translation flags
`Cargo.toml`	Added test/example entries, no new dependencies

Key design decisions

Preprocessor always FP32: nemo128.onnx doesn't have quantized variants
Encoder/decoder respect Quantization: via resolve_model_path()
Translation mapping: TranscribeOptions.translate: true → target_language = "en"
No new feature flags: Canary compiles under existing onnx feature
Ported from standalone canary-engine crate used in Handy, re-architected for v0.3.0 API

Test plan

cargo check --features onnx — compiles
cargo test --features onnx --lib — 3 unit tests pass (argmax, vocab, decode_tokens)
cargo clippy --features onnx — no warnings in canary module
cargo fmt --check — formatted
Tested with real Canary 1B v2 model in local Handy build (daily use)

🤖 Generated with Claude Code

Port NVIDIA Canary 1B v2 speech model as a new ONNX engine, supporting 27 languages with transcription and translation capabilities. New files: - src/onnx/canary/mod.rs — CanaryModel, CanaryParams, SpeechModel impl - src/onnx/canary/decoder.rs — autoregressive KV-cache decode loop - src/onnx/canary/vocab.rs — vocabulary loading and prompt building - tests/canary.rs — integration tests - examples/canary.rs — CLI usage example Co-Authored-By: Claude Opus 4.6 <[email protected]>

cjpais · 2026-03-13T08:30:29Z

I tested this and it works. Going to review it a bit further and pull in. Thank you.

cjpais · 2026-03-13T13:18:28Z

Thank you. I tested this and added some code and cleaned some things up a bit, but I think it's good to go.

This was referenced Mar 12, 2026

Migrate to transcribe-rs v0.3.0 and remove local canary-engine crate cjpais/Handy#1022

Closed

Migrate to transcribe-rs-0.3.1 and add Canary support cjpais/Handy#1023

Merged

cjpais added 2 commits March 13, 2026 20:22

add itn to canary, also .wav files and tests

e264d02

minor

74744d7

cjpais merged commit 4704c0e into cjpais:main Mar 13, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: add Canary 1B v2 ONNX engine#55

feat: add Canary 1B v2 ONNX engine#55
cjpais merged 3 commits intocjpais:mainfrom
intech:feat/canary-engine

intech commented Mar 12, 2026

Uh oh!

cjpais commented Mar 13, 2026

Uh oh!

cjpais commented Mar 13, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

intech commented Mar 12, 2026

Summary

What's included

Key design decisions

Test plan

Uh oh!

cjpais commented Mar 13, 2026

Uh oh!

cjpais commented Mar 13, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants