Add general stdin/stdout command filter for transcription post-processing by NightMachinery · Pull Request #739 · cjpais/Handy

NightMachinery · 2026-02-08T16:29:34Z

Before Submitting This PR

Please confirm you have done the following:

I have searched existing issues and pull requests (including closed ones) to ensure this isn't a duplicate
I have read CONTRIBUTING.md

Human Written Description

This PR lets you set any program as the post-processor. It feeds the transcribed text to the program's stdin and uses whatever the program writes to stdout as the final output.

The advantage is that this allows any kind of post-processing. The disadvantage is that users must write the post-processing logic entirely themselves.

I use it to wrap the transcribed text when interacting with LLM agents such as Codex:

```speech-to-text
...dictated text...
```

My post-processor even detects which app is in focus and adapts its post-processing logic accordingly.

The code was written entirely by the Codex App 5.3 Extra-High, but I drove the UX design choices. I haven't looked at the code diff yet. I am not familiar with Rust and the rest of the stack. I did manually test the feature, and it works.

Related Issues/Discussions

Fixes #
Discussion:

Community Feedback

Testing

I manually tested it.

Screenshots/Videos (if applicable)

AI Assistance

No AI was used in this PR
AI was used (please describe below)

If AI was used:

Tools used: Codex App 5.3 Extra-High
How extensively: All code changes

The rest of the PR is written by Codex:

Summary

This PR adds a general local command filter pipeline to Handy so users can run any executable that:

reads transcription text from stdin
writes transformed text to stdout

The filter is configurable from the Post Process page and can be applied to normal hotkey, post-process hotkey, or both.

Why this is valuable

This makes Handy a composable STT tool, not just a fixed pipeline:

integrate with existing scripts/tools you already trust
keep transformations local and fast
support highly custom workflows without shipping app-specific logic for each one

In practice, this unlocks lightweight automation that sits between raw dictation and paste.

Real example (from my usage)

I use a wrapper filter script that:

reads stdin from Handy
checks current app/window focus
conditionally wraps dictated text in a fenced block for LLM tools

When I’m focused in ChatGPT/Gemini/Codex contexts, it outputs:

```speech-to-text
...dictated text...
```

Otherwise, it passes text through unchanged.

That lets downstream instructions treat STT input differently (e.g. typo-aware cleanup) while keeping normal app usage unaffected.
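
A minimal sketch of such a wrapper filter, in Python. This is not the author's script: the `LLM_APPS` set, the `wrap_for_app` helper, and the convention of passing the focused app name as the first argument are all assumptions made for illustration, since real focus detection is platform-specific and not described in this PR.

```python
import sys

# Built programmatically so the literal fence does not collide with this code block.
FENCE = "`" * 3

# Hypothetical set of app names that should receive fenced output.
LLM_APPS = {"ChatGPT", "Gemini", "Codex"}


def wrap_for_app(text, focused_app):
    """Wrap dictated text in a speech-to-text fence for LLM apps; pass it through otherwise."""
    if focused_app in LLM_APPS:
        return f"{FENCE}speech-to-text\n{text.strip()}\n{FENCE}\n"
    return text


def main(stdin=None, stdout=None, argv=None):
    """Read the transcription from stdin and write the (possibly wrapped) text to stdout."""
    stdin = sys.stdin if stdin is None else stdin
    stdout = sys.stdout if stdout is None else stdout
    argv = sys.argv if argv is None else argv
    # Assumption: the focused app name arrives as the first argument.
    focused_app = argv[1] if len(argv) > 1 else ""
    stdout.write(wrap_for_app(stdin.read(), focused_app))

# To use this as a real filter, call main() at the bottom of the script.
```

With the filter's executable set to a Python interpreter and its args pointing at such a script, any text dictated outside the listed apps would come back unchanged.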

What changed

Backend

Added new settings fields:
- command_filter_enabled
- command_filter_scope (transcribe | post_process | both)
- command_filter_order (before_llm | after_llm)
- command_filter_executable
- command_filter_args
- command_filter_timeout_ms
Added helper logic for when secondary shortcut registration is needed.
Added src-tauri/src/command_filter.rs:
- executes executable + args directly (no shell string execution)
- writes exact transcription bytes to stdin
- captures stdout/stderr
- enforces timeout
- returns applied / failed / empty-cancel states
- expands ~ / ~/... in executable and args to home dir before execution
Integrated filter in actions.rs with configurable order relative to LLM.
Trimmed-empty stdout cancels paste while preserving original transcription in history.
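
The behavior listed for command_filter.rs can be sketched in Python as follows. This is an illustration under stated assumptions, not a port of the Rust code: `FilterResult` and `run_command_filter` are hypothetical names, and treating a non-zero exit code or non-UTF-8 output as a failure is an assumption the PR text does not confirm.

```python
import os
import subprocess
from dataclasses import dataclass


@dataclass
class FilterResult:
    state: str  # "applied", "failed", or "empty_cancel"
    text: str   # text to paste; "" when the paste is cancelled


def run_command_filter(text, executable, args, timeout_ms):
    """Run executable + args directly (no shell), feeding text on stdin."""
    # Expand a leading ~ in the executable and in each argument.
    argv = [os.path.expanduser(part) for part in [executable, *args]]
    try:
        proc = subprocess.run(
            argv,
            input=text.encode("utf-8"),  # exact transcription bytes
            capture_output=True,         # capture stdout and stderr
            timeout=timeout_ms / 1000,   # subprocess.run kills the child on timeout
        )
    except (OSError, subprocess.TimeoutExpired):
        # Missing executable, permission error, or timeout: keep the input text.
        return FilterResult("failed", text)
    if proc.returncode != 0:
        # Assumption: a non-zero exit status counts as a failure.
        return FilterResult("failed", text)
    try:
        output = proc.stdout.decode("utf-8")
    except UnicodeDecodeError:
        return FilterResult("failed", text)
    if not output.strip():
        # Trimmed-empty stdout cancels the paste.
        return FilterResult("empty_cancel", "")
    return FilterResult("applied", output)
```

Passing the arguments as a list rather than a shell string means characters such as `;` or `$` in an argument are never interpreted by a shell.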

Shortcut registration

transcribe_with_post_process now registers if either:

AI post-processing is enabled, or
command filter is enabled and scope includes post_process.
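
Expressed as a predicate, the registration rule above might look like this sketch. The function name is hypothetical, and reading "scope includes post_process" as `post_process` or `both` is an assumption.

```python
def needs_post_process_shortcut(ai_enabled, filter_enabled, filter_scope):
    """Return True when the transcribe_with_post_process shortcut should be registered."""
    # Assumption: "scope includes post_process" means post_process or both.
    scope_covers_post_process = filter_scope in ("post_process", "both")
    return ai_enabled or (filter_enabled and scope_covers_post_process)
```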

Commands + bindings

Added Tauri commands:

change_command_filter_enabled_setting
change_command_filter_scope_setting
change_command_filter_order_setting
change_command_filter_executable_setting
change_command_filter_args_setting
change_command_filter_timeout_setting

Wired in lib.rs and updated src/bindings.ts.

Frontend/UI

Post Process sidebar section is always visible.
Removed old Post Processing toggle from Advanced > Experimental.
Added Post Process > Modes controls:
- AI Post-Processing toggle
- Command Filter toggle
Added Post Process > Command Filter controls:
- scope
- order
- executable
- args (one per line)
- timeout (ms)
Updated settings store mapping for new settings.

i18n

Added settings.postProcessing.modes.*
Added settings.postProcessing.commandFilter.*
Added keys across all locale files for consistency.
Updated secondary hotkey description text.

Documentation

Added: docs/PRs/general_postprocess.md

Behavior details

Filter failures: fall back to the text as it was before the filter ran.
Trimmed-empty stdout: cancel paste.
History on empty-cancel: still stores original transcription.
Chinese conversion remains before filter stage(s).
LLM stage still only runs when AI post-processing is enabled and secondary hotkey is used.
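
One way to read the stage ordering is sketched below. The `process` function, the stage callables, and the `ai_post_processing_enabled` key are hypothetical, and the sketch assumes the filter runs once, positioned by `command_filter_order`; empty-cancel handling is left out for brevity.

```python
def process(text, hotkey, settings, convert_chinese, run_filter, run_llm):
    """Apply the stages in order; hotkey is "transcribe" or "post_process"."""
    # Chinese conversion always runs first.
    text = convert_chinese(text)

    filter_on = (
        settings["command_filter_enabled"]
        and settings["command_filter_scope"] in (hotkey, "both")
    )
    # The LLM stage needs AI post-processing enabled and the secondary hotkey.
    llm_on = settings["ai_post_processing_enabled"] and hotkey == "post_process"

    stages = [run_llm] if llm_on else []
    if filter_on:
        if settings["command_filter_order"] == "before_llm":
            stages.insert(0, run_filter)
        else:
            stages.append(run_filter)

    for stage in stages:
        text = stage(text)
    return text
```

When the LLM stage is off, `before_llm` and `after_llm` give the same result, because the filter is the only remaining stage.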

Validation

All passed:

cargo check -q
cargo test -q
bun run lint
bun run check:translations
bun run build
bun run format:check

VirenMohindra · 2026-02-08T16:36:52Z

hey @NightMachinery, appreciate the detailed writeup. the summary is thorough and it's clear a lot of work went into this. however, we'd need this to follow the PR template in .github/PULL_REQUEST_TEMPLATE.md before we can review. the project has a lot of open PRs and inflight work right now so following the template helps us triage consistently

specifically what's missing:

the checklist confirmations (searched existing issues/PRs, read CONTRIBUTING.md)
a human written description section — a few sentences in your own words about what problem you noticed and why this matters (separate from the technical summary)
related issues / discussions
the AI assistance disclosure checkbox
screenshots (if needed)

the technical summary you have is great and can stay, just needs the template sections wrapped around it

NightMachinery · 2026-02-08T18:26:25Z

@VirenMohindra Hi! Thanks for the heads up. I added the template before the previous text.

cjpais · 2026-02-09T00:02:40Z

Can you provide screenshots?

This is a large change, I would generally prefer community support for something like this before a PR is submitted.

NightMachinery · 2026-02-09T05:03:21Z

> Can you provide screenshots?
>
> This is a large change, I would generally prefer community support for something like this before a PR is submitted.

The only visible changes are these additions to the post-processing panel. This panel is now always shown.

cjpais · 2026-02-09T06:35:57Z

Uhhhhhh okay, this is not going to be merged anytime soon. Mainly it's far too advanced and specific to be generally distributed I feel. You need to collect community support if you want this

Maybe in v2 we will have something like this but it will likely be a more agentic flow

danlamanna · 2026-02-27T05:03:21Z

FWIW, I would be in favor of a feature like this. One use case I can think of is making the output more casual in an instant messaging context. This is trivial and instantaneous for a scripting language but slower and more brittle for LLM post processing.

Add support for transcription hook - an executable script in app's data directory.

If `transcription_hook` file exists, Handy runs it passing transcription text via stdin and uses script stdout as a transcription result.

This approach is a flexible extension point for advanced users (which nowadays means with access to coding LLM) akin to git hooks.

Here are some possible scenarios:

* simple transcription modifications
* a pipeline involving LLM processing, language detection and translation
* custom paste method (as Handy does nothing if transcription is empty)
* conditional processing based on the active application waiting for the input

See related:

* cjpais#168
* cjpais#739
* cjpais#638
* cjpais#455

Add support for transcription hook - an executable script in app's data directory.

If `hooks/transcription` file exists, Handy runs it passing transcription text via stdin and uses script stdout as a transcription result.

This approach is a flexible extension point for advanced users (which nowadays means with access to coding LLM) akin to git hooks.

Here are some possible scenarios:

* simple transcription modifications
* a pipeline involving LLM processing, language detection and translation
* custom paste method (as Handy does nothing if transcription is empty)
* conditional processing based on the active application waiting for the input

See related:

* cjpais#168
* cjpais#162
* cjpais#916
* cjpais#911
* cjpais#834
* cjpais#847
* cjpais#833
* cjpais#662
* cjpais#601
* cjpais#335
* cjpais#739
* cjpais#638
* cjpais#455
* cjpais#157

NightMachinery added 2 commits February 8, 2026 19:56

feat(postprocess): add configurable stdin/stdout command filter

ea2126f

feat(postprocess): expand ~/ paths in command filter args

e592a76

cjpais closed this Feb 9, 2026

AlexanderYastrebov mentioned this pull request Mar 1, 2026

Add transcription hook #930

Open

6 tasks

NightMachinery wants to merge 2 commits into cjpais:main from NightMachinery:codex/command-filter-stdin-stdout


4 participants
