gaia talk: stuck listening, ignores LEMONADE_BASE_URL, no mic sensitivity control #347

@itomek

Summary

gaia talk has two distinct problems that prevent it from working in common configurations, particularly when using a remote Lemonade Server or on Windows systems with typical microphone hardware.

Problem 1: Stuck in "Listening" mode, never detects speech

Observed behavior: Running gaia talk shows the "Listening" spinner indefinitely. Speech is never detected regardless of how loudly the user speaks.

Root cause: The voice activity detection (VAD) thresholds in AudioClient are much stricter than the underlying AudioRecorder defaults:

Parameter            AudioClient value    AudioRecorder default
silence_threshold    0.01                 0.003
min_audio_length     1.0s                 0.5s

The AudioClient silence threshold is about 3.3x higher than the recorder default (0.01 vs. 0.003), so audio must be roughly 3.3x louder in amplitude to be detected as speech. The minimum audio length is also doubled (1.0s vs. 0.5s). Many Windows microphones produce audio levels below 0.01, so speech detection never triggers.
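To illustrate the effect of the two thresholds, here is a minimal sketch of an amplitude-threshold check. It assumes the level is computed as RMS over float samples in [-1.0, 1.0]; the actual metric used by AudioRecorder is not shown in this issue, and `rms`, `is_speech`, and the synthetic `quiet_mic` signal are hypothetical names for illustration only.

```python
import math


def rms(samples):
    """Root-mean-square amplitude of float samples in [-1.0, 1.0]."""
    if not samples:
        return 0.0
    return math.sqrt(sum(s * s for s in samples) / len(samples))


def is_speech(samples, silence_threshold):
    """Assumed detection rule: a chunk counts as speech if its RMS exceeds the threshold."""
    return rms(samples) > silence_threshold


# Synthetic stand-in for a quiet microphone: one second of a 440 Hz tone at
# 16 kHz with peak amplitude 0.007, which gives an RMS of roughly 0.00495.
quiet_mic = [0.007 * math.sin(2 * math.pi * 440 * n / 16000) for n in range(16000)]

print(is_speech(quiet_mic, 0.003))  # True: passes the AudioRecorder default
print(is_speech(quiet_mic, 0.01))   # False: rejected by the AudioClient value
```

Under these assumptions, a signal that the recorder default would accept is silently discarded with the AudioClient value, which matches the "stuck listening" symptom.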

Impact: There is no user-visible diagnostic information during this failure:

  • No microphone device name is shown at startup
  • No audio level feedback while listening
  • No warning when extended listening produces no detected speech
  • The user has no way to tell if the microphone is working, too quiet, or misconfigured
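One possible shape for the missing diagnostics is sketched below: a small monitor that tracks the peak level seen while listening and emits a single warning after an extended period without detected speech. This is a hypothetical helper, not existing GAIA code; `ListeningMonitor`, `warn_after_s`, and the message wording are all assumptions.

```python
class ListeningMonitor:
    """Hypothetical helper: tracks input level and warns after prolonged silence."""

    def __init__(self, silence_threshold, warn_after_s=10.0):
        self.silence_threshold = silence_threshold
        self.warn_after_s = warn_after_s
        self.peak_level = 0.0    # loudest chunk level seen so far
        self.silent_for_s = 0.0  # time since the last chunk above threshold
        self.warned = False

    def update(self, level, chunk_duration_s):
        """Feed one chunk's level; return a warning string once per silent stretch, else None."""
        self.peak_level = max(self.peak_level, level)
        if level > self.silence_threshold:
            # Speech detected: reset the silence timer and re-arm the warning.
            self.silent_for_s = 0.0
            self.warned = False
            return None
        self.silent_for_s += chunk_duration_s
        if self.silent_for_s >= self.warn_after_s and not self.warned:
            self.warned = True
            return (
                f"No speech detected for {self.silent_for_s:.0f}s "
                f"(peak level {self.peak_level:.4f}, "
                f"threshold {self.silence_threshold}). "
                "Check the microphone or lower the threshold."
            )
        return None


monitor = ListeningMonitor(silence_threshold=0.01, warn_after_s=2.0)
for _ in range(4):
    message = monitor.update(level=0.005, chunk_duration_s=0.5)
    if message:
        print(message)
```

Reporting the peak level next to the threshold lets the user tell "microphone is dead" (peak near zero) apart from "microphone is too quiet" (peak just under the threshold).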

Affected files:

  • src/gaia/audio/audio_client.py

Problem 2: No CLI option to adjust microphone sensitivity

Observed behavior: Users with quiet microphones or noisy environments have no way to adjust the voice detection amplitude threshold without modifying source code.

Impact: Even if a user discovers that their microphone levels are too low for the default threshold, there is no --silence-threshold or equivalent CLI flag on gaia talk to tune the sensitivity. This forces users to either modify source code or accept that gaia talk will not work with their hardware.
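A flag along these lines could be sketched with argparse as follows. This is a standalone illustration, not the actual parser in src/gaia/cli.py; `build_talk_parser` and `positive_float` are hypothetical names, and leaving the default as None (meaning "use the library default") is an assumed design choice.

```python
import argparse


def positive_float(text):
    """argparse type: accept only floats greater than zero."""
    value = float(text)
    if value <= 0:
        raise argparse.ArgumentTypeError("threshold must be greater than 0")
    return value


def build_talk_parser():
    """Hypothetical stand-in for the talk subcommand's argument parser."""
    parser = argparse.ArgumentParser(prog="gaia talk")
    parser.add_argument(
        "--silence-threshold",
        type=positive_float,
        default=None,  # assumption: None means "keep the library default"
        help="Amplitude threshold for voice detection; lower values are more sensitive.",
    )
    return parser


args = build_talk_parser().parse_args(["--silence-threshold", "0.003"])
print(args.silence_threshold)  # 0.003
```

The parsed value would then need to be passed through the TalkSDK configuration to AudioClient, which is why all three files listed below are affected.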

Affected files:

  • src/gaia/cli.py (talk command argument parsing)
  • src/gaia/talk/sdk.py (TalkSDK configuration)
  • src/gaia/audio/audio_client.py (AudioClient configuration)

Environment

  • OS: Windows 11
  • Python: 3.10+
  • GAIA version: main branch (commit 05b6fda)

Labels: bug (Something isn't working)
