openclaw-listen

openclaw-listen is a command-line Rust application for Linux that listens to a local microphone, transcribes speech with OpenAI Whisper, and sends the resulting text to an OpenClaw Gateway session.

Goals

Run as a local CLI daemon on Linux.
Capture microphone input from a local sound device.
Detect speech boundaries automatically using a simple silence gate.
Transcribe utterances with the OpenAI audio transcription API.
Send the transcribed text into an OpenClaw session through the Gateway.

Configuration

Configuration is loaded from:

--config <path> if provided
$XDG_CONFIG_HOME/openclaw-listen/config.toml
~/.config/openclaw-listen/config.toml
.env in the current working directory, if present
environment variables

Environment variables:

OPENCLAW_GATEWAY_URL
OPENCLAW_GATEWAY_TOKEN
OPENCLAW_SESSION_KEY
OPENCLAW_SESSION_FILTER
OPENAI_API_KEY
OPENAI_BASE_URL
OPENAI_TRANSCRIPTION_MODEL
OPENAI_TRANSCRIPTION_LANGUAGE
OPENAI_TRANSCRIPTION_PROMPT
AUDIO_INPUT_DEVICE
AUDIO_INPUT_GAIN
OPENCLAW_LISTEN_LOG_PATH
WAKE_WORD_ENABLED
WAKE_WORD_ENGINE
WAKE_WORD_MODEL_PATH
WAKE_WORD_THRESHOLD
WAKE_WORD_SIDECAR_COMMAND
WAKE_WORD_SIDECAR_SCRIPT
RUST_LOG

See config.example.toml. For secret values, see .env.example.

Recommended split:

keep stable non-secret settings in config.toml
keep secrets such as OPENCLAW_GATEWAY_TOKEN and OPENAI_API_KEY in .env
use real exported shell env vars only when you want to override .env

Example .env:

OPENCLAW_GATEWAY_TOKEN=replace-me
OPENAI_API_KEY=replace-me
AUDIO_INPUT_GAIN=1.0
WAKE_WORD_MODEL_PATH=$HOME/.config/openclaw-listen/wake/model.onnx

Target Session Selection

openclaw-listen needs a single destination session for outgoing transcripts.

Set openclaw.session_key or OPENCLAW_SESSION_KEY for an exact target.
Otherwise, set openclaw.session_filter or OPENCLAW_SESSION_FILTER and the app will require that it matches exactly one live session.
If neither is set, the app will try a main session first. If that is still ambiguous, it will ask you to configure a target explicitly.

Development

cargo run -- sessions
cargo run -- test
cargo run -- test --send
cargo run -- daemon

To enable real Linux microphone access through CPAL and ALSA:

cargo run --features audio-cpal -- devices
cargo run --features audio-cpal -- test
cargo run --features audio-cpal -- daemon

On Debian or Ubuntu style systems, that feature typically needs:

sudo apt install pkg-config libasound2-dev

With audio-cpal enabled, the app will:

listen to the configured microphone
wait for an openWakeWord sidecar wake detection by default
wait for speech to cross the configured amplitude threshold
stop after trailing silence
resample to mono 16 kHz WAV
send the captured utterance to OpenAI for transcription
forward the resulting text to OpenClaw using chat.send
append transcribed speech and observed agent replies to /var/log/openclaw-listen.log

Wake Word

Wake-word mode is enabled by default and uses a small Python sidecar at scripts/openwakeword-sidecar.py. The Rust daemon owns the microphone stream and writes 16-bit 16 kHz mono PCM frames to the sidecar's stdin; the sidecar never opens the audio device itself. The default model path is $HOME/.config/openclaw-listen/wake/model.onnx; point WAKE_WORD_MODEL_PATH or [wake].model_path at any existing openWakeWord-compatible .onnx or .tflite model. If your microphone level runs low, increase AUDIO_INPUT_GAIN or [audio].input_gain so the wake model and Whisper both see a stronger signal.

Install the sidecar dependencies in the Python environment used by the service:

python3 -m pip install openwakeword numpy

In practice, say the wake word, wait a beat, then speak the command. The same long-lived Rust microphone stream is used for both wake detection and command capture.

Transcript Log

When running daemon or test --send, openclaw-listen appends JSON Lines entries to the configured transcript log. The default path is /var/log/openclaw-listen.log; override it with OPENCLAW_LISTEN_LOG_PATH or [logging].transcript_log_path.

Example entry:

{"timestamp_unix_ms":1776935117036,"session_key":"agent:main:telegram:direct:8735858952","role":"user","text":"Good morning."}

systemd

Build the release binary and install the bundled service and tmpfiles config:

./scripts/install-systemd-service.sh
sudo systemctl enable --now openclaw-listen.service

The installer renders the systemd unit for the current $USER, this checkout path, and the current user runtime directory.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
scripts		scripts
src		src
systemd		systemd
.codex		.codex
.env.example		.env.example
.gitignore		.gitignore
Cargo.toml		Cargo.toml
LICENSE		LICENSE
README.md		README.md
config.example.toml		config.example.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

openclaw-listen

Goals

Configuration

Target Session Selection

Development

Wake Word

Transcript Log

systemd

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

openclaw-listen

Goals

Configuration

Target Session Selection

Development

Wake Word

Transcript Log

systemd

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages