🗣️ voice2machine

voice dictation for any text field in your OS

what is this

A tool that converts your voice to text using your local GPU.

The premise is simple: speaking is faster than typing. This project allows you to dictate in any application without depending on cloud services.

philosophy

local-first: your audio never leaves your machine
modular: started as a script, now it's an app with separated responsibilities
gpu-powered: transcription speed using WHISPER locally

how it works

Two global keyboard shortcuts:

script	function
`v2m-toggle.sh`	records → transcribes → copies to clipboard
`v2m-llm.sh`	takes text from clipboard → refines it with LLM → replaces it

documentation

All technical info is in /docs (consolidated in Spanish):

visual flows

voice → text

flowchart LR
A[🎤 record] --> B{whisper}
B --> C[📋 clipboard]

text → improved text

flowchart LR
A[📋 copy] --> B{LLM}
B --> C[📋 replace]

if you don't see the diagrams, you need a mermaid extension

license

This project is licensed under the GNU General Public License v3.0 - see the LICENSE file for more details.

Name		Name	Last commit message	Last commit date
Latest commit History 211 Commits
apps		apps
archives		archives
docs		docs
scripts		scripts
.gitattributes		.gitattributes
.gitignore		.gitignore
AGENTES.md		AGENTES.md
AGENTS.md		AGENTS.md
LEEME.md		LEEME.md
LICENSE		LICENSE
README.md		README.md
package-lock.json		package-lock.json
voice2machine.code-workspace		voice2machine.code-workspace

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

🗣️ voice2machine

what is this

philosophy

how it works

documentation

visual flows

voice → text

text → improved text

license

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 4

Uh oh!

Languages

License

zarvent/voice2machine

Folders and files

Latest commit

History

Repository files navigation

🗣️ voice2machine

what is this

philosophy

how it works

documentation

visual flows

voice → text

text → improved text

license

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 4

Uh oh!

Languages

Packages