4 releases

| Version | Released |
|---|---|
| 0.1.3 | Jan 5, 2024 |
| 0.1.2 | Oct 22, 2023 |
| 0.1.1 | Oct 22, 2023 |
| 0.1.0 | Oct 22, 2023 |
desk-talk
Transcription for your desktop.
A program that records what you say while you hold a key down, and types out what you said when you release it.
https://github.com/sloganking/desk-talk/assets/16965931/e5da605b-3a9d-4394-b4ec-a3de65605a65
Setup
Make sure ffmpeg is installed and its directory is on your PATH.
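One way to check this before going further is the sketch below. It assumes a POSIX-style shell (bash, zsh, sh); the variable name `ffmpeg_status` is only illustrative and is not part of desk-talk.

```shell
# Check whether an ffmpeg executable can be found through PATH.
if command -v ffmpeg >/dev/null 2>&1; then
    ffmpeg_status="found"
    # Print the first line of the version banner as confirmation.
    ffmpeg -version | head -n 1
else
    ffmpeg_status="missing"
    echo "ffmpeg not found on PATH; install it or add its directory to PATH" >&2
fi
```

If the check reports that ffmpeg is missing, install it with your platform's package manager or add the directory containing the ffmpeg binary to PATH, then open a new terminal and try again.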
Quickstart
Assign your OpenAI API key to the OPENAI_API_KEY environment variable and run:
desk-talk --ptt-key scroll-lock
Or pass your OpenAI API key as a flag like so:
desk-talk --ptt-key scroll-lock --api-key [YOUR_API_KEY]
desk-talk will now record whenever you hold down the ptt-key, and type what you said each time you release it.
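For the environment-variable route, a minimal sketch of setting the key in a POSIX-style shell might look like this. The key value is a placeholder and `key_state` is an illustrative variable; on Windows the syntax for setting environment variables differs.

```shell
# Placeholder value: replace it with your real OpenAI API key.
export OPENAI_API_KEY="your-api-key-here"

# Confirm the variable is set without printing the key itself.
if [ -n "${OPENAI_API_KEY:-}" ]; then
    key_state="set"
else
    key_state="unset"
fi
echo "OPENAI_API_KEY is $key_state"
```

A variable exported this way lasts only for the current terminal session, so run `desk-talk --ptt-key scroll-lock` from the same terminal, or add the export line to your shell profile if you want it to persist.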
Other Usage
Special Keys
To find the name of a key by pressing it, run:
desk-talk show-key-presses
If your key shows as Unknown(number), pass number to the --special-ptt-key flag like so:
desk-talk --special-ptt-key 125
Non-default recording device
To use a microphone other than the system default, first list the system microphone names:
desk-talk list-devices
Then pass the desired microphone name to --device like so:
desk-talk --ptt-key scroll-lock --device "Microphone (3- USB Audio Device)"