Files
gajim-plugins/stt_voice_messages/README.md
2026-05-05 05:23:23 -03:00

34 lines
996 B
Markdown

# About
This plugin allows you in conjuction with a _general-purpose speech recognition model_ to transcribe your voice messages to text.
In order to make use of this plugin, you need to have at least one of the following models installed:
#### OpenAI Whisper
- Website: https://github.com/openai/whisper
- Installable by: `pip install -U openai-whisper`
#### Faster Whisper
- Website: https://github.com/SYSTRAN/faster-whisper
- Installable by: `pip install -U faster_whisper`
# Hint
_**The plugin is very much POC at this stage!**_
Currently a chosen model will be on first downloaded in the background, during which
Gajim's UI may not respond.
Typical model sizes are in case of OpenAI Whisper:
| Multi Langual Model | Download Size |
|---------------------|---------------|
| Tiny | 70 MB |
| Base | 140 MB |
| Small | 460 MB |
| Medium | 1.4 GB |
| Large | 2.9 GB |