Local STT Models
download_local_stt_model
Downloads a local speech-to-text model from Hugging Face.
Parameters:
- Model identifier (e.g., nvidia/parakeet-tdt_ctc-110m, usefulsensors/moonshine-base)
Returns:
- Canonical model name
- Model provider (parakeet, whisper, moonshine, etc.)
- Download method used
- Local filesystem path where the model is stored
- Human-readable status message
warmup_local_stt_model
Preloads a local STT model into memory for faster inference.
Parameters:
- Model identifier to warm up
Returns:
- Model name
- Model provider
- Whether warmup succeeded
- Status details
deactivate_local_stt_model
Unloads a local STT model from memory.
Parameters:
- Model to deactivate (omit to deactivate all)
Returns:
- Deactivated model name
- Model provider
- Whether deactivation succeeded
- Status details
get_local_stt_hardware_advice
Analyzes system hardware and recommends appropriate STT models.
Parameters:
- Currently selected model to validate against hardware
Returns:
- CPU model name
- Number of logical CPU cores
- Total system RAM in gigabytes
- Whether an NVIDIA GPU is available
- GPU model name
- GPU VRAM in gigabytes
- Hardware tier: high, medium, or low
- Recommended model for this hardware
- List of recommended models
- Models that may be too demanding
- Warning message if the selected model is unsuitable
- Additional hardware details
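One way to act on the advice is to fall back to the recommended model whenever a warning is returned. The sketch below assumes the response arrives as a dictionary; the keys `warning` and `recommended_model` are hypothetical, since the field names are not shown here.

```python
from typing import Any, Mapping


def choose_model(advice: Mapping[str, Any], selected: str) -> str:
    """Pick the model to load from a hardware-advice response.

    The keys "warning" and "recommended_model" are hypothetical names
    for the warning message and recommended model described above.
    """
    warning = advice.get("warning")
    recommended = advice.get("recommended_model")
    # Switch only when the selected model was flagged AND an alternative exists.
    if warning and recommended:
        return recommended
    return selected
```

If no warning is present, the selected model is kept unchanged.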
delete_local_stt_model
Deletes a downloaded local STT model.
Parameters:
- Model identifier to delete
Returns:
- Deleted model name
- Hugging Face repository ID
- Whether deletion succeeded
- Path that was deleted
- Deletion status
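The local STT commands above suggest a lifecycle: download, warm up, use, deactivate. A minimal sketch of that order follows. The `call` function stands in for whatever transport invokes these commands, and the argument key `model` and response key `success` are hypothetical names, not confirmed by this reference.

```python
from typing import Any, Callable, Dict

# Hypothetical transport: takes a command name and arguments, returns the response.
Call = Callable[[str, Dict[str, Any]], Dict[str, Any]]


def with_local_stt_model(call: Call, model_id: str, work: Callable[[], Any]) -> Any:
    """Download and warm up a model, run `work`, then unload the model."""
    call("download_local_stt_model", {"model": model_id})
    warm = call("warmup_local_stt_model", {"model": model_id})
    if not warm.get("success", False):
        raise RuntimeError(f"warmup failed for {model_id}")
    try:
        return work()
    finally:
        # Free memory even if `work` raised.
        call("deactivate_local_stt_model", {"model": model_id})
```

Deactivation runs in a `finally` block so the model is unloaded even when the work in between fails.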
Ollama Models
pull_ollama_model
Downloads a model from the Ollama registry.
Parameters:
- Ollama model identifier (e.g., llama3.2:3b, mistral:latest)
- Ollama API base URL (defaults to http://localhost:11434)
Returns:
- Ollama base URL used
- Model name
- Whether pull succeeded
- Pull status message
get_ollama_status
Checks whether Ollama is installed and running.
Parameters:
- Ollama base URL to check
Returns:
- Whether Ollama is installed
- Whether the Ollama service is reachable
- Ollama version string
- Additional status information
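This reference does not say which endpoints these commands use. For orientation, the sketch below builds the equivalent raw requests against Ollama's own HTTP API (`GET /api/version` and `POST /api/pull`); the helper names are illustrative, and whether the commands above wrap these exact endpoints is an assumption.

```python
import json
from urllib.request import Request

DEFAULT_BASE_URL = "http://localhost:11434"


def version_request(base_url: str = DEFAULT_BASE_URL) -> Request:
    """Build a GET request for the Ollama version endpoint (reachability check)."""
    return Request(base_url.rstrip("/") + "/api/version", method="GET")


def pull_request(model: str, base_url: str = DEFAULT_BASE_URL) -> Request:
    """Build a POST request asking Ollama to pull `model` without streaming."""
    body = json.dumps({"model": model, "stream": False}).encode("utf-8")
    return Request(
        base_url.rstrip("/") + "/api/pull",
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )
```

Either request can be sent with `urllib.request.urlopen(request, timeout=...)`; a connection error on the version request indicates the service is not reachable at that base URL.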
fetch_provider_models
Fetches available models from an OpenAI-compatible provider.
Parameters:
- Provider API key
- API base URL (defaults to OpenAI)
Returns:
- Base URL used
- List of available model IDs
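OpenAI-compatible providers conventionally list models at `GET {base_url}/models` with a bearer token, returning a `data` array of objects with an `id` field. The sketch below shows that convention; the helper names and the exact default base URL string are assumptions, not taken from this reference.

```python
from typing import Any, List, Mapping
from urllib.request import Request

# Assumed default; the reference only says the base URL "defaults to OpenAI".
OPENAI_BASE_URL = "https://api.openai.com/v1"


def models_request(api_key: str, base_url: str = OPENAI_BASE_URL) -> Request:
    """Build the GET request that lists models on an OpenAI-compatible API."""
    return Request(
        base_url.rstrip("/") + "/models",
        headers={"Authorization": f"Bearer {api_key}"},
        method="GET",
    )


def extract_model_ids(payload: Mapping[str, Any]) -> List[str]:
    """Collect the `id` of each entry in the response's `data` array."""
    return [item["id"] for item in payload.get("data", []) if "id" in item]
```

Entries without an `id` are skipped rather than raising, since providers vary in how strictly they follow the response shape.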
TTS Voice Management
ensure_voice_model
Downloads and installs a Piper TTS voice model.
Returns:
- Path to installed voice model
- Path to voice config file
clone_coqui_voice
Clones a voice from an audio sample using Coqui TTS.
Parameters:
- Unique ID for the cloned voice
- Base64-encoded audio sample (5-30 seconds recommended)
- Custom Python executable path
- Coqui TTS model to use
- Target language code
- Original audio filename
- Enable GPU acceleration
Returns:
- Created voice ID
- Audio sample duration
- Voice storage directory
- List of all available voices
- Base64-encoded preview audio
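Before calling clone_coqui_voice, the audio sample has to be Base64-encoded, and checking its length against the recommended 5-30 seconds can catch unsuitable samples early. The sketch below assumes a WAV sample (the accepted formats are not stated here); the function and key names are illustrative.

```python
import base64
import io
import wave

RECOMMENDED_SECONDS = (5.0, 30.0)  # range recommended in the parameter description


def wav_duration_seconds(wav_bytes: bytes) -> float:
    """Return the duration of a WAV file held in memory."""
    with wave.open(io.BytesIO(wav_bytes), "rb") as wav:
        return wav.getnframes() / wav.getframerate()


def encode_sample(wav_bytes: bytes) -> dict:
    """Base64-encode a WAV sample and report whether its length is in range."""
    duration = wav_duration_seconds(wav_bytes)
    low, high = RECOMMENDED_SECONDS
    return {
        "audio_base64": base64.b64encode(wav_bytes).decode("ascii"),
        "duration_seconds": duration,
        "within_recommended_range": low <= duration <= high,
    }
```

The range check is advisory only; a sample outside it is still encoded, and the caller decides whether to proceed.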
Example Usage
- Local STT
- Ollama
- Voice Cloning