Serve
The serve command starts a FastAPI web server that provides both a web interface and an HTTP API for text-to-speech generation.
Basic Usage
pocket-tts serve
This starts a server on http://localhost:8000 with the default voice model.
Command Options
--host HOST: Host to bind to (default: "localhost")
--port PORT: Port to bind to (default: 8000)
--reload: Enable auto-reload for development
--language: Language for the TTS model, one of 'english_2026-01', 'english_2026-04', 'english', 'french_24l', 'german_24l', 'portuguese_24l', 'italian_24l', 'spanish_24l' (default: english, which is the same model as 'english_2026-04'). Incompatible with --config. The "24l" variants are larger models that have not been distilled yet and are included only as a preview.
--config: Path to a custom config .yaml. Incompatible with --language.
--quantize: Use int8 quantization for the model (default: False). This can reduce memory usage and increase speed, with minimal impact on audio quality.
Examples
Basic Server
# Start with default settings
pocket-tts serve
# Custom host and port
pocket-tts serve --host "localhost" --port 8080
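For local development, several of the options listed under Command Options can be combined. The sketch below assumes that --reload and --quantize are plain switches that take no value, as their descriptions suggest; the PORT variable and the value 8080 are only examples, and the `command -v` guard is there so the snippet does nothing harmful on a machine without the CLI installed.

```shell
# Example value only; 8080 is an arbitrary port.
PORT=8080

# Assumption: --reload and --quantize are value-less switches.
if command -v pocket-tts >/dev/null 2>&1; then
    pocket-tts serve --port "$PORT" --reload --quantize
else
    echo "pocket-tts not found on PATH" >&2
fi
```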
Custom Language
To choose which language model the server loads, pass --language with one of the values listed under Command Options, for example --language french_24l.
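As a minimal sketch, the full command might look like the following. The LANGUAGE shell variable is purely illustrative, and french_24l is just one of the documented values; any other value from the Command Options list can be substituted. Because --language is incompatible with --config, the two should not appear in the same command.

```shell
# Example value; replace with any documented --language value.
LANGUAGE="french_24l"

# Guard so the snippet is harmless where the CLI is not installed.
if command -v pocket-tts >/dev/null 2>&1; then
    pocket-tts serve --language "$LANGUAGE"
else
    echo "pocket-tts not found on PATH" >&2
fi
```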
Custom Model Config
If you'd like to override the paths from which the models are loaded, you can provide a custom YAML configuration.
Copy one of the files in pocket_tts/config (for example pocket_tts/config/english.yaml) and change weights_path, weights_path_without_voice_cloning, and tokenizer_path to the paths of the models you want to load.
Then, use the --config option to point to your newly created config.
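The steps above could look like the following sketch. The file name my_config.yaml is a placeholder, and the source path assumes the command is run from a directory where pocket_tts/config/english.yaml exists; neither is prescribed by pocket-tts. The three path keys still have to be edited by hand in the copy before the server is started.

```shell
# Assumed paths; adjust both to your setup.
SOURCE_CONFIG="pocket_tts/config/english.yaml"
CUSTOM_CONFIG="my_config.yaml"

# Copy the bundled config if it is present, then edit weights_path,
# weights_path_without_voice_cloning and tokenizer_path in the copy.
if [ -f "$SOURCE_CONFIG" ]; then
    cp "$SOURCE_CONFIG" "$CUSTOM_CONFIG"
fi

# Start the server with the custom config (do not combine with --language).
if command -v pocket-tts >/dev/null 2>&1 && [ -f "$CUSTOM_CONFIG" ]; then
    pocket-tts serve --config "$CUSTOM_CONFIG"
fi
```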
Web Interface
Once the server is running, navigate to http://localhost:8000 (or the host and port you passed with --host and --port) to access the web interface.
For more advanced usage, see the Python API documentation for direct integration with the TTS model.