Evaluate the possibility of integrating via an intermediary server.
Example: running a container with Ollama + Piper TTS (open-source, runs offline).
You create a microservice that receives the prompt, calls Ollama, and passes the response to the TTS.
The final output is audio.