Skip to content

Conversation

@Prathat2006
Copy link

@Prathat2006 Prathat2006 commented Oct 8, 2025

Added a model load/unload mechanism to optimize memory usage.
The model now automatically unloads after being idle for a set period (default: 1 minute).
If the model is not loaded before generating speech, it will automatically load on demand.
This improvement makes the API more memory-efficient and enhances Open-WebUI integration by reducing VRAM usage when working with local models.

@Prathat2006 Prathat2006 changed the title Prathmesh Added Auto model load/unload support to save memory Oct 8, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant