Version 3.3.0

diodiogod · diodiogod · commit 561e8a535145 · 2025-08-01T23:02:33.000-03:00
Major Feature: Multilanguage ChatterBox Support

🌍 NEW: Multi-language ChatterBox TTS
- Language dropdown for English, German, Norwegian models
- Automatic HuggingFace model download and management
- Local model prioritization for faster generation
- Safetensors format support with .pt backward compatibility
- Language-aware caching system to prevent model conflicts

🎯 Enhanced Nodes:
- ChatterBox TTS Node: Full multilanguage support
- ChatterBox SRT TTS Node: SRT timing with multilanguage models
- Character switching works seamlessly with all supported languages

⚠️ BREAKING CHANGE: Workflow Compatibility
- Added language parameter as second input in both TTS nodes
- Existing workflows need manual parameter adjustment
- All example workflows updated for new parameter structure

🔧 Technical Improvements:
- Enhanced model manager with language-specific loading
- Robust fallback system: local → HuggingFace → English fallback
- JaneDoe84's safetensors loading fix integrated safely
- Language-aware cache keys prevent cross-language conflicts
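The fallback order (local → HuggingFace → English) and the language-aware cache keys described in the commit message can be sketched roughly as follows. This is a minimal illustration, not the code from this commit: the names `find_local_model`, `resolve_model`, `cache_key`, the one-directory-per-language layout, and the `download` callable (assumed to raise `OSError` on failure) are all hypothetical.

```python
from pathlib import Path
from typing import Callable, Optional, Tuple


def find_local_model(models_root: Path, language: str) -> Optional[Path]:
    """Return the language's local directory if it contains model weights.

    Assumes a hypothetical layout of one sub-directory per language.
    """
    candidate = models_root / language
    if not candidate.is_dir():
        return None
    # Prefer .safetensors, but still accept legacy .pt checkpoints.
    for pattern in ("*.safetensors", "*.pt"):
        if any(candidate.glob(pattern)):
            return candidate
    return None


def resolve_model(
    language: str,
    models_root: Path,
    download: Callable[[str], Path],
) -> Tuple[str, str, Path]:
    """Return (language_used, source, path), trying local, then download,
    then repeating both steps for English as a last resort."""
    # dict.fromkeys de-duplicates while keeping order (language == "English").
    for lang in dict.fromkeys((language, "English")):
        local = find_local_model(models_root, lang)
        if local is not None:
            return lang, "local", local
        try:
            return lang, "huggingface", download(lang)
        except OSError:
            continue  # Download failed; try the next candidate language.
    raise RuntimeError(f"No model available for {language!r} or English")


def cache_key(language: str, device: str) -> str:
    """Build a cache key that includes the language, so a German model
    is never served from an English cache entry."""
    return f"chatterbox:{language.lower()}:{device}"
```

Passing the downloader in as a callable keeps the sketch independent of any particular HuggingFace client call and makes the fallback order easy to test offline.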
diff --git a/CHANGELOG.md b/CHANGELOG.md
@@ -5,6 +5,36 @@ All notable changes to this project will be documented in this file.
 The format is based on [Keep a Changelog](https://keepachangelog.com/en/1.0.0/),
 and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0.html).
 
+## [3.3.0] - 2025-08-01
+
+### Added
+
+- Major Feature: Multilanguage ChatterBox Support
+- 🌍 NEW: Multi-language ChatterBox TTS
+- Added language parameter as second input in both TTS nodes
+- All example workflows updated for new parameter structure
+
+### Fixed
+
+- Language dropdown for English, German, Norwegian models
+- Automatic HuggingFace model download and management
+- Local model prioritization for faster generation
+- Safetensors format support with .pt backward compatibility
+- Language-aware caching system to prevent model conflicts
+- ChatterBox TTS Node: Full multilanguage support
+- ChatterBox SRT TTS Node: SRT timing with multilanguage models
+- Character switching works seamlessly with all supported languages
+- Existing workflows need manual parameter adjustment
+- Robust fallback system: local → HuggingFace → English fallback
+- JaneDoe84's safetensors loading fix integrated safely
+- Language-aware cache keys prevent cross-language conflicts
+
+### Changed
+
+- 🎯 Enhanced Nodes:
+- ⚠️  BREAKING CHANGE: Workflow Compatibility
+- 🔧 Technical Improvements:
+- Enhanced model manager with language-specific loading
 ## [3.2.9] - 2025-08-01
 
 ### Fixed
diff --git a/README.md b/README.md
@@ -6,7 +6,7 @@
 [![Forks][forks-shield]][forks-url]
 [![Dynamic TOML Badge][version-shield]][version-url]
 
-# ComfyUI ChatterBox SRT Voice (diogod) v3.2.9
+# ComfyUI ChatterBox SRT Voice (diogod) v3.3.0
 
 *This is a refactored node, originally created by [ShmuelRonen](https://github.com/ShmuelRonen/ComfyUI_ChatterBox_Voice).*
 
@@ -174,12 +174,27 @@ Welcome to our show! [pause:1s] Today we'll discuss exciting topics.
 - ⚡ **Fast & Quality** - Production-grade TTS that outperforms ElevenLabs
 - 🎭 **Character Switching** - Multi-character TTS with `[CharacterName]` tags and alias system
 - 😤 **Emotion Control** - Unique exaggeration parameter for expressive speech
+- 🌍 **Multi-language ChatterBox** - Support for English, German, Norwegian models with automatic download and local model prioritization
 - 🌍 **Multi-language F5-TTS** - Support for English, German, Spanish, French, Japanese and more
 - 📝 **Enhanced Chunking** - Intelligent text splitting for long content with multiple combination methods
 - 📦 **Self-Contained** - Bundled ChatterBox for zero-installation-hassle experience
 - 🎵 **Advanced Audio Processing** - Optional FFmpeg support for premium audio quality with graceful fallback
 - 🌊 **Audio Wave Analyzer** - Interactive waveform visualization and precise timing extraction for F5-TTS workflows → **[📖 Complete Guide](docs/🌊_Audio_Wave_Analyzer-Complete_User_Guide.md)**
 
+### 🌍 Multi-language ChatterBox Models
+The ChatterBox TTS and SRT nodes now support multiple languages with automatic model management:
+
+**Supported Languages:**
+- 🇺🇸 **English**: Original ResembleAI model (default)
+- 🇩🇪 **German**: High-quality German ChatterBox model
+- 🇳🇴 **Norwegian**: Norwegian ChatterBox model (Bokmål and Nynorsk dialects)
+
+**Smart Model Management:**
+- Language dropdown in both TTS and SRT nodes
+- Automatic download from HuggingFace when needed
+- Local model prioritization for faster generation
+- Safetensors format support with .pt backward compatibility
+
 <div align="right"><a href="#readme-top">↗️ Back to top</a></div>
 
 ## 🚀 Quick Start
diff --git a/chatterbox_srt/__init__.py b/chatterbox_srt/__init__.py
@@ -4,7 +4,7 @@
 """
 
 # Version info
-__version__ = "3.2.9"
+__version__ = "3.3.0"
 __author__ = "Diogod"
 
 # Import the new SRT modules
diff --git a/core/__init__.py b/core/__init__.py
@@ -4,7 +4,7 @@
 """
 
 # Version info
-__version__ = "3.2.9"
+__version__ = "3.3.0"
 __author__ = "Diogod"
 
 # Make imports available at package level
diff --git a/nodes.py b/nodes.py
@@ -1,5 +1,5 @@
 # Version and constants
-VERSION = "3.2.9"
+VERSION = "3.3.0"
 IS_DEV = False  # Set to False for release builds
 VERSION_DISPLAY = f"v{VERSION}" + (" (dev)" if IS_DEV else "")
 SEPARATOR = "=" * 70
diff --git a/pyproject.toml b/pyproject.toml
@@ -1,7 +1,7 @@
 [project]
 name = "chatterbox_srt_voice"
 description = "ChatterBox SRT Voice TTS Node is a fork of 'ChatteBox Voice' with additional devolpments and full F5-TTS implementation as well. I introduced a SRT node designed to help you synchronize your generated TTS audio with `.srt` subtitle files. Audio wave analyzer will help you find speech segments for f5 speech edit and much more!"
-version = "3.2.9"
+version = "3.3.0"
 license = {file = "LICENSE"}
 dependencies = ["s3tokenizer>=0.1.7", "resemble-perth", "librosa", "scipy", "omegaconf", "accelerate", "transformers==4.46.3", "# Additional dependencies for SRT support and audio processing", "conformer>=0.3.2", "torch", "torchaudio", "numpy", "einops", "phonemizer", "g2p-en", "unidecode", "# Audio processing and timing dependencies", "soundfile", "resampy", "webrtcvad", "# Optional but recommended for better performance", "numba"]
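Because this commit bumps the same version string in several files, a small consistency check can catch a missed file. The sketch below is an assumption-laden illustration, not part of the repository: `extract_versions` and `versions_consistent` are hypothetical helpers, and the regular expression only covers the three assignment styles visible in this diff (`__version__`, `VERSION`, and `version`). The README heading is not checked.

```python
import re
from typing import Dict

# Matches lines such as: __version__ = "3.3.0", VERSION = "3.3.0", version = "3.3.0"
# The required "=" right after the name keeps VERSION_DISPLAY from matching.
VERSION_PATTERN = re.compile(
    r'^\s*(?:__version__|VERSION|version)\s*=\s*"([^"]+)"',
    re.MULTILINE,
)


def extract_versions(files: Dict[str, str]) -> Dict[str, str]:
    """Map each file name to the first version string found in its text."""
    found = {}
    for name, text in files.items():
        match = VERSION_PATTERN.search(text)
        if match:
            found[name] = match.group(1)
    return found


def versions_consistent(files: Dict[str, str]) -> bool:
    """True when at least one version was found and all of them agree."""
    return len(set(extract_versions(files).values())) == 1


# File contents abbreviated from the hunks in this diff.
files = {
    "chatterbox_srt/__init__.py": '__version__ = "3.3.0"\n__author__ = "Diogod"\n',
    "core/__init__.py": '__version__ = "3.3.0"\n__author__ = "Diogod"\n',
    "nodes.py": 'VERSION = "3.3.0"\nIS_DEV = False\nVERSION_DISPLAY = f"v{VERSION}"\n',
    "pyproject.toml": '[project]\nname = "chatterbox_srt_voice"\nversion = "3.3.0"\n',
}
print(versions_consistent(files))
```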