Skip to content
This repository was archived by the owner on Aug 29, 2025. It is now read-only.

Commit 561e8a5

Browse files
committed
Version 3.3.0
Major Feature: Multilanguage ChatterBox Support 🌍 NEW: Multi-language ChatterBox TTS - Language dropdown for English, German, Norwegian models - Automatic HuggingFace model download and management - Local model prioritization for faster generation - Safetensors format support with .pt backward compatibility - Language-aware caching system to prevent model conflicts 🎯 Enhanced Nodes: - ChatterBox TTS Node: Full multilanguage support - ChatterBox SRT TTS Node: SRT timing with multilanguage models - Character switching works seamlessly with all supported languages ⚠️ BREAKING CHANGE: Workflow Compatibility - Added language parameter as second input in both TTS nodes - Existing workflows need manual parameter adjustment - All example workflows updated for new parameter structure 🔧 Technical Improvements: - Enhanced model manager with language-specific loading - Robust fallback system: local → HuggingFace → English fallback - JaneDoe84's safetensors loading fix integrated safely - Language-aware cache keys prevent cross-language conflicts
1 parent ec06f63 commit 561e8a5

File tree

6 files changed

+50
-5
lines changed

6 files changed

+50
-5
lines changed

CHANGELOG.md

Lines changed: 30 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -5,6 +5,36 @@ All notable changes to this project will be documented in this file.
55
The format is based on [Keep a Changelog](https://keepachangelog.com/en/1.0.0/),
66
and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0.html).
77

8+
## [3.3.0] - 2025-08-01
9+
10+
### Added
11+
12+
- Major Feature: Multilanguage ChatterBox Support
13+
- 🌍 NEW: Multi-language ChatterBox TTS
14+
- Added language parameter as second input in both TTS nodes
15+
- All example workflows updated for new parameter structure
16+
17+
### Fixed
18+
19+
- Language dropdown for English, German, Norwegian models
20+
- Automatic HuggingFace model download and management
21+
- Local model prioritization for faster generation
22+
- Safetensors format support with .pt backward compatibility
23+
- Language-aware caching system to prevent model conflicts
24+
- ChatterBox TTS Node: Full multilanguage support
25+
- ChatterBox SRT TTS Node: SRT timing with multilanguage models
26+
- Character switching works seamlessly with all supported languages
27+
- Existing workflows need manual parameter adjustment
28+
- Robust fallback system: local → HuggingFace → English fallback
29+
- JaneDoe84's safetensors loading fix integrated safely
30+
- Language-aware cache keys prevent cross-language conflicts
31+
32+
### Changed
33+
34+
- 🎯 Enhanced Nodes:
35+
- ⚠️ BREAKING CHANGE: Workflow Compatibility
36+
- 🔧 Technical Improvements:
37+
- Enhanced model manager with language-specific loading
838
## [3.2.9] - 2025-08-01
939

1040
### Fixed

README.md

Lines changed: 16 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -6,7 +6,7 @@
66
[![Forks][forks-shield]][forks-url]
77
[![Dynamic TOML Badge][version-shield]][version-url]
88

9-
# ComfyUI ChatterBox SRT Voice (diogod) v3.2.9
9+
# ComfyUI ChatterBox SRT Voice (diogod) v3.3.0
1010

1111
*This is a refactored node, originally created by [ShmuelRonen](https://github.com/ShmuelRonen/ComfyUI_ChatterBox_Voice).*
1212

@@ -174,12 +174,27 @@ Welcome to our show! [pause:1s] Today we'll discuss exciting topics.
174174
-**Fast & Quality** - Production-grade TTS that outperforms ElevenLabs
175175
- 🎭 **Character Switching** - Multi-character TTS with `[CharacterName]` tags and alias system
176176
- 😤 **Emotion Control** - Unique exaggeration parameter for expressive speech
177+
- 🌍 **Multi-language ChatterBox** - Support for English, German, Norwegian models with automatic download and local model prioritization
177178
- 🌍 **Multi-language F5-TTS** - Support for English, German, Spanish, French, Japanese and more
178179
- 📝 **Enhanced Chunking** - Intelligent text splitting for long content with multiple combination methods
179180
- 📦 **Self-Contained** - Bundled ChatterBox for zero-installation-hassle experience
180181
- 🎵 **Advanced Audio Processing** - Optional FFmpeg support for premium audio quality with graceful fallback
181182
- 🌊 **Audio Wave Analyzer** - Interactive waveform visualization and precise timing extraction for F5-TTS workflows → **[📖 Complete Guide](docs/🌊_Audio_Wave_Analyzer-Complete_User_Guide.md)**
182183

184+
### 🌍 Multi-language ChatterBox Models
185+
The ChatterBox TTS and SRT nodes now support multiple languages with automatic model management:
186+
187+
**Supported Languages:**
188+
- 🇺🇸 **English**: Original ResembleAI model (default)
189+
- 🇩🇪 **German**: High-quality German ChatterBox model
190+
- 🇳🇴 **Norwegian**: Norwegian ChatterBox model (Bokmål and Nynorsk dialects)
191+
192+
**Smart Model Management:**
193+
- Language dropdown in both TTS and SRT nodes
194+
- Automatic download from HuggingFace when needed
195+
- Local model prioritization for faster generation
196+
- Safetensors format support with .pt backward compatibility
197+
183198
<div align="right"><a href="#readme-top">↗️ Back to top</a></div>
184199

185200
## 🚀 Quick Start

chatterbox_srt/__init__.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -4,7 +4,7 @@
44
"""
55

66
# Version info
7-
__version__ = "3.2.9"
7+
__version__ = "3.3.0"
88
__author__ = "Diogod"
99

1010
# Import the new SRT modules

core/__init__.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -4,7 +4,7 @@
44
"""
55

66
# Version info
7-
__version__ = "3.2.9"
7+
__version__ = "3.3.0"
88
__author__ = "Diogod"
99

1010
# Make imports available at package level

nodes.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,5 @@
11
# Version and constants
2-
VERSION = "3.2.9"
2+
VERSION = "3.3.0"
33
IS_DEV = False # Set to False for release builds
44
VERSION_DISPLAY = f"v{VERSION}" + (" (dev)" if IS_DEV else "")
55
SEPARATOR = "=" * 70

pyproject.toml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,7 +1,7 @@
11
[project]
22
name = "chatterbox_srt_voice"
33
description = "ChatterBox SRT Voice TTS Node is a fork of 'ChatteBox Voice' with additional devolpments and full F5-TTS implementation as well. I introduced a SRT node designed to help you synchronize your generated TTS audio with `.srt` subtitle files. Audio wave analyzer will help you find speech segments for f5 speech edit and much more!"
4-
version = "3.2.9"
4+
version = "3.3.0"
55
license = {file = "LICENSE"}
66
dependencies = ["s3tokenizer>=0.1.7", "resemble-perth", "librosa", "scipy", "omegaconf", "accelerate", "transformers==4.46.3", "# Additional dependencies for SRT support and audio processing", "conformer>=0.3.2", "torch", "torchaudio", "numpy", "einops", "phonemizer", "g2p-en", "unidecode", "# Audio processing and timing dependencies", "soundfile", "resampy", "webrtcvad", "# Optional but recommended for better performance", "numba"]
77

0 commit comments

Comments
 (0)