|
| 1 | +CHATTERBOX CHARACTER SWITCHING BUG TEST CASES |
| 2 | +============================================ |
| 3 | + |
| 4 | +Test Case 1 fails. Log output: |
| 5 | + |
| 6 | +📦 Loading local ChatterBox models from: J:\stablediffusion1111s2\Data\Packages\ComfyUIPy129\ComfyUI\models\chatterbox |
| 7 | +input frame rate=25 |
| 8 | +loaded PerthNet (Implicit) at step 250,000 |
| 9 | +✅ Successfully loaded all local ChatterBox models |
| 10 | +🎭 ChatterBox: Character switching mode - found characters: narrator, female_01, male_01 |
| 11 | +🔄 Using main voice for character 'narrator' (not found in voice folders) |
| 12 | +🎭 Using character voice for 'female_01' |
| 13 | +🎭 Using character voice for 'male_01' |
| 14 | +🎤 Generating ChatterBox segment 1/6 chunk 1/1 for 'narrator'... |
| 15 | +Sampling: 0%| | 0/1000 [00:00<?, ?it/s]We detected that you are passing `past_key_values` as a tuple of tuples. This is deprecated and will be removed in v4.47. Please convert your cache or use an appropriate `Cache` class (https://huggingface.co/docs/transformers/kv_cache#legacy-cache-format) |
| 16 | +Sampling: 5%|███▋ | 52/1000 [00:01<00:32, 28.77it/s] |
| 17 | +🎤 Generating ChatterBox segment 2/6 chunk 1/1 for 'female_01'... |
| 18 | +Sampling: 2%|█▌ | 23/1000 [00:00<00:33, 28.86it/s] |
| 19 | +🎤 Generating ChatterBox segment 3/6 chunk 1/1 for 'male_01'... |
| 20 | +Reference mel length is not equal to 2 * reference token length. |
| 21 | + |
| 22 | +Sampling: 3%|█▊ | 26/1000 [00:00<00:33, 28.68it/s] |
| 23 | +🎤 Generating ChatterBox segment 4/6 chunk 1/1 for 'narrator'... |
| 24 | +Sampling: 2%|█▌ | 22/1000 [00:00<00:33, 29.33it/s] |
| 25 | +🎤 Generating ChatterBox segment 5/6 chunk 1/1 for 'female_01'... |
| 26 | +Sampling: 2%|█▎ | 18/1000 [00:00<00:33, 28.99it/s] |
| 27 | +🎤 Generating ChatterBox segment 6/6 chunk 1/1 for 'narrator'... |
| 28 | +Sampling: 4%|███ | 43/1000 [00:01<00:32, 29.13it/s] |
| 29 | +C:\actions-runner\_work\pytorch\pytorch\pytorch\aten\src\ATen\native\cuda\Indexing.cu:1553: block: [40,0,0], thread: [0,0,0] Assertion `srcIndex < srcSelectDimSize` failed. |
| 30 | +C:\actions-runner\_work\pytorch\pytorch\pytorch\aten\src\ATen\native\cuda\Indexing.cu:1553: block: [40,0,0], thread: [1,0,0] Assertion `srcIndex < srcSelectDimSize` failed. |
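A log like the one above can be summarized with a small script. The following is a minimal sketch, not part of ChatterBox: the function name `summarize_log` is hypothetical, and the line patterns are taken only from the log shown here, so they may need adjusting for other versions. It records which segment was being generated when each "Reference mel length" warning appeared, the last segment started, and whether a device-side assertion was printed.

```python
import re

# Pattern copied from the "Generating ChatterBox segment N/M ..." lines in the log above.
SEGMENT = re.compile(
    r"Generating ChatterBox segment (\d+)/(\d+) chunk \d+/\d+ for '([^']+)'"
)


def summarize_log(log_text):
    """Summarize a run log (hypothetical helper, not a ChatterBox API).

    Returns a dict with:
      last_segment  - (number, total, character) of the last segment started, or None
      mel_warnings  - segments that were active when a "Reference mel length" warning appeared
      crashed       - True if an "Assertion ... failed" line was seen
    """
    last = None
    warnings = []
    crashed = False
    for line in log_text.splitlines():
        match = SEGMENT.search(line)
        if match:
            last = (int(match.group(1)), int(match.group(2)), match.group(3))
        elif "Reference mel length" in line:
            warnings.append(last)
        elif "Assertion" in line and "failed" in line:
            crashed = True
    return {"last_segment": last, "mel_warnings": warnings, "crashed": crashed}
```

For the log above this reports the warning against segment 3 ('male_01') and segment 6 ('narrator') as the last one started. Because CUDA reports kernel errors asynchronously, the last segment started is only a hint about where the failure occurred. The `srcIndex < srcSelectDimSize` assertion typically indicates an out-of-range index in an index-select or embedding lookup.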
| 31 | + |
| 32 | + |
| 33 | + |
| 34 | +Test Case 2: Question Mark Focus (Target: punctuation) |
| 35 | +------------------------------------------------------ |
| 36 | +This is a test. |
| 37 | +[Alice] Really? |
| 38 | +[Bob] Why not? |
| 39 | +What do you think? |
| 40 | +[Alice] Maybe? |
| 41 | +Final words. |
| 42 | + |
| 43 | +Test Case 3: Very Short Segments (Target: minimal text) |
| 44 | +------------------------------------------------------- |
| 45 | +Start. |
| 46 | +[Alice] Ok. |
| 47 | +[Bob] No. |
| 48 | +Yes? |
| 49 | +[Alice] Go. |
| 50 | +End. |
| 51 | + |
| 52 | +Test Case 4: Mixed Long/Short (Target: length variation) |
| 53 | +------------------------------------------------------- |
| 54 | +This is a longer introduction that should work fine without issues. |
| 55 | +[Alice] Short. |
| 56 | +[Bob] This is a much longer segment that might work better than short ones. |
| 57 | +Brief? |
| 58 | +[Alice] Another very long segment that contains multiple sentences and should be processed without the same issues. |
| 59 | +Done. |
| 60 | + |
| 61 | +Test Case 5: Exact Position Test (Target: 5th segment) |
| 62 | +------------------------------------------------------ |
| 63 | +Segment one here. |
| 64 | +[Alice] Segment two here. |
| 65 | +[Bob] Segment three here. |
| 66 | +[Alice] Segment four here. |
| 67 | +This is segment five. |
| 68 | +[Bob] Segment six here. |
| 69 | +Final segment. |
| 70 | + |
| 71 | +Test Case 6: Character Switching Pattern (Target: same pattern as bug) |
| 72 | +---------------------------------------------------------------------- |
| 73 | +Opening statement. |
| 74 | +[crestfallen_original] Character line. |
| 75 | +[Girl] Another character. |
| 76 | +[crestfallen_original] Second time. |
| 77 | +Back to narrator. |
| 78 | +[Bob] Different character. |
| 79 | +Closing statement. |
| 80 | + |
| 81 | +Test Case 7: Special Characters & Punctuation |
| 82 | +--------------------------------------------- |
| 83 | +Hello there! |
| 84 | +[Alice] What's this? |
| 85 | +[Bob] It's... complicated. |
| 86 | +Really?! |
| 87 | +[Alice] Yes—exactly that. |
| 88 | +The end. |
| 89 | + |
| 90 | +Test Case 8: Empty/Whitespace Lines |
| 91 | +----------------------------------- |
| 92 | +First line. |
| 93 | +[Alice] Second line. |
| 94 | + |
| 95 | +[Bob] After empty line. |
| 96 | +Another gap coming. |
| 97 | + |
| 98 | +Final line. |
| 99 | + |
| 100 | +Test Case 9: Single Words (Target: minimal content) |
| 101 | +--------------------------------------------------- |
| 102 | +Beginning. |
| 103 | +[Alice] Word. |
| 104 | +[Bob] Another. |
| 105 | +Question? |
| 106 | +[Alice] Answer. |
| 107 | +Conclusion. |
| 108 | + |
| 109 | +Test Case 10: Exact Recreation (Target: original crash) |
| 110 | +------------------------------------------------------- |
| 111 | +Hello! This is the first subtitle. I'll make it long on purpose. |
| 112 | +[crestfallen_original] This is Long?! |
| 113 | + |
| 114 | +[Girl]This is the second [crestfallen_original] subtitle with precise timing. |
| 115 | +Back to me? |
| 116 | + |
| 117 | +[Bob] The audio will match these exact timings. |
| 118 | + |
| 119 | +Back to me again? This looks like a meeees... |
| 120 | + |
| 121 | +INSTRUCTIONS: |
| 122 | +- Run each test case separately |
| 123 | +- Note the number of the segment that crashes (if any) |
| 124 | +- Record any "Reference mel length" warnings and the segment they appear in |
| 125 | +- Use the same characters as the original crash: crestfallen_original, Girl (maps to female_01), Bob (maps to male_01) |
| 126 | +- Look for patterns in crashes (position, text length, punctuation, etc.) |
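To note which segment number crashes, it helps to number the segments of a test case before running it. The sketch below is an illustration under stated assumptions, not ChatterBox's actual parser: it assumes a `[Name]` tag switches the speaker for the text that follows it, that every new line starts as the narrator, and that blank lines are skipped. The name `split_segments` is hypothetical, and the numbering ChatterBox itself produces may differ.

```python
import re

# A character tag such as [Alice] or [crestfallen_original].
TAG = re.compile(r"\[([^\]]+)\]")


def split_segments(script, default="narrator"):
    """Split a test case into numbered (number, character, text) segments.

    Assumptions (not confirmed by the source): untagged lines belong to
    `default`, and a tag in the middle of a line starts a new segment.
    """
    segments = []
    for line in script.splitlines():
        if not line.strip():
            continue  # skip empty or whitespace-only lines
        speaker = default
        pos = 0
        for match in TAG.finditer(line):
            text = line[pos:match.start()].strip()
            if text:
                segments.append((speaker, text))
            speaker = match.group(1).strip()
            pos = match.end()
        tail = line[pos:].strip()
        if tail:
            segments.append((speaker, tail))
    return [(i, who, text) for i, (who, text) in enumerate(segments, start=1)]


if __name__ == "__main__":
    case_3 = "Start.\n[Alice] Ok.\n[Bob] No.\nYes?\n[Alice] Go.\nEnd."
    for number, who, text in split_segments(case_3):
        print(f"{number}: {who}: {len(text)} chars: {text}")
```

Printing the text length next to each segment number makes it easier to compare crashes against position, length, and punctuation.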