v0.7.0: [br] Removal, "correct_vuv_by_phone" support.
Pre-releasev0.7.0 brings huge improvements to the quality of resulting models. Using a new VUV correction feature of NNSVS and removing the flawed [br] phoneme.
You will need to update your dataset for this version.
-
'correct_vuv_by_phone' allows us to assign a specify the desired VUV value for specific phonemes to prevent VUV errors.
It does not require any changes to your dataset. It will take effect automatically as long as "force_fix_vuv" is true and you use the latest HED file. -
Support for the [br] phoneme is removed in this version due to a DRASTIC reduction in quality and stability for models which utilized it.
It served no practical purpose as NNSVS can handle breaths automatically as part of the already existing [pau] phoneme.
You will need to update your dataset as the phoneme is no longer provided. Remove [br] and replace it with [pau]. ONE [pau] phoneme for a single section of silence.
While we would have gladly kept the [br] phoneme, it's negative effects could not. After much testing it was decided to remove it. Thanks to the individuals who provided their datasets to confirm.