Skip to content

Releases: intunist/nnsvs-english-support

v0.4.0: Cleanup, phoneme changes, and a (good) Dictonary

09 Feb 12:12
309be11

Choose a tag to compare

v0.4.0 overhauls things a little bit and cleans a lot up. This update also breaks compatibility with older versions but is fairly easy to update to the new version!

  • Our custom [j] and [h] phonemes have been reverted back to the standard Arpabet [jh] and [hh] phonemes. This is the one breaking change and should be the last phonetic change unless we add phonemes.
  • the extra/unused phonemes have been removed from the hed files to reduce the amount of vram needed for training.
  • There is now a (GOOD) dictionary included! This dictionary is based on amepd, a modified cmu dict, by Reece H. Dunn.

The dictionary will be a constant work in progress to make it more accurate for singing. We found a lot of weird quirks in it but it's a massive first step in getting a good reference for new users.

v0.3.1: removed [eng] phoneme

27 Jan 02:39
d1a2a0f

Choose a tag to compare

Pre-release

v0.3.1 removes the [eng] phoneme from the list of supported phonemes.
It ended up being useless in all cases as [ih][ng] could (and should) be used instead.

v0.3.0: rrrrrrrr and disposable phonemes

25 Nov 19:31
df907c4

Choose a tag to compare

v0.3.0 adds three new phonemes, [rr], [rx], and [ol]. Adding more flexibility when labeling datasets.

  • [rr] is the trilled r, like in Spanish.
  • [rx] is a fricative r, like in German or French.
  • [ol] is a particularly useful phoneme. It's intended for labeling out-of-language or "junk" sounds.
    ^This allows you to label around sounds you don't want affecting your models.

We are also now working on a shiro model for auto-labeling datasets. But that's a ways off!

Be sure to update the in_dim values in your config!

v0.2.4: voiced release, documentation, and dictionary improvements

20 Oct 15:27
943203f

Choose a tag to compare

v0.2.3 v0.2.4 adds a new phoneme, [axh]. This is similar to the [exh] phoneme but for labeling voiced releases and exhales.

Along with this update:

  • all the documentation was updated for improved clarity.
  • The dictionary has been improved further, almost to the point of being useful.
  • square brackets in the dictionary have been replaced with inequality signs/angled brackets <> for compatibility reasons.

Be sure to change the in_dim value in your config files.

v0.2.2: Updated Dictionary

19 Oct 18:56
53cb983

Choose a tag to compare

Pre-release

v0.2.2: Updated the tables to be more useful. You will still need to train with blank.table but english/english1.table can be used afterwards on the finished model if desired, Note that ENUNU doesn't support multi-syllable words at the time of writing.

Some documentation was also updated to make more sense.

v0.2.1: suffix support

19 Oct 18:04
a6e3b2a

Choose a tag to compare

Pre-release

v0.2.1 Adds support for suffixed phonemes! ...with come caveats.
Note that the additional phonemes in the suffixed hed file increases the amount of vram/ram required for training. So you may wish to omit suffixes you don't need.

additional changes:

  • added [cr] phoneme: for labeling vocal cracks, if desired.
  • in_dim values are different from previous versions. Refer to README.md.
  • [cl] phoneme is now treated as a "toggle" for when the closure state of a consonant in a cluster isn't the vocalist's natural default. Still unlikely to use.

v0.2.0

08 Oct 14:50
388787a

Choose a tag to compare

v0.2.0 Pre-release
Pre-release

v0.1.18 -> v0.2.0: This is not compatible with previous versions! Please update your dataset!

Important changes:

  • [jh] was changed to [j]
  • [xx] was changed to [x] (for compatibility reasons.)

v0.1.18

30 Sep 07:48
6f6bae6

Choose a tag to compare

v0.1.18 Pre-release
Pre-release

Added "blank.table" for phoneme training.

v0.1.17

29 Sep 07:10
01c70d2

Choose a tag to compare

v0.1.17 Pre-release
Pre-release

The hed files were reformatted to use space instead of tab as whitespace. They work correctly now.
Other than some general clean-up, nothing else has changed.

v0.1.16

28 Sep 12:44
373e7a5

Choose a tag to compare

v0.1.16 Pre-release
Pre-release

Initial github version, previous versions were loosely thrown onto the NNSVS discord server.

Still untested but should work as is.