Final graduation project submitted to the faculty of the Electronics and Computing Engineering bachelor's program at the Polytechnic School of the Federal University of Rio de Janeiro, as one of the requirements for obtaining the degree of Electronics and Computing Engineer.

Guterson/Singing-Voice-Synthesis-in-Brazilian-Portuguese-by-Concatenation-of-Acoustic-Units

Singing Voice Synthesis in Brazilian Portuguese by Concatenation of Acoustic Units

About

The project consisted of an in-depth study of the process of selecting acoustic units (syllable-level utterances) and the proposal of a recording list, also known as a reclist, especially tailored to Brazilian Portuguese phonotactics and phonemic content. Using the proposed recording list, a voicebank was then recorded with OREMO in an acoustically treated room with high-quality recording equipment. Finally, a procedural Python 3 script was written to synthesize singing voice from the recorded voicebank. The whole process, from the concept and design of the recording list to the parameters used in the synthesis stage, was heavily based on the UTAU software and its community culture of self-recorded voicebanks.
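To make the idea of concatenating acoustic units concrete, here is a minimal sketch of joining units with a linear crossfade. It is not the script from this repository: the function name `crossfade_concat`, the plain-list sample representation, and the linear fade shape are all assumptions chosen for illustration.

```python
def crossfade_concat(units, overlap):
    """Join lists of float samples, linearly crossfading at each joint.

    Hypothetical helper for illustration only; `overlap` is the number of
    samples shared between the end of one unit and the start of the next.
    """
    if not units:
        return []
    out = list(units[0])
    for unit in units[1:]:
        # Never overlap more samples than either side actually has.
        n = min(overlap, len(out), len(unit))
        start = len(out) - n
        for i in range(n):
            w = (i + 1) / (n + 1)  # fade-in weight of the incoming unit
            out[start + i] = (1 - w) * out[start + i] + w * unit[i]
        # Append the part of the incoming unit that lies after the overlap.
        out.extend(unit[n:])
    return out
```

Each joint shortens the output by the overlap length, so two 4-sample units joined with an overlap of 2 yield 6 samples. In practice the samples would come from the voicebank's audio files, and the overlap would be chosen per unit.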

Contents

  • A Python 3 script used to synthesize a specific audio example from the recorded voicebank.

  • A .pdf file (written in Brazilian Portuguese) describing the whole process and its reasoning, from recording a voicebank to synthesizing audio with a Python script. The original monograph was published here.

  • Some samples of the monopitch voicebank that I recorded. If you want access to the full voicebank, please contact me.

Notes

  • This project does not use machine learning techniques directly. It relies only on "classical" signal processing techniques and Python packages that don't require a training stage.

  • To make full use of the voicebank, it's necessary to create time annotations in ".oto" files, a process known as otoing.
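As an illustration of what these time annotations look like, the sketch below parses a single annotation line. It assumes the layout commonly used by UTAU voicebanks, `file.wav=alias,offset,consonant,cutoff,preutterance,overlap`, with times in milliseconds. The names `OtoEntry` and `parse_oto_line`, and the sample values, are hypothetical and not taken from this repository.

```python
from typing import NamedTuple


class OtoEntry(NamedTuple):
    """One annotation line; all times are in milliseconds (assumed layout)."""
    filename: str
    alias: str
    offset: float
    consonant: float
    cutoff: float
    preutterance: float
    overlap: float


def parse_oto_line(line):
    """Parse 'file.wav=alias,offset,consonant,cutoff,preutterance,overlap'."""
    filename, sep, params = line.strip().partition("=")
    fields = params.split(",")
    if not sep or len(fields) != 6:
        raise ValueError(f"unexpected annotation line: {line!r}")
    alias = fields[0]
    # Treat empty numeric fields as 0.0 (an assumption of this sketch).
    numbers = [float(x) if x.strip() else 0.0 for x in fields[1:]]
    return OtoEntry(filename, alias, *numbers)


# Made-up example values, not taken from the recorded voicebank.
entry = parse_oto_line("ka.wav=ka,50,120,-300,80,20")
```

Here `entry.preutterance` is `80.0` and `entry.cutoff` is `-300.0`.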

Other Works

For similar works, please visit my portfolio.
