|
1 | | -# Aurelian Silva |
| 1 | +## Aurelian Silva for DiffSinger |
2 | 2 |
|
3 | | - |
| 3 | +<p align="center"> |
| 4 | +Galloping onto the runway.<br><br> |
| 5 | +<img src="Image.png" width="250" title="Iconic!"> |
| 6 | +</p> |
4 | 7 |
|
5 | | -Aurelian is a proof-of-concept vocal, with only 2 minutes and 29 seconds of data in it's dataset. |
| 8 | +*** |
| 9 | +## Summary |
| 10 | +Aurelian Silva for DiffSinger is an AI Singer utilizing the DiffSinger engine through OpenUTAU! He has a youthful, masculine, voice with a British accent and can sing in English, Japanese, Chinese, Korean, French, Spanish and Thai! (Plus many more through phoneme manipulation!) |
6 | 11 |
|
7 | | -## General information |
| 12 | +*** |
| 13 | + |
| 14 | +## Character information |
8 | 15 | - Gender: Male |
9 | | -- Height: 1.7m |
10 | | -- Weight: 600 kg |
| 16 | +- Height: 2m |
| 17 | +- Weight: 500 kg |
11 | 18 | - Age: 25 |
12 | 19 | - Optimal Range: F2 - A5 |
13 | 20 |
|
14 | | -## Diffsinger Voicebank |
15 | | -A young sounding masculine vocal, with a heavy British accent. |
| 21 | +*** |
| 22 | + |
| 23 | +## Notes |
| 24 | +This voicebank was trained with the "Multi-Dict" branch of DiffSinger. This is supported by the current beta of OpenUTAU. The following is a list of language tags used by "Aurelian Silva for DiffSinger": |
| 25 | + |
| 26 | +| Language | Tag | |
| 27 | +| :----- | ---: | |
| 28 | +| English | en/ | |
| 29 | +| Japanese | ja/ | |
| 30 | +| Chinese | zh/ | |
| 31 | +| Korean | ko/ | |
| 32 | +| French | fr/ | |
| 33 | +| Spanish | es/ | |
| 34 | +| Thai | th/ | |
| 35 | + |
| 36 | +As well as the language tags there are a few extra phonemes available to use across all languages: |
| 37 | + |
| 38 | +| Phoneme | Name | Usage | |
| 39 | +| :----- | --- | ---: | |
| 40 | +| SP | Silence | Denotes silent pauses | |
| 41 | +| AP | Breaths | Denotes pauses with an intake of breath | |
| 42 | +| cl | Plosive Modifier | This can be used after consonants to reign in their pronunciation a bit. | |
| 43 | +| q | Glottal Stop | uh-oh [ah q ow] | |
| 44 | +| vf | Vocal Fry | This can be added before vowels, and some consonants, paired with a low pitch curve/point, to add vocal fry | |
| 45 | + |
| 46 | +The following phonemes are extras for the English language natively supported by the upcoming "DIFFS-EN+" phonemizer. |
| 47 | + |
| 48 | +| Phoneme | Type | Usage | |
| 49 | +| :----- | --- | ---: | |
| 50 | +| ax | Vowel | again [**ax** g eh n] | |
| 51 | +| dr | Consonant | dream [**dr** iy m] | |
| 52 | +| tr | Consonant | train [**tr** ey n] | |
| 53 | + |
| 54 | +The Thai language works through the regular "DIFFS" phonemizer as, as far as I'm aware, there isn't one specifically for Thai yet and French requires the Millefeuille DIFFS-FR phonemizer found on their website (linked below!). |
| 55 | + |
| 56 | +*** |
16 | 57 |
|
17 | | -**REQUIRES THE BETA BRANCH OF OpenUTAU** |
| 58 | +## Credits |
| 59 | +This voicebank was trained alongside the following corpora: |
| 60 | +- Millefeuille for French support (https://utaufrance.com/millefeuille-diffsinger/) |
| 61 | +- Namine Criss Spanish Dataset by CrissZ3R0VZ for Spanish support |
| 62 | +- PJS Corpus for Japanese Support (https://sites.google.com/site/shinnosuketakamichi/research-topics/pjs_corpus) |
| 63 | + - Labels by UtaUtaUtau, edited by tigermeat |
| 64 | +- Thai datasets for Thai Support (https://thaids.printmov.com/) |
| 65 | +- Various datasets by TigerMeat for Chinese and Korean support |
| 66 | +- Project AI❤dol Public English Dataset (https://github.com/lottev1991/Project-AIdol-Public-English-Dataset)<br> |
18 | 67 |
|
19 | | -- Type: Diffsinger |
20 | | -- Languages: en, ja, fr, es, th |
| 68 | +This voicebank also utilises the "tgm_hifigan v107" vocoder, trained by TigerMeat, as it contains all of Aurelian's current data in the dataset used to train it. This allows for better replication of Aurelian's voice. |
21 | 69 |
|
22 | | -## Videos |
23 | | -[](https://www.youtube.com/watch?v=-wt2q_jmIz4) |
|
0 commit comments