Commit ed48491

Update README.md
1 parent 5538778 commit ed48491

1 file changed

Lines changed: 59 additions & 13 deletions

README.md

@@ -1,23 +1,69 @@
1-
# Aurelian Silva
1+
## Aurelian Silva for DiffSinger
22

3-
![Avatar](/Image.png)
3+
<p align="center">
4+
Galloping onto the runway.<br><br>
5+
<img src="Image.png" width="250" title="Iconic!">
6+
</p>
47

5-
Aurelian is a proof-of-concept vocal, with only 2 minutes and 29 seconds of data in its dataset.
8+
***
9+
## Summary
10+
Aurelian Silva for DiffSinger is an AI singer that uses the DiffSinger engine through OpenUTAU! He has a youthful, masculine voice with a British accent and can sing in English, Japanese, Chinese, Korean, French, Spanish, and Thai! (Plus many more through phoneme manipulation!)
611

7-
## General information
12+
***
13+
14+
## Character information
815
- Gender: Male
9-
- Height: 1.7m
10-
- Weight: 600 kg
16+
- Height: 2m
17+
- Weight: 500 kg
1118
- Age: 25
1219
- Optimal Range: F2 - A5
1320

14-
## Diffsinger Voicebank
15-
A young-sounding masculine vocal with a heavy British accent.
21+
***
22+
23+
## Notes
24+
This voicebank was trained with the "Multi-Dict" branch of DiffSinger. This is supported by the current beta of OpenUTAU. The following is a list of language tags used by "Aurelian Silva for DiffSinger":
25+
26+
| Language | Tag |
27+
| :----- | ---: |
28+
| English | en/ |
29+
| Japanese | ja/ |
30+
| Chinese | zh/ |
31+
| Korean | ko/ |
32+
| French | fr/ |
33+
| Spanish | es/ |
34+
| Thai | th/ |
35+
36+
In addition to the language tags, a few extra phonemes are available across all languages:
37+
38+
| Phoneme | Name | Usage |
39+
| :----- | --- | ---: |
40+
| SP | Silence | Denotes silent pauses |
41+
| AP | Breaths | Denotes pauses with an intake of breath |
42+
| cl | Plosive Modifier | This can be used after consonants to rein in their pronunciation a bit. |
43+
| q | Glottal Stop | uh-oh [ah q ow] |
44+
| vf | Vocal Fry | This can be added before vowels, and some consonants, paired with a low pitch curve/point, to add vocal fry |
45+
46+
The following phonemes are extras for English, natively supported by the upcoming "DIFFS-EN+" phonemizer:
47+
48+
| Phoneme | Type | Usage |
49+
| :----- | --- | ---: |
50+
| ax | Vowel | again [**ax** g eh n] |
51+
| dr | Consonant | dream [**dr** iy m] |
52+
| tr | Consonant | train [**tr** ey n] |
53+
54+
Thai works through the regular "DIFFS" phonemizer because, as far as I'm aware, there isn't one specifically for Thai yet. French requires the Millefeuille DIFFS-FR phonemizer, found on their website (linked below!).
55+
56+
***
1657

17-
**REQUIRES THE BETA BRANCH OF OpenUTAU**
58+
## Credits
59+
This voicebank was trained alongside the following corpora:
60+
- Millefeuille for French support (https://utaufrance.com/millefeuille-diffsinger/)
61+
- Namine Criss Spanish Dataset by CrissZ3R0VZ for Spanish support
62+
- PJS Corpus for Japanese support (https://sites.google.com/site/shinnosuketakamichi/research-topics/pjs_corpus)
63+
- Labels by UtaUtaUtau, edited by TigerMeat
64+
- Thai datasets for Thai support (https://thaids.printmov.com/)
65+
- Various datasets by TigerMeat for Chinese and Korean support
66+
- Project AI❤dol Public English Dataset (https://github.com/lottev1991/Project-AIdol-Public-English-Dataset)<br>
1867

19-
- Type: Diffsinger
20-
- Languages: en, ja, fr, es, th
68+
This voicebank also utilises the "tgm_hifigan v107" vocoder, trained by TigerMeat. The dataset used to train that vocoder contains all of Aurelian's current data, which allows for better replication of Aurelian's voice.
2169

22-
## Videos
23-
[![Watch the video](https://i9.ytimg.com/vi_webp/-wt2q_jmIz4/maxresdefault.webp?v=66e97ccc&sqp=CNz8h7gG&rs=AOn4CLCaaxH4q3ifwrWMWcMFxsFVyqcbmg)](https://www.youtube.com/watch?v=-wt2q_jmIz4)
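As a rough illustration of the language tags and shared phonemes listed in the new README's Notes section, the Python sketch below prefixes language-specific phonemes with a tag and leaves the shared ones untouched. The names `LANGUAGE_TAGS`, `SHARED_PHONEMES`, and `tag_phonemes` are hypothetical, and the idea that a tag is simply prepended to each phoneme (for example `en/ah`) is an assumption, not something taken from the voicebank or from OpenUTAU. Check the phonemizer's actual output before relying on it.

```python
# Language tags copied from the README's "Language | Tag" table.
LANGUAGE_TAGS = {
    "English": "en/",
    "Japanese": "ja/",
    "Chinese": "zh/",
    "Korean": "ko/",
    "French": "fr/",
    "Spanish": "es/",
    "Thai": "th/",
}

# Phonemes the README lists as available across all languages.
SHARED_PHONEMES = {"SP", "AP", "cl", "q", "vf"}


def tag_phonemes(phonemes, language):
    """Prefix language-specific phonemes with the language tag.

    Assumption (not confirmed by the README): the tag is prepended
    directly to the phoneme, and shared phonemes stay untagged.
    """
    tag = LANGUAGE_TAGS[language]
    return [p if p in SHARED_PHONEMES else tag + p for p in phonemes]


if __name__ == "__main__":
    # "uh-oh" from the README's glottal stop example: [ah q ow]
    print(tag_phonemes(["ah", "q", "ow"], "English"))
```

Keeping the shared phonemes in a separate set means a new language only needs one more entry in `LANGUAGE_TAGS`.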
