“TTS-project-synthesis”版本间的差异
来自cslt Wiki
第28行: | 第28行: | ||
===With Speaker-vector=== | ===With Speaker-vector=== | ||
− | |||
*Specific person=== | *Specific person=== | ||
:*Female[http://zhangzy.cslt.org/categories/tts/sample-wav/mimic-wangd-front-end/multi-speakers/mix/all.dvector40/female01.dvec40_5_amdurTanh_acTanh_mlpg1_postfilter1.world.wav01.wav] | :*Female[http://zhangzy.cslt.org/categories/tts/sample-wav/mimic-wangd-front-end/multi-speakers/mix/all.dvector40/female01.dvec40_5_amdurTanh_acTanh_mlpg1_postfilter1.world.wav01.wav] | ||
第34行: | 第33行: | ||
:*Male[http://zhangzy.cslt.org/categories/tts/sample-wav/mimic-wangd-front-end/multi-speakers/mix/all.dvector40/male01.dvec40_5_amdurTanh_acTanh_mlpg1_postfilter1.world.wav01.wav] | :*Male[http://zhangzy.cslt.org/categories/tts/sample-wav/mimic-wangd-front-end/multi-speakers/mix/all.dvector40/male01.dvec40_5_amdurTanh_acTanh_mlpg1_postfilter1.world.wav01.wav] | ||
− | *Interpolate | + | *Interpolate of different person |
:* Female & Male with different ratio | :* Female & Male with different ratio | ||
2017年12月7日 (四) 06:04的版本
目录
Project name
Text To Speech
Project members
Dong Wang, Zhiyong Zhang
Introduction
Text To Speech
Sample waves
Synthesis text:好雨知时节,当春乃发声,随风潜入夜,润物细无声
Mono-speaker
- Female[1]
- Male[2]
- Child[3]
Multi-speaker
Without Speaker-vector
- Female & Male[4]
- Female & Child[5]
- Male & Child[6]
With Speaker-vector
- Specific person===
- Female[7]
- Male[8]
- Interpolate of different person
- Female & Male with different ratio
- (1) 0.0:1.0[9]
- (2) 0.1:0.9[10]
- (3) 0.2:0.8[11]
- (4) 0.3:0.7[12]
- (5) 0.4:0.6[13]
- (6) 0.5:0.5[14]
- (7) 0.6:0.4[15]
- (8) 0.7:0.3[16]
- (9) 0.8:0.2[17]
- (10) 0.9:0.1[18]
- (11) 1.0:0.0[19]
Mono-speaker Emotion
- Specific emotion
- Interpolation emotion
- Angry & neutral with different ratio
Multi-speaker Multi-emotion
- Synthesis text:'据了解,天津市今年粮食种植面积达六百万亩,预计全年粮食总产量可达二十公斤,比去年提高了'
- Female
- Male