“TTS-project-synthesis”版本间的差异

来自cslt Wiki
跳转至: 导航搜索
第80行: 第80行:
  
 
==Multi-speaker Multi-emotion==
 
==Multi-speaker Multi-emotion==
*Synthesis text:'据了解,天津市今年粮食种植面积达六百万亩,预计全年粮食总产量可达二十公斤,比去年提高了'
 
  
 
*Female
 
*Female
:* female-angry [http://zhangzy.cslt.org/categories/tts/sample-wav/mimic-wangd-front-end/emotion/mix/female01.angry_1_amdurTanh_acTanh_mlpg1_postfilter1.world.wav01.wav]
+
:* angry [http://zhangzy.cslt.org/categories/tts/sample-wav/mimic-wangd-front-end/multi-speaker_multi-emotion/female01_angry_final_5_amdurTanh_acTanh_mlpg1_postfilter1.world.wav01.wav]
:* female-happy [http://zhangzy.cslt.org/categories/tts/sample-wav/mimic-wangd-front-end/emotion/mix/female01.happy_1_amdurTanh_acTanh_mlpg1_postfilter1.world.wav01.wav]
+
:* happy [http://zhangzy.cslt.org/categories/tts/sample-wav/mimic-wangd-front-end/multi-speaker_multi-emotion/female01_happy_final_5_amdurTanh_acTanh_mlpg1_postfilter1.world.wav01.wav]
:* female-neutral [http://zhangzy.cslt.org/categories/tts/sample-wav/mimic-wangd-front-end/emotion/mix/female01.neutral_1_amdurTanh_acTanh_mlpg1_postfilter1.world.wav01.wav]
+
:* neutral [http://zhangzy.cslt.org/categories/tts/sample-wav/mimic-wangd-front-end/multi-speaker_multi-emotion/female01_neutral_final_5_amdurTanh_acTanh_mlpg1_postfilter1.world.wav01.wav]
:* female-sorrow [http://zhangzy.cslt.org/categories/tts/sample-wav/mimic-wangd-front-end/emotion/mix/female01.sorrow_1_amdurTanh_acTanh_mlpg1_postfilter1.world.wav01.wav]
+
:* sorrow [http://zhangzy.cslt.org/categories/tts/sample-wav/mimic-wangd-front-end/multi-speaker_multi-emotion/female01_sorrow_final_5_amdurTanh_acTanh_mlpg1_postfilter1.world.wav01.wav]
  
 
*Male
 
*Male
:* male-angry [http://zhangzy.cslt.org/categories/tts/sample-wav/mimic-wangd-front-end/emotion/mix/male01.angry_1_amdurTanh_acTanh_mlpg1_postfilter1.world.wav01.wav]
+
:* angry [http://zhangzy.cslt.org/categories/tts/sample-wav/mimic-wangd-front-end/multi-speaker_multi-emotion/male01_angry_final_5_amdurTanh_acTanh_mlpg1_postfilter1.world.wav01.wav]
:* male-happy [http://zhangzy.cslt.org/categories/tts/sample-wav/mimic-wangd-front-end/emotion/mix/male01.happy_1_amdurTanh_acTanh_mlpg1_postfilter1.world.wav01.wav]
+
:* happy [http://zhangzy.cslt.org/categories/tts/sample-wav/mimic-wangd-front-end/multi-speaker_multi-emotion/male01_happy_final_5_amdurTanh_acTanh_mlpg1_postfilter1.world.wav01.wav]
:* male-neutral [http://zhangzy.cslt.org/categories/tts/sample-wav/mimic-wangd-front-end/emotion/mix/male01.neutral_1_amdurTanh_acTanh_mlpg1_postfilter1.world.wav01.wav]
+
:* neutral [http://zhangzy.cslt.org/categories/tts/sample-wav/mimic-wangd-front-end/multi-speaker_multi-emotion/male01_neutral_final_5_amdurTanh_acTanh_mlpg1_postfilter1.world.wav01.wav]
:* male-sorrow [http://zhangzy.cslt.org/categories/tts/sample-wav/mimic-wangd-front-end/emotion/mix/male01.sorrow_1_amdurTanh_acTanh_mlpg1_postfilter1.world.wav01.wav]
+
:* sorrow [http://zhangzy.cslt.org/categories/tts/sample-wav/mimic-wangd-front-end/multi-speaker_multi-emotion/male01_sorrow_final_5_amdurTanh_acTanh_mlpg1_postfilter1.world.wav01.wav]

2017年12月7日 (四) 06:08的版本

Project name

Text To Speech

Project members

Dong Wang, Zhiyong Zhang

Introduction

Text To Speech

Sample waves

Synthesis text:好雨知时节,当春乃发声,随风潜入夜,润物细无声

Mono-speaker

Multi-speaker

Without Speaker-vector

  • Female & Male[4]
  • Female & Child[5]
  • Male & Child[6]


With Speaker-vector

  • Specific person===
  • Interpolate of different person
  • Female & Male with different ratio
  • (1) 0.0:1.0[9]

Mono-speaker Emotion

  • Specific emotion
  • Neutral emotion [20]
  • Happy emotion [21]
  • Sorrow emotion [22]
  • Angry emotion [23]
  • Interpolation emotion
  • Angry & neutral with different ratio

Multi-speaker Multi-emotion

  • Female
  • Male