|
|
(58 intermediate revisions by 4 users not shown) |
Line 1: |
Line 1: |
− | =Task To Do=
| |
− | * 1, RNN speech recognition (Tied-context-dependent-state and End-to-End)
| |
− | * 2, Real environment noise cancellation(DNN-DAE/CNN-DAE/RNN-DAE: echo or reverberation)
| |
− | * 3, Integrate the class information to HCLG fst for speech recognition
| |
− | * 4, Multi-Mode features based VAD
| |
− | * 5, DNN based Language identification and Speaker identification
| |
− | * 6, Distant speech recognition (Reverberation, Multi-microphones)
| |
− | * 7, Voice conversion
| |
− | * 8, Unbound activation function(Rectifier/Maxout/Pnorm) go-through searching method.
| |
− | * 9, Sparse DNN
| |
− | * 10, Neural network visualization
| |
− | * 11, DAE+dropout
| |
− | :* DNN-DAE -- Xiangyu Zeng
| |
− | :* CNN-DAE -- Yiye Lin
| |
| | | |
| + | =Tasks at hand= |
| | | |
− | =Technical Report To Write= | + | ==Speech Recognition== |
− | * 1, DNN-DAE based noise cancellation -- Xiangyu Zeng / Mengyuan Zhao / Zhiyong Zhang
| + | |
− | * 2, Speech Rate DNN speech recognition --Shi Yin
| + | |
− | * 3, CNN+fbank feature combination --Mian Wang /Yiye Lin /Mengyuan Zhao /Shi Yin
| + | |
− | * 4, Uyghur low-resource acoustic model enhancement -- Shi Yin / Mengyuan Zhao / Zhiyong Zhang
| + | |
− | * 5, Uyghur 20h database release --Kaer /Shi Yin
| + | |
| | | |
− | =Paper to Write= | + | ===joint learning=== |
− | * 1, DNN-DAE Xiangyu Zeng/ Mengyuan Zhao Conference: ChinaSIP-2015 | + | * Hang Luo, Zhiyuan Tang |
− | * 2, RNN-DAE Chao Liu / Zhiyong Zhang Conference: Interspeech-2015 | + | |
| + | ===visualization=== |
| + | * Ying Shi, Zhiyuan Tang |
| + | |
| + | ==Speaker Recognition== |
| + | *Lantian Li, Yixiang Chen |
| + | |
| + | |
| + | =Tasks Done= |
| + | |
| + | =Technical Reports to write= |
| + | |
| + | =Papers to write= |
| + | |
| + | =Patents to write= |
| + | |
| + | =Patents done= |
| + | |
| + | =Projects= |
| + | |
| + | |
| + | ------------------------------ |
| + | [[task previous]] |