ASR:2015-04-27

From cslt Wiki

Speech Processing

AM development

Environment

  • Update the wiki environment information --done

RNN AM

Mic-Array

  • Change the prediction from fbank to spectrum features
  • Investigate the alpha parameter in the time domain and the frequency domain
  • ALPHA>=0, using data generated by reverber toolkit
  • consider theta
  • Make spectrum features with Kaldi
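
As an illustration of what "spectrum features" means in contrast to fbank, here is a minimal NumPy sketch of per-frame log power spectra. It is not Kaldi's implementation; the function name, the 25 ms / 10 ms framing, the Hamming window, and the 512-point FFT are all assumptions chosen for the example.

```python
import numpy as np

def log_power_spectrum(signal, sample_rate=16000, frame_ms=25.0,
                       shift_ms=10.0, n_fft=512):
    """Return an (n_frames, n_fft // 2 + 1) array of log power spectra.

    Hypothetical helper: framing and window settings are assumed,
    not taken from the notes above.
    """
    frame_len = int(sample_rate * frame_ms / 1000)
    shift = int(sample_rate * shift_ms / 1000)
    if len(signal) < frame_len:
        return np.empty((0, n_fft // 2 + 1))

    # Number of complete frames that fit in the signal.
    n_frames = 1 + (len(signal) - frame_len) // shift
    window = np.hamming(frame_len)

    # Slice the signal into overlapping, windowed frames.
    frames = np.stack([
        signal[i * shift:i * shift + frame_len] * window
        for i in range(n_frames)
    ])

    # Unlike fbank, no mel filterbank is applied: every FFT bin is kept.
    spectrum = np.fft.rfft(frames, n=n_fft, axis=1)
    power = np.abs(spectrum) ** 2
    return np.log(power + 1e-10)  # small floor avoids log(0)
```

Skipping the mel filterbank keeps the full frequency resolution, at the cost of a much larger feature dimension per frame (257 bins here, against the few dozen that are typical for fbank).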

RNN-DAE (Deep based Auto-Encoder-RNN)

Speaker ID

Ivector&Dvector based ASR

Dark knowledge

  • Ensemble --Zhiyong Zhang
  • Adaptation for Chinglish under investigation --Mengyuan Zhao
  • On the Chinglish adaptation task, the best performance is obtained from retraining; dark knowledge helps adapt the model. Try tuning parameters layer by layer and changing cv --Mengyuan Zhao
  • Unsupervised training with WSJ contributes to the Aurora4 model --Xiangyu Zeng
  • test large database with AMIDA
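
For readers unfamiliar with the term, "dark knowledge" usually refers to training a student model on the temperature-softened output distribution of a teacher model. The sketch below shows that common formulation in NumPy; it is not necessarily the setup used in the experiments above, and the function names, the temperature, and the `soft_weight` mixing parameter are assumptions for illustration.

```python
import numpy as np

def softmax(logits, temperature=1.0):
    """Row-wise softmax; a higher temperature gives a flatter distribution."""
    z = logits / temperature
    z = z - z.max(axis=-1, keepdims=True)  # subtract max for stability
    e = np.exp(z)
    return e / e.sum(axis=-1, keepdims=True)

def distillation_loss(student_logits, teacher_logits, labels,
                      temperature=2.0, soft_weight=0.5):
    """Mix cross-entropy against teacher soft targets and hard labels.

    Hypothetical helper: temperature and soft_weight are assumed values.
    """
    # Soft targets: teacher posteriors softened by the temperature.
    soft_targets = softmax(teacher_logits, temperature)
    student_soft = softmax(student_logits, temperature)
    soft_ce = -np.sum(soft_targets * np.log(student_soft + 1e-12), axis=-1)

    # Hard targets: ordinary cross-entropy at temperature 1.
    student_hard = softmax(student_logits, 1.0)
    rows = np.arange(len(labels))
    hard_ce = -np.log(student_hard[rows, labels] + 1e-12)

    return float(np.mean(soft_weight * soft_ce + (1.0 - soft_weight) * hard_ce))
```

In an adaptation setting, the teacher would typically be the original model and the student the model being adapted, so that the soft targets act as a regularizer against drifting too far from the original behaviour.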

bilingual recognition