ASR Status Report 2017-9-18

来自cslt Wiki
2017年9月18日 (一) 05:04Tangzy讨论 | 贡献的版本

跳转至: 导航搜索
Date People Last Week This Week
2017.9.4


Jiayin Cai
Xiaofei Kang
Miao Zhang
Yanqing Wang
Ying Shi
  • group-based softmax finished here
  • multi-decoding for group-based softmax (in progress)
  • mulit-decoding for group-based softmax
  • PTN
  • apply Lid for group-based softmax
Yixiang Chen
  • Absent
Lantian Li
  • Go on speaker segmentation tasks, see [1]
    • Make some smooth tricks (Silence limits [MDR] and window-based smooth [FAR]).
    • R.T. test.
  • Music / Noise detection, see [2]
  • Package the code for speaker segmentaion.
  • Go on music / noise detection tasks.
Zhiyuan Tang
  • Part theoretical study of mispronunciation detection.
  • Toolbook writing.
  • Experiments on phonetic LID.
  • Experiments on mispronunciation detection

Date People Last Week This Week
2017.9.4


Jiayin Cai
  • Got phonetic feat from a stronger phonetic network
  • Finished part of the experiment using stronger phonetic feature.
  • Will be absent for school.
  • But I will finish the remaining experiment.
Xiaofei Kang
  • improve the human Test website:, save the test recordings, decline the positive samples
  • Recording and cutting the audios, a total of 12 groups
  • Continue to record the audios with zhangmiao
  • Continue to ask people to do human test
Miao Zhang
  • Perform human test
  • Record some other people and do the experiments again
  • Continue to ask people to do human test
  • Recording(the goal is to record 400 to 500 people) here
Yanqing Wang
  • Absent
Ying Shi
  • multi-decoding ASR model with more pdfs. Performance better than before but not well enough
  • add sperate symbel to discriminated kazak and uyghur word set
  • group-based softmax(in progress)
  • finish group-based softmax and test the performance
Yixiang Chen
  • Absent
Lantian Li
  • Go on speaker segmentation tasks, see here
    • Complete the phonetic-aware speaker segmentation.
      • Word-level boundaries from the ASR.
      • Word-level d-vector and clustering.
  • Try some smooth tricks.
Zhiyuan Tang
  • Organized the code and doc of Parrot system[3]
  • Theoretical study of pronunciation detection