“ASR Status Report 2017-9-18”版本间的差异

来自cslt Wiki
跳转至: 导航搜索
(以“ {| class="wikitable" !Date!!People !! Last Week !! This Week |- | rowspan="9"|2017.9.4 |Jiayin Cai || * || * |- |- |Xiaofei Kang || * || * |- |- |Miao Zhang...”为内容创建页面)
 
 
(6位用户的10个中间修订版本未显示)
第3行: 第3行:
 
!Date!!People !! Last Week !! This Week
 
!Date!!People !! Last Week !! This Week
 
|-
 
|-
| rowspan="9"|2017.9.4
+
| rowspan="9"|2017.9.18
  
  
 
|Jiayin Cai
 
|Jiayin Cai
 
||
 
||
*
+
* Absent
 
||
 
||
 
*
 
*
第17行: 第17行:
 
|Xiaofei Kang
 
|Xiaofei Kang
 
||  
 
||  
*
+
*Test and improve the IOS APP for recording audios.
 +
*Finish the experiment to test the machine error rate,the result is in my cvss [http://192.168.0.51:5555/cgi-bin/cvss/cvss_request.pl?account=kangxf&step=view_request&cvssid=629 here] .
 
||  
 
||  
*
+
*Record the audios with zhangmiao using the money from wang.
 
|-
 
|-
  
第26行: 第27行:
 
|Miao Zhang
 
|Miao Zhang
 
||  
 
||  
*
+
* Finish human test website
 +
* Design recording app with Kangxf
 +
* T-SNE analysis
 
||  
 
||  
*
+
* Absent for school class
 
|-
 
|-
  
第35行: 第38行:
 
|Yanqing Wang
 
|Yanqing Wang
 
||  
 
||  
*
+
* Implementation of node-pruning.
 +
* comparison of connection-pruning and node-pruning, see [http://192.168.0.51:5555/cgi-bin/cvss/cvss_request.pl?account=wangyanqing&step=view_request&cvssid=634 here]
 
||
 
||
*
+
* continue on relationship and comparison of connection-pruning and node-pruning.
 +
* Implementation of long-term dropout and experiments based on it.
 
|-
 
|-
  
第44行: 第49行:
 
|Ying Shi   
 
|Ying Shi   
 
||  
 
||  
*
+
* group-based softmax finished [http://192.168.0.51:5555/cgi-bin/cvss/cvss_request.pl?account=shiying&step=view_request&cvssid=627 here]
 +
* multi-decoding for group-based softmax (in progress)
 
||  
 
||  
*
+
* mulit-decoding for group-based softmax
 +
* PTN
 +
* apply Lid for group-based softmax
 
|-
 
|-
  
第62行: 第70行:
 
|Lantian Li   
 
|Lantian Li   
 
||  
 
||  
* Go on speaker segmentation tasks, see [http://192.168.0.51:5555/cgi-bin/cvss/cvss_request.pl?account=lilt&step=view_request&cvssid=615 here]
+
* Go on speaker segmentation tasks, see [http://192.168.0.51:5555/cgi-bin/cvss/cvss_request.pl?account=lilt&step=view_request&cvssid=615]
 
** Make some smooth tricks (Silence limits [MDR] and window-based smooth [FAR]).
 
** Make some smooth tricks (Silence limits [MDR] and window-based smooth [FAR]).
 
** R.T. test.
 
** R.T. test.
* Music detection, see [http://192.168.0.51:5555/cgi-bin/cvss/cvss_request.pl?account=lilt&step=view_request&cvssid=624]
+
* Music / Noise detection, see [http://192.168.0.51:5555/cgi-bin/cvss/cvss_request.pl?account=lilt&step=view_request&cvssid=624]
 
||
 
||
*  
+
* Package the code for speaker segmentaion.
 +
* Go on music / noise detection tasks.
 
|-
 
|-
  
第74行: 第83行:
 
|Zhiyuan Tang  
 
|Zhiyuan Tang  
 
||  
 
||  
*
+
* Part theoretical study of mispronunciation detection.
 +
* Toolbook writing.
 
||
 
||
*
+
* Experiments on phonetic LID.
 +
* Experiments on mispronunciation detection
 
|-
 
|-
  
第85行: 第96行:
 
!Date!!People !! Last Week !! This Week
 
!Date!!People !! Last Week !! This Week
 
|-
 
|-
| rowspan="9"|2017.9.4
+
| rowspan="9"|2017.9.11
  
  

2017年9月24日 (日) 13:34的最后版本

Date People Last Week This Week
2017.9.18


Jiayin Cai
  • Absent
Xiaofei Kang
  • Test and improve the IOS APP for recording audios.
  • Finish the experiment to test the machine error rate,the result is in my cvss here .
  • Record the audios with zhangmiao using the money from wang.
Miao Zhang
  • Finish human test website
  • Design recording app with Kangxf
  • T-SNE analysis
  • Absent for school class
Yanqing Wang
  • Implementation of node-pruning.
  • comparison of connection-pruning and node-pruning, see here
  • continue on relationship and comparison of connection-pruning and node-pruning.
  • Implementation of long-term dropout and experiments based on it.
Ying Shi
  • group-based softmax finished here
  • multi-decoding for group-based softmax (in progress)
  • mulit-decoding for group-based softmax
  • PTN
  • apply Lid for group-based softmax
Yixiang Chen
  • Absent
Lantian Li
  • Go on speaker segmentation tasks, see [1]
    • Make some smooth tricks (Silence limits [MDR] and window-based smooth [FAR]).
    • R.T. test.
  • Music / Noise detection, see [2]
  • Package the code for speaker segmentaion.
  • Go on music / noise detection tasks.
Zhiyuan Tang
  • Part theoretical study of mispronunciation detection.
  • Toolbook writing.
  • Experiments on phonetic LID.
  • Experiments on mispronunciation detection

Date People Last Week This Week
2017.9.11


Jiayin Cai
  • Got phonetic feat from a stronger phonetic network
  • Finished part of the experiment using stronger phonetic feature.
  • Will be absent for school.
  • But I will finish the remaining experiment.
Xiaofei Kang
  • improve the human Test website:, save the test recordings, decline the positive samples
  • Recording and cutting the audios, a total of 12 groups
  • Continue to record the audios with zhangmiao
  • Continue to ask people to do human test
Miao Zhang
  • Perform human test
  • Record some other people and do the experiments again
  • Continue to ask people to do human test
  • Recording(the goal is to record 400 to 500 people) here
Yanqing Wang
  • Absent
Ying Shi
  • multi-decoding ASR model with more pdfs. Performance better than before but not well enough
  • add sperate symbel to discriminated kazak and uyghur word set
  • group-based softmax(in progress)
  • finish group-based softmax and test the performance
Yixiang Chen
  • Absent
Lantian Li
  • Go on speaker segmentation tasks, see here
    • Complete the phonetic-aware speaker segmentation.
      • Word-level boundaries from the ASR.
      • Word-level d-vector and clustering.
  • Try some smooth tricks.
Zhiyuan Tang
  • Organized the code and doc of Parrot system[3]
  • Theoretical study of pronunciation detection