“2024-05-13”版本间的差异

来自cslt Wiki
跳转至: 导航搜索
 
(4位用户的4个中间修订版本未显示)
第10行: 第10行:
 
* Several public reports
 
* Several public reports
 
* Review for Electonics and Applied Science
 
* Review for Electonics and Applied Science
 
 
||
 
||
 
*
 
*
第21行: 第20行:
 
|Lantian Li
 
|Lantian Li
 
||
 
||
*  
+
* GPU status [https://z1et6d3xtb.feishu.cn/wiki/XGcGwRK5viJmpRkjH9AczIhynCh]
 +
* Projects (AED -> Hardware support, TSE -> Test&Analysis)
 +
* ASIP-BUPT (NeuralScoring -> Paper, CohortSS -> Data Analysis)
 +
* Check NIPS & Review theses
 
||
 
||
 
*  
 
*  
第27行: 第29行:
 
*   
 
*   
 
|-
 
|-
 +
  
  
第121行: 第124行:
 
|Wan Lin
 
|Wan Lin
 
||  
 
||  
*  
+
* EAASP in Sunine(EER)
 +
** EA:4.292(3.106 wespeaker)
 +
** Mix: 7.733(5.962 wespeaker)
 +
* Add CNN condition in test encoder: currently unsuccessful
 
||
 
||
 
*
 
*
第146行: 第152行:
 
|Zhenyu Zhou
 
|Zhenyu Zhou
 
||  
 
||  
*
+
*HUAWEI project process[https://z1et6d3xtb.feishu.cn/docx/PBAZdsiSWoq82YxWsu3cCD4Tnte]
 
||
 
||
 
*
 
*
第169行: 第175行:
 
|Jiaying Wang
 
|Jiaying Wang
 
||  
 
||  
*  
+
* find bad cases in the test set(gender confusion)
 
||
 
||
*  
+
* data analyse
 +
* focus on cohort outside masker
 
||
 
||
 
*   
 
*   
第238行: 第245行:
 
|Yue Gu
 
|Yue Gu
 
||
 
||
*  
+
* fail to reproduct the semantic paraformer
 +
* write paper:30% of experimental part
 +
* kespeech baseline
 
||
 
||
 
*
 
*

2024年5月13日 (一) 11:22的最后版本

People This Week Next Week Task Tracking (DeadLine)
Dong Wang
  • Material preparation for Xinhua Net broadcast
  • Several public reports
  • Review for Electonics and Applied Science
Lantian Li
  • GPU status [1]
  • Projects (AED -> Hardware support, TSE -> Test&Analysis)
  • ASIP-BUPT (NeuralScoring -> Paper, CohortSS -> Data Analysis)
  • Check NIPS & Review theses
Ying Shi
  • verify cohort Overlap ASR assumption
    • Identify the speech component which most similar to the cohort vector ✔
  • group work
  • cohort + conditional chain Overlap ASR
Zhenghai You
  • Speech tests and deliver real test samples for HUAWEI
  • Loudness testing and adjustment of Huawei data[2]
  • Comparative experiments on data expansion
Junming Yuan
  • Continue to add various data augmentation functions into the code
  • Prepare for live broadcast
Chen Chen
  • attend several interviews for job
  • vii group work [3]
Xiaolou Li
  • Video mamba exp (good good)
    • patch frontend
    • conv3d and resnet3d frontend
  • Paper reading
  • run exp on LRS2 and LRS3 (waiting for email feedback)
  • what is the main difference between these two frontend? (conv3d and resnet3d)
Zehua Liu
  • AKVSR (cer:49.71%) > baseline(cer: 48.76%)
    • AKVSR + pos_emb (a little worse)
    • AKVSR + attention score loss(coding)
Pengqi Li
  • Jinfu and LiuHuan's Outlines of NC
  • XueYing's Outline of NC
  • NC paper of Speech XAI overview
Wan Lin
  • EAASP in Sunine(EER)
    • EA:4.292(3.106 wespeaker)
    • Mix: 7.733(5.962 wespeaker)
  • Add CNN condition in test encoder: currently unsuccessful
Tianhao Wang
  • Baseline: SpEx+ with Detection (Failed)
    • difficult to train because vox2 has a much larger data volume than wsj0
  • Toolkit align: lr scheduler, pooling
    • pooling seems critical (same epoch, NS loss: ASP is 0.16 vs TSP is 0.22)
Zhenyu Zhou
  • HUAWEI project process[4]
Junhui Chen
  • Graduation paper
  • Neural Scoring paper writing
Jiaying Wang
  • find bad cases in the test set(gender confusion)
  • data analyse
  • focus on cohort outside masker
Yu Zhang
  • AutoML
    • EvalML test result[5]
Wenqiang Du
  • Just some project test
Yang Wei
  • Children MDD challenge
    • Refine documentation and prepare material for discuss
  • Huilan stuff
    • Reduce size of TTS Docker image
Lily
  • AIGraph PPT delivery
  • Thesis
  • Perception Experiment
Turi
  • Data Collection
    • Checking audios
  • Class works
Yue Gu
  • fail to reproduct the semantic paraformer
  • write paper:30% of experimental part
  • kespeech baseline
Qi Qu
  • KWS
    • Standardize dataset formats and test routines.
    • Data collection and processing.