Difference between revisions of "2024-05-13"

From cslt Wiki
(One intermediate revision by the same user not shown)
Line 172:
  |Yu Zhang
  ||
- *
+ * AutoML
+ ** EvalML test result[https://z1et6d3xtb.feishu.cn/docx/EDO1dLwHToDqiCxhHf6cLXDVnlb?from=from_copylink]
  ||
  *

Revision as of 10:07, 13 May 2024 (Mon)

People | This Week | Next Week | Task Tracking (Deadline)
Dong Wang
Lantian Li
Ying Shi
  • verify cohort Overlap ASR assumption
    • Identify the speech component most similar to the cohort vector ✔
  • group work
  • cohort + conditional chain Overlap ASR
Zhenghai You
Junming Yuan
  • Continue to add various data augmentation functions into the code
  • Prepare for live broadcast
Chen Chen
Xiaolou Li
  • Video mamba exp (good good)
    • patch frontend
    • conv3d and resnet3d frontend
  • Paper reading
  • run exp on LRS2 and LRS3 (waiting for email feedback)
  • What is the main difference between these two frontends (conv3d and resnet3d)?
Zehua Liu
  • AKVSR (CER: 49.71%) > baseline (CER: 48.76%)
    • AKVSR + pos_emb (a little worse)
    • AKVSR + attention score loss (coding)
Pengqi Li
  • Jinfu and LiuHuan's Outlines of NC
  • XueYing's Outline of NC
  • NC paper of Speech XAI overview
Wan Lin
Tianhao Wang
  • Baseline: SpEx+ with Detection (Failed)
    • difficult to train because vox2 has a much larger data volume than wsj0
  • Toolkit align: lr scheduler, pooling
    • pooling seems critical (same epoch, NS loss: ASP is 0.16 vs TSP is 0.22)
Zhenyu Zhou
Junhui Chen
Jiaying Wang
Yu Zhang
  • AutoML
    • EvalML test result [https://z1et6d3xtb.feishu.cn/docx/EDO1dLwHToDqiCxhHf6cLXDVnlb?from=from_copylink]
Wenqiang Du
  • Some project testing
Yang Wei
Lily
  • PPT delivery
  • Thesis
  • Perception experiment
Turi
  • Data Collection
    • Checking audios
  • Coursework
Yue Gu
Qi Qu