People |
This Week |
Next Week |
Task Tracking (Deadline)
|
Dong Wang
|
- AI primary (middle-school) 1-6
|
|
|
Lantian Li
|
- GPU status [1]
- AI primary
- High school handbook (30/40)
|
- High school handbook (40/40)
|
|
Ying Shi
|
|
|
|
Zhenghai You
|
- Continued work on speaker segment and Ex-Former in TSE [2]
|
|
|
Jiaying Wang
|
- Reproducing the conditional chain code
- On both Libri and WSJ: the training loss is hard to reduce (stuck around -3), and the corresponding test SI-SDR is positive
- Rewriting the code (preserving the original model)
|
|
|
Junming Yuan
|
- Verified two parameters in the HuBERT pretraining config file whose settings were unclear relative to the original paper. [3]
- Confirmed that in the second iteration of pretraining, features should be extracted from the 6th transformer layer, not the 9th.
- At 175k steps: 6th layer result 71.55/9.39; 9th layer result 37.31/16.72
- Largely confirmed the setting of the 'untie_final_proj' parameter for the two pretraining iterations.
|
|
|
Xiaolou Li
|
|
|
|
Zehua Liu
|
|
|
|
Pengqi Li
|
- Investigating extremely short utterances in speaker recognition [4]
|
|
|
Tianhao Wang
|
- Reproducing CLIPSep on two datasets: MUSIC and VGGSound [5]
- MUSIC: Text query: 10.06 SDR, Image query: 12.13 SDR
- VGGSound: Text query: 2.78 SDR, Image query: 5.01 SDR
|
|
|
Zhenyu Zhou
|
|
|
|
Wan Lin
|
- VoxBlink1
- Data processing
- Baseline (ResNet34) training and NS training [6]
|
|
|
Junhui Chen
|
- VoxBlink1
- Data processing
- Baseline (ResNet34 ASP) training and NS training [7]
|
|
|
Yu Zhang
|
|
|
|
Wenqiang Du
|
- Completed the primary school handbook draft (45 + 8)
- Revising the format, wording, and distribution of knowledge points in the draft (40%)
|
|
|
Yang Wei
|
|
|
|
Lily
|
|
|
|
Turi
|
- Writing the dataset paper.
- Finished the Introduction, Literature Review, and Data Collection sections; the Experiment, Results, and Conclusion sections remain.
- Was not able to run more experiments on the dataset from Ethiopia due to a poor network connection.
|
|
|
Yue Gu
|
- Revised the introduction
- Completed the Interspeech poster and open-sourced the paper code
- Rested for two days; will focus on new work next
|
|
|
Qi Qu
|
|
- KWS:
- zh48 test dataset to be updated: ~30 speakers in 3 locations.
- yue10 (Cantonese 10 keywords) train dataset to be updated: ~120 speakers verified, more to come.
- Try to find suitable per-keyword thresholds based on the recall vs. false-alarm (FA) relation.
|
|
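Note on the KWS threshold item above: one possible way to pick one threshold per keyword from the Recall ~ FA relation is sketched below. This is only an illustration under stated assumptions, not the method actually in use: the function name `pick_threshold`, the rule "detection fires when score >= threshold", the FA budget, and all scores are hypothetical.

```python
def pick_threshold(pos_scores, neg_scores, max_fa_rate):
    """Return (threshold, recall, fa_rate) for one keyword.

    Assumption: a detection fires when score >= threshold.
    pos_scores: scores on utterances that contain the keyword.
    neg_scores: scores on utterances that do not contain it.
    Picks the lowest threshold whose FA rate is within max_fa_rate,
    which is also the feasible threshold with the highest recall.
    """
    candidates = sorted(set(pos_scores) | set(neg_scores))
    candidates.append(float("inf"))  # fallback: never fire, FA rate is 0
    for thr in candidates:  # ascending, so FA rate and recall only decrease
        fa_rate = sum(s >= thr for s in neg_scores) / len(neg_scores)
        if fa_rate <= max_fa_rate:
            recall = sum(s >= thr for s in pos_scores) / len(pos_scores)
            return thr, recall, fa_rate


# Hypothetical scores for two keywords: (positive scores, negative scores).
scores = {
    "kw_a": ([0.9, 0.8, 0.6, 0.4], [0.7, 0.3, 0.2, 0.1]),
    "kw_b": ([0.95, 0.5, 0.45], [0.6, 0.55, 0.2, 0.1]),
}

# One threshold per keyword under a shared (hypothetical) FA budget of 25%.
thresholds = {kw: pick_threshold(pos, neg, 0.25) for kw, (pos, neg) in scores.items()}
for kw, (thr, recall, fa_rate) in thresholds.items():
    print(f"{kw}: threshold={thr:.2f} recall={recall:.2f} fa_rate={fa_rate:.2f}")
```

In practice the FA budget would more likely be expressed as false alarms per hour on the negative test audio, and the budget could differ per keyword; the search itself stays the same.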