2024-08-19

People	This Week	Next Week
Dong Wang	AI primary (middle-school) 1-6
Lantian Li	GPU status [1] AI primary High school handbook (30/40)	High school handbook (40/40)
Ying Shi
Zhenghai You	Continue the work of speaker segment and Ex-Former in TSE[2]
Jiaying Wang	reproduce conditional chain code on both libri and wsj: training loss hard to reduce (around -3) and the corresponding test sisdr is positive rewriting the code (preserving the original model)
Junming Yuan	Verified two parameters in Hubert pretraining config file that were confused with the original paper.[3] Confirmed that in the second iteration of pretraining, features should be extracted from the 6-th layer of the transformer, not the 9-th layer. in 175k step, result of 6-th layer: 71.55/9.39, result of 9-th layer: 37.31/16.72 Basically confirmed the setting of the parameter 'untie_final_proj' for the two iterations of pretraining.
Xiaolou Li
Zehua Liu
Pengqi Li	Investigating Extremely Short-Utterance in speaker recognition[4]
Tianhao Wang	reproducing CLIPSep on two datasets: MUSIC and VGGSound [5] MUSIC: Text query: 10.06 SDR, Image query: 12.13 SDR VGGSound: Text query: 2.78 SDR, Image query: 5.01 SDR
Zhenyu Zhou
Wan Lin	VoxBlink1 Data processing Baseline(ResNet34) training and NS training [6]
Junhui Chen	VoxBlink1 Data processing Baseline(ResNet34 ASP) training and NS training [7]
Yu Zhang
Wenqiang Du	Complete Primary school handbook draft (45 + 8) Modify the format, expression, and distribution of knowledge points in the draft（40%）
Yang Wei
Lily
Turi	Writing dataset paper. Done with Intro, Literature review, Data collection sections. Experiment, Result and Conclusion sections remaining. Wasn't able to do more experiment on dataset from Ethiopia due to poor network
Yue Gu	modify the introduction complete the interspeech poster, and open source the paper code rest for two days, next I will focus on my new work
Qi Qu	Inactive due to absence.	KWS: zh48 test dataset to be updated: ~30 speakers in 3 locations. yue10 (Cantonese 10 keywords) train dataset to be updated: ~120 speakers verified, more to come. Try to find suitable keyword-wise thresholds based on Recall ~ FA relation.

2024-08-19

导航菜单

个人工具

名字空间

变种

查看

操作

搜索

导航

工具