“2024-04-08”版本间的差异

来自cslt Wiki
跳转至: 导航搜索
 
(5位用户的8个中间修订版本未显示)
第34行: 第34行:
 
|Ying Shi
 
|Ying Shi
 
||  
 
||  
*
+
* Detect wake-up words from Continuous speech Down
 +
* SPL Paper, almost down
 +
* [https://z1et6d3xtb.feishu.cn/wiki/DYSjw7LfviU7u1kWbhEcFjE1nld?from=from_copylink Group Work]
 
||
 
||
 
*  
 
*  
第85行: 第87行:
 
|Xiaolou Li
 
|Xiaolou Li
 
||  
 
||  
*  
+
* Reproduce different VSR structure [https://z1et6d3xtb.feishu.cn/docx/BgD4djeTioCgd6xjFXNcBXXLnKg?from=from_copylink]
 +
** interCTC, Resnet3D frontend, Branchformer, E-Branchformer
 +
* Paper reading
 +
** Mamba, icassp2024
 
||
 
||
*  
+
* modify E-Branchformer
 +
* Resnet3D frontend + Branchformer test
 +
* s4 decoder (maybe), mamba paper learning
 
||
 
||
 
*   
 
*   
第97行: 第104行:
 
||  
 
||  
 
* read papper
 
* read papper
* auxiliary loss code
+
* auxiliary loss code[https://z1et6d3xtb.feishu.cn/docx/ZaTFd3A5EoK982xWBVschloanee?from=from_copylink]
 
||
 
||
 
*  
 
*  
第158行: 第165行:
 
|Junhui Chen
 
|Junhui Chen
 
||
 
||
*  
+
* Neural scoring with Frequency-channel attention
 +
** Our overlap test trial: EER 7.382% -> 7.132%
 +
* Graduation paper
 
||
 
||
*
+
*  
 
||
 
||
 
*   
 
*   
第169行: 第178行:
 
|Jiaying Wang
 
|Jiaying Wang
 
||  
 
||  
* HHuawei project:train speakerbeam on -4,4,converge,some bug in test(dataloader)
+
* HHuawei project:train speakerbeam on -4,4,converge,some bug in test
 
* cohort mini loss: check code & test
 
* cohort mini loss: check code & test
  

2024年4月8日 (一) 10:58的最后版本

People This Week Next Week Task Tracking (DeadLine)
Dong Wang
  • Primary AI design
  • NMI neuralmag paper submitted
Lantian Li
  • GPU status [1]
  • Projects (POC of Cough/Humming detection, TSE proposal)
  • ASIP-BUPT (NeuralScoring, CohortTSE)
  • AI Course Polish
  • Machine Moving
  • New machine (rabbit04)
Ying Shi
  • Detect wake-up words from Continuous speech Down
  • SPL Paper, almost down
  • Group Work
Zhenghai You
  • Checked the code of mini loss[2]
  • Test onnx of Huawei project[3]
Junming Yuan
  • Experimental report on "learn not to listen" v3 extend test[4]
Chen Chen
  • Finished my thesis :)
  • Group work [5]
    • finish training for most of the structures, but all performs a little worse
    • no good news from KD
    • good primary experiment result from char as modeling unit, need to check after whole training
    • 58 hours for CN-CVS II (too slow)
  • Try some data aug methods for VSR (Crop size)
  • Help with entropy analyze of child data
  • START ICASSP2024 PAPER READING
Xiaolou Li
  • Reproduce different VSR structure [6]
    • interCTC, Resnet3D frontend, Branchformer, E-Branchformer
  • Paper reading
    • Mamba, icassp2024
  • modify E-Branchformer
  • Resnet3D frontend + Branchformer test
  • s4 decoder (maybe), mamba paper learning
Zehua Liu
  • read papper
  • auxiliary loss code[7]
Pengqi Li
  • Summary[8] of speech processing XAI for NSFC
    • polish, reference, v1(90%)
  • Workshop report(video, slide, poster)
Wan Lin
  • Graduation paper
Tianhao Wang
  • EA-ASP (from SJTU) reproduced successfully [9]
    • EA-ASP implement, wespeaker toolkit modification, training pairs construction totally according to the paper
    • get the better EER(4.021%) comparing to the paper (5.212%) on Vox1-O-Overlap
    • evaluation on our trials (concat and weak_overlap are better, overlap and mix are worse)
Zhenyu Zhou
  • Finish NeuralScoring baseline[10]
  • ICASSP2024 report
Junhui Chen
  • Neural scoring with Frequency-channel attention
    • Our overlap test trial: EER 7.382% -> 7.132%
  • Graduation paper
Jiaying Wang
  • HHuawei project:train speakerbeam on -4,4,converge,some bug in test
  • cohort mini loss: check code & test
Yu Zhang
  • Financial Backtesting pipline
    • remake Stock and Industry return logic
    • debug Brinson Analysis result
  • Use SAC as a baseline to run through the entire process from training to policy generation to backtesting
Wenqiang Du
  • Some model update task
    • Chinese and Uyghur
Yang Wei
  • Fix problems about ASR model for mispronunciation detection task
  • Prepare the baseline system
Lily
  • Data Analysis
  • Paper Reading[11]
Qi Qu
  • First day
  • Dev environment setup
Turi
  • Prepared sentences
  • Prepared data collection app for release