“2024-09-30”版本间的差异

来自cslt Wiki
跳转至: 导航搜索
第79行: 第79行:
 
|Xiaolou Li
 
|Xiaolou Li
 
||
 
||
*
+
* Use MFA on LRS3 to cut it into small segments
 +
* Use discrete embedding of avhubert in vsp-llm training (Still training)
 +
* Some idea of align video feature and LLM (Dense Connector, CL methods)
 +
* Handover the data collection and get familiar with the process
 +
* Data Collection: 3138 h (need to re-check, DDL: 10.15)
 
||
 
||
 
*
 
*

2024年9月30日 (一) 10:39的版本

People This Week Next Week Task Tracking (DeadLine)
Dong Wang
Lantian Li
  • AI-Graph handbook v0.1
  • AI-Graph EN (12/50)
  • Huawei TiDing 3.0 - Model Quantization
  • BUPT/AI-Radiance trivial things
Ying Shi
  • Add 4 kinds of negative sampling strategies Optimized Text-enroll KWS code
    • (deletion, substitution, insertion, and shuffle) and verify them to ensure no bugs.
    • Find that new negative sampling will increase the difficulty of training which indicates that only depending on positional embedding is not enough.
  • Reproduce conditional chain overlap asr (Sequence to Multi-Sequence Learning via Conditional Chain Mapping for Mixture Signals)
    • According to Jiaying's work the code released by the published paper can not work
    • Write dominance-based conditional chain overlap asr by myself (in progress)
Zhenghai You
Junming Yuan
Chen Chen
Xiaolou Li
  • Use MFA on LRS3 to cut it into small segments
  • Use discrete embedding of avhubert in vsp-llm training (Still training)
  • Some idea of align video feature and LLM (Dense Connector, CL methods)
  • Handover the data collection and get familiar with the process
  • Data Collection: 3138 h (need to re-check, DDL: 10.15)
Zehua Liu
Pengqi Li
Wan Lin
  • Voxblink1 model training and testing [1]
Tianhao Wang
Zhenyu Zhou
Junhui Chen
Jiaying Wang
Yu Zhang
Wenqiang Du
Yang Wei
Lily
Turi
Yue Gu
  • Almost complete the revisions of my journal paper
Qi Qu