|
|
第235行: |
第235行: |
| |Qi Qu | | |Qi Qu |
| || | | || |
− | * | + | * AED: |
| + | ** New CED-based classifiers deployed onto devices, yielding acceptable performance. |
| + | * KWS: |
| + | ** Quantization and format conversion of production models for deployment on embedded device w/ NPU. Default quantization mode leads to unacceptable loss of precision. Will try hybrid quantization. |
| + | ** Text-enrollment KWS: some dynamic dimensions misinterpreted as constant duration exportation to ONNX. |
| || | | || |
| * | | * |
People |
This Week |
Next Week |
Task Tracking (DeadLine)
|
Dong Wang
|
- Primary School AI hand book (20-30)
|
|
|
Lantian Li
|
- AI-Graph EN (25/50)
- Complete CSTR intro report (11.18)
|
|
|
Ying Shi
|
- Cohort-Overlap ASR
- condition on real decode result
- Design stop criterion
- Cohort-Speech separation
- several configs for Dual-path model
- group work
|
|
|
Zhenghai You
|
|
|
|
Junming Yuan
|
- The result of feat-mask/time-mask MT-HuBERT [1]
|
|
|
Xiaolou Li
|
|
|
|
Zehua Liu
|
- Verify VSR data
- Finish Data Verification Report
- ICL work(CER: 47.87% < CER: 51.08%)
- Time Mask matters[2]
|
|
|
Pengqi Li
|
- Complete the final report of the doctoral innovation project(School)
- Exploring the Consistency of TAO and LayerCAM Results on different models and datasets.
- Conclusion and hypothesis[3]
|
|
|
Wan Lin
|
|
|
|
Tianhao Wang
|
- adjust the code of AudioSep (CLAP) to support multi-mix and audio-query (in training)
- some project testing
|
|
|
Xiaoxue Luo
|
- AudioSep reproduction
- evaluate the performance of AudioSep
- comparative experiment between AudioSep and baseline system(CLIPSep)
|
|
|
Zhenyu Zhou
|
|
|
|
Junhui Chen
|
|
|
|
Jiaying Wang
|
|
|
|
Yu Zhang
|
|
|
|
Wenqiang Du
|
- Participated in an AI competition
|
|
|
Yang Wei
|
|
|
|
Lily
|
|
|
|
Turi
|
- Whisper finetuning on sagalee
- with encoder frozen, whisper-large-v3 (20.5 WER)
- Finetuning LLM
- Finetuned Qwen2.5-0.5B on conversation dataset translated from English to Oromo
-
|
|
Yue Gu
|
|
|
|
Qi Qu
|
- AED:
- New CED-based classifiers deployed onto devices, yielding acceptable performance.
- KWS:
- Quantization and format conversion of production models for deployment on embedded device w/ NPU. Default quantization mode leads to unacceptable loss of precision. Will try hybrid quantization.
- Text-enrollment KWS: some dynamic dimensions misinterpreted as constant duration exportation to ONNX.
|
|
|