“Weekly reading”版本间的差异
来自cslt Wiki
(10位用户的99个中间修订版本未显示) | |||
第2行: | 第2行: | ||
'''清华大学语音语言中心内部学习会 | '''清华大学语音语言中心内部学习会 | ||
− | '''时间: | + | '''时间: 每周五晚19:30''' |
'''地点: 1区303''' | '''地点: 1区303''' | ||
第10行: | 第10行: | ||
! Date !! Speaker!! Title !! Materials | ! Date !! Speaker!! Title !! Materials | ||
|- | |- | ||
− | | | + | | || || PPT模板 ||[[媒体文件:Weeklyreading_template.rar]] |
|- | |- | ||
− | | 2021/ | + | | 2021/04/01 ||Haoran Sun || Zeus code regularization ||[[媒体文件:代码规范.pdf]] |
|- | |- | ||
− | | 2021/05/27 ||Di Wang | + | | 2021/05/20 ||Chen Chen || Overview of speech enhancement|| [[媒体文件:Speech_enhancement.pdf]] |
+ | |- | ||
+ | | 2021/05/27 ||Di Wang || Secret of 'hard trials' || [[媒体文件:Secret_of_hard_trials.pdf]] | ||
|- | |- | ||
| 2021/06/10 ||Jingxin Shen ||Expriments about thermal to RGB face synthesis with cycleGan and pix2pix || [[媒体文件:Expriments about thermal to RGB face synthesis with cycleGan and pix2pix.pdf]] | | 2021/06/10 ||Jingxin Shen ||Expriments about thermal to RGB face synthesis with cycleGan and pix2pix || [[媒体文件:Expriments about thermal to RGB face synthesis with cycleGan and pix2pix.pdf]] | ||
|- | |- | ||
− | | 2021/06/17 ||Yang Zhang || NIPS2020: Long-Tailed Classification by Keeping the Good and Removing the Bad Momentum Causal Effect || [[媒体文件:long-tail.pdf]] | + | | 2021/06/17 ||Yang Zhang || NIPS2020: Long-Tailed Classification by Keeping the Good and Removing the Bad Momentum Causal Effect || [[媒体文件:long-tail.pdf]] |
|- | |- | ||
− | | 2021/07/08 ||Tiankai Zhi || Some experiments on stargan ||[[媒体文件:Some experiments on stargan.pdf]] | + | | 2021/07/08 ||Tiankai Zhi || Some experiments on stargan ||[[媒体文件:Some experiments on stargan.pdf]] |
|- | |- | ||
− | | 2021/07/15 ||Jiao Han || MG experiments based on ASV system || [[媒体文件:MG experiments based on ASV system..pptx]] | + | | 2021/07/15 ||Jiao Han || MG experiments based on ASV system || [[媒体文件:MG experiments based on ASV system..pptx]] |
|- | |- | ||
| 2021/07/22 ||Zixi Yan & Sirui Li || Unsupervised Speech Recognition || [[媒体文件:Unsupervised_Speech_Recognition.pdf]] | | 2021/07/22 ||Zixi Yan & Sirui Li || Unsupervised Speech Recognition || [[媒体文件:Unsupervised_Speech_Recognition.pdf]] | ||
|- | |- | ||
− | | 2021/07/29 ||Pengqi Li || A Simulation Study on Robust MAML || [[媒体文件:A Simulation Study on Robust MAML.pdf]] | + | | 2021/07/29 ||Pengqi Li || A Simulation Study on Robust MAML || [[媒体文件:A Simulation Study on Robust MAML.pdf]] |
|- | |- | ||
| 2021/08/12 ||Qingyang Zhu || Noise-aware method for Speech Enhancement || [[媒体文件:Noise-aware method for Speech Enhancement.pdf]] | | 2021/08/12 ||Qingyang Zhu || Noise-aware method for Speech Enhancement || [[媒体文件:Noise-aware method for Speech Enhancement.pdf]] | ||
|- | |- | ||
− | | 2021/08/12 ||Weida Liang || Unsupervised Audio-Visual Synthesis via Exemplar Autoencoders || [[媒体文件:Bi-weekly_report_Liangwd.pdf]] | + | | 2021/08/12 ||Weida Liang || Unsupervised Audio-Visual Synthesis via Exemplar Autoencoders || [[媒体文件:Bi-weekly_report_Liangwd.pdf]] |
|- | |- | ||
− | | 2021/08/19 ||Di Wang || Inter Dataset Variability Compensation || [[媒体文件:Inter_dataset_variability_compensation.pdf]] | + | | 2021/08/19 ||Di Wang || Inter Dataset Variability Compensation || [[媒体文件:Inter_dataset_variability_compensation.pdf]] |
|- | |- | ||
− | | 2021/09/02 ||Tiankai Zhi || One Shot VC || [[媒体文件:One_shot_VC.pdf]] | + | | 2021/09/02 ||Tiankai Zhi || One Shot VC || [[媒体文件:One_shot_VC.pdf]] |
|- | |- | ||
| 2021/09/09 ||Jingxin Shen || Thermal Speaking || [[媒体文件:Thermal_Speaking_2021.pdf]] | | 2021/09/09 ||Jingxin Shen || Thermal Speaking || [[媒体文件:Thermal_Speaking_2021.pdf]] | ||
第40行: | 第42行: | ||
| 2021/09/23 ||Sirui Li & Zixi Yan || Wav2vec-U Experimental Report || [[媒体文件:Wav2vec-U_experimental_report.pdf ]] | | 2021/09/23 ||Sirui Li & Zixi Yan || Wav2vec-U Experimental Report || [[媒体文件:Wav2vec-U_experimental_report.pdf ]] | ||
|- | |- | ||
− | | 2021/10/20 ||Renmiao Chen|| Is Someone Speaking? || [[媒体文件:Is_Someone_Speaking_Exploring_Long-term_Temporal_Features.pdf ]] | + | | 2021/10/20 ||Renmiao Chen || Is Someone Speaking? || [[媒体文件:Is_Someone_Speaking_Exploring_Long-term_Temporal_Features.pdf ]] |
|- | |- | ||
− | | 2021/10/28 ||Chen Chen || WenetSpeech Introduction || [[媒体文件:WenetSpeech_Dataset_Introduction.pdf ]] | + | | 2021/10/28 ||Chen Chen || WenetSpeech Introduction || [[媒体文件:WenetSpeech_Dataset_Introduction.pdf ]] |
|- | |- | ||
− | | 2021/11/10 ||Weida Liang || Cycle-loss Exemplar Autoencoder || [[媒体文件:Cycle-loss_Exemplar_Autoencoder.pdf ]] | + | | 2021/11/10 ||Weida Liang || Cycle-loss Exemplar Autoencoder || [[媒体文件:Cycle-loss_Exemplar_Autoencoder.pdf ]] |
|- | |- | ||
− | | 2021/11/17 ||吾买尔江 || Modulation Spectrum || [[媒体文件:Modulation_Spectrum.pdf ]] | + | | 2021/11/17 ||吾买尔江 || Modulation Spectrum || [[媒体文件:Modulation_Spectrum.pdf ]] |
|- | |- | ||
− | | 2021/11/24 ||Chen Chen || S-DCCRN || [[媒体文件:S-DCCRN_pdf.pdf ]] | + | | 2021/11/24 ||Chen Chen || S-DCCRN || [[媒体文件:S-DCCRN_pdf.pdf ]] |
|- | |- | ||
− | | 2021/12/01 ||Pengqi Li || | + | | 2021/12/01 ||Pengqi Li || GuidedMix: An on-the-fly data augmentation approach for robust speaker recognition system || [[媒体文件:201201-GuidedMix-LPQ.pdf ]] |
|- | |- | ||
− | | 2021/12/08 ||Renmiao Chen || | + | | 2021/12/08 ||Renmiao Chen || Multimodal preson verification || [[媒体文件:Multimodal_preson_verification.pdf]] |
|- | |- | ||
− | | 2021/12/15 ||Ruihai Hou || | + | | 2021/12/15 ||Ruihai Hou || Crossmodal clustered contrastive learning: Grounding of spoken language to gesture || [[媒体文件:Crossmodal_clustered_contrasti.pdf]] |
|- | |- | ||
− | | 2021/12/ | + | | 2021/12/29 ||Zixi Yan || Capsules Network || [[媒体文件:Capsules_Network.pdf]] |
|- | |- | ||
− | | | + | | 2022/01/05 ||Sirui Li || Self-Supervised Learning for speech recognition with Intermediate layer supervision || [[媒体文件:SSL with Intermediate layer supervision.pdf]] |
|- | |- | ||
− | | 2022/01/ | + | | 2022/01/12 ||Weida Liang || FragmentVC || [[媒体文件:FragmentVC.pdf]] |
|- | |- | ||
− | | 2022/01/ | + | | 2022/01/19 ||Haoyu Jiang || Multi-modality Associative Bridging through Memory: Speech Sound Recollected from Face Video || [[媒体文件:Multi-modality_Associative_Bridging_through_Memory.pdf]] |
|- | |- | ||
− | | | + | | 2022/02/14 || || Interspeech 2021 Review || [[媒体文件:Interspeech_paper_review_min.pdf]] |
|- | |- | ||
− | | | + | | 2022/02/16 ||Chen Chen || Audio Visual HuBERT || [[媒体文件:AVHuBERT.pdf]] |
|- | |- | ||
− | | | + | | 2022/03/04 ||Pengqi Li || Study of Visualization || [[媒体文件:Visualization.pdf]] |
|- | |- | ||
− | | | + | | 2022/03/11 ||Renmiao Chen || Can audio-visual integration strengthen robustness under multimodal attacks? || [[媒体文件:Audio-Visual_Robustness_Under_Multimodal_Attacks.pdf]] |
|- | |- | ||
− | | | + | | 2022/03/11 ||吾买尔江 || Signal Separation || [[媒体文件:Signal_Separation.pdf]] |
|- | |- | ||
− | | | + | | 2022/03/18 ||Chen Chen || Overview on Lip Reading and Audio-visual Speech Recognition || [[媒体文件:LipReadingAndAVSR.pdf]] |
|- | |- | ||
− | | | + | | 2022/04/01 ||Ruihai Hou || Scalable Identity-Oriented Speech Retrieval || [[媒体文件:Scalable_Identity-Oriented_Speech_Retrieval.pdf]] |
|- | |- | ||
− | | ||Haoyu Jiang || || | + | | 2022/04/08 ||Zixi Yan || Wav2vec related papers share || [[媒体文件:Wav2vec_related_papers.pdf]] |
+ | |- | ||
+ | | 2022/04/22 ||Sirui Li || Speech-Based Language Modelling || [[媒体文件:Speech-Based Language Modelling.pdf]] | ||
+ | |- | ||
+ | | 2022/04/29 ||Haoyu Jiang || Models of Speaker Recognition || [[媒体文件:Models_of_Speaker_Recognition.pdf]] | ||
+ | |- | ||
+ | | 2022/05/13 ||Chen Chen || Audio-visual Representation Learning || [[媒体文件:Audio_visual_representation_learning.pdf]] | ||
+ | |- | ||
+ | | 2022/05/20 ||Haoran Sun || || | ||
+ | |- | ||
+ | | 2022/05/27 ||Pengqi Li || The important ”feature” for speaker recognition || [[媒体文件:The important ”feature” for speaker recognition.pdf]] | ||
+ | |- | ||
+ | | 2022/06/10 ||Zixi Yan || Paper Share || [[媒体文件:Paper_share_yzx0610.pdf]] | ||
+ | |- | ||
+ | | 2022/06/24 ||Renmiao Chen || Transformer in multimodal || [[媒体文件:Transformer_in_multimodal.pdf]] | ||
+ | |- | ||
+ | | || || ICASSP 2022 review || [[媒体文件:ICASSP2022_review.pdf]] [[媒体文件:ICASSP-2022-readinglist.pdf]] | ||
+ | |- | ||
+ | | 2022/07/04 ||Chen Chen || Video to Speech papers || [[媒体文件:VTS_cc.pdf]] | ||
+ | |- | ||
+ | | 2022/07/08 ||Ruihai Hou || ICASSP 2022 review (part) || [[媒体文件:Weeklyreading_hrh.pdf]] | ||
+ | |- | ||
+ | | 2022/07/15 ||Sirui Li || Towards End-to-end Unsupervised Speech Recognition || [[媒体文件:Towards_End_to_end_Unsupervised_Speech_Recognition.pdf]] | ||
+ | |- | ||
+ | | 2022/07/22 ||Wan Lin || AutoED: Text-independent unsupervised speaker recognition Model|| [[媒体文件:AutoED_spk_reg.pdf]] | ||
+ | |- | ||
+ | | 2022/07/29 ||Haoyu Jiang || ArcFace_iQIYI-VID || [[媒体文件:ArcFace_iQIYI-VID.pdf]] | ||
+ | |- | ||
+ | | 2022/08/05 ||Chen Chen || Recent advance in VTS task || [[媒体文件:RecentVTS.pdf]] | ||
+ | |- | ||
+ | | 2022/08/12 ||Tianhao Wang || Extremal Perturbations || [[媒体文件:Extremal_perturbations.pdf]] | ||
+ | |- | ||
+ | | 2022/08/19 ||Renmiao Chen || The correlation of face and vioce || [[媒体文件:The_correlation_of_face_and_vioce_CRM.pdf]] | ||
+ | |- | ||
+ | | 2022/09/02 ||Zixi Yan || Non-Contrastive Self-supervised Learning || [[媒体文件:Non_contrastive_Self_supervised_Learning.pdf]] | ||
+ | |- | ||
+ | | 2022/09/09 ||Sirui Li || Low Resource Speech Recognition || [[媒体文件:Low_Resource_Speech_Recognition_lsr_0909.pdf]] | ||
+ | |- | ||
+ | | 2022/09/16 ||Xipin Wei || Controllable Multi-style Music Generation Model based on simple Contrastive Learning || [[媒体文件:Controllable_Multi_style_Music_Generation_Model_based_on_simple_Contrastive_learning.pdf]] | ||
+ | |- | ||
+ | | 2022/09/23 ||Haoyu Jiang || Audio Visual Learning || [[媒体文件:Audio_Visual_Learning.pdf]] | ||
+ | |- | ||
+ | | 2022/09/30 ||Chen Chen || Speech Quality Assessment || [[媒体文件:220930_cchen_SpeechQualityAssessment.pdf]] | ||
+ | |- | ||
+ | | 2022/10/07 ||Wan Lin || Cross-Domain Speaker Recognition || [[媒体文件:Cross_Domain_Speaker_Recognition.pdf]] | ||
+ | |- | ||
+ | | 2022/10/14 ||Tianhao Wang || How do deep speaker models treat silence and noises || [[媒体文件:20221014_wth.pdf]] | ||
+ | |- | ||
+ | | 2022/10/31 ||Pengqi Li || Visualization of a specific filter in CNN || [[媒体文件:Visualization of a specific filter in CNN.pdf]] | ||
+ | |- | ||
+ | | 2022/11/04 ||Zhenyu Zhou || Acoustic-aware Training for Multi-genre Speaker Recognition || [[媒体文件:20221104_acoustic_training.pdf]] | ||
+ | |- | ||
+ | | 2022/11/07 ||Chen Chen & Renmiao Chen || Experience and perceptions of collecting Audio-Visual dataset || [[媒体文件:20221107_cc_crm.pdf]] | ||
+ | |- | ||
+ | | 2022/12/23 ||Renmiao Chen || IS22 and Perceiver IO|| [[媒体文件:221223CRM.pdf]] | ||
+ | |- | ||
+ | | 2022/12/23 ||Dong Wang || NIPS2022 || [[媒体文件:NIPS2022.pdf]] | ||
+ | |- | ||
+ | | 2022/12/30 ||Chen Chen || Perceptual in Generative Audio Models || [[媒体文件:221230_cc.pdf]] | ||
+ | |- | ||
+ | | || || IS22_review || [[媒体文件:IS22_review_all.pdf]] | ||
+ | |- | ||
+ | | 2023/02/10 ||Jiaying Wang || Ordered binary speaker embedding || [[媒体文件:230210wjy.pdf]] | ||
+ | |- | ||
+ | | 2023/02/17 ||Xipin Wei || MSAT: A Multi-Scale Attentive Transformer for Multi-Instrument Symbolic Music Generation || [[媒体文件:MSAT_wxp.pdf]] | ||
+ | |- | ||
+ | | 2023/03/10 ||Zhenyu Zhou || consistence_loss&BCE_loss || [[媒体文件:consistence_loss&BCE_loss.pdf]] | ||
+ | |- | ||
+ | | 2023/03/17 ||Tianhao Wang || Score calibration in speaker verification || [[媒体文件:Score_calibration_in_speaker_verification.pdf]] | ||
+ | |- | ||
+ | | 2023/03/31 ||Wan Lin || Understand contrast and non-contrast in self-supervised learning || [[媒体文件:Understand contrast and non-contrast in self-supervised learning.pdf]] | ||
+ | |- | ||
+ | | 2023/04/14 ||Pengqi Li || Towards Attribution Methods in Deep Speaker Recognition || [[媒体文件:Towards_Attribution_Methods_in_Deep_Speaker_Recognition_230414_lpq.pdf]] | ||
+ | |- | ||
+ | | 2023/04/21 ||Chen Chen || Masked Prediction Task Based Self-supervised Multimodal Learning || [[媒体文件:Masked_prediction_task_based_self-supervised_multimodal_learning.pdf]] | ||
+ | |- | ||
+ | | 2023/04/28 ||Xiaolou Li || Incomplete Multimodal Method Exploration || [[媒体文件:Incomplete_Multimodal_Method_Exploration.pdf]] | ||
+ | |- | ||
+ | | 2023/05/04 ||Renmiao Chen || Applications of Diffusion Model || [[媒体文件:230505_Applications_of_Diffusion_Model_CRM.pdf]] | ||
+ | |- | ||
+ | | 2023/05/19 ||Jiaying Wang || DSH based method||[[媒体文件:230519_DSH_based_paper.pptx]] | ||
+ | |- | ||
+ | | 2023/05/26 ||Zhenyu Zhou || representation learning approach for domain adaptation || [[媒体文件:Representation_learning_approach_for_domain_adaptation.pptx]] | ||
+ | |- | ||
+ | | 2023/06/02 ||Pengqi Li || || | ||
+ | |- | ||
+ | | 2023/06/30 ||Tianhao Wang || Robust Speaker Verification ICASSP2023 || [[媒体文件:20230630_Robust_Speaker_Verification_ICASSP2023.pdf]] | ||
+ | |- | ||
+ | | 2023/10/13 ||Xiaolou Li || || | ||
+ | |- | ||
+ | | 2023/10/20 ||Zehua Liu || || | ||
+ | |- | ||
+ | | 2023/10/27 ||Junhui Chen || || | ||
|- | |- | ||
|} | |} | ||
− | |||
[[Old readings|Past Events]] | [[Old readings|Past Events]] |
2023年10月10日 (二) 02:22的最后版本
清华大学语音语言中心内部学习会
时间: 每周五晚19:30
地点: 1区303
Date | Speaker | Title | Materials |
---|---|---|---|
PPT模板 | 媒体文件:Weeklyreading_template.rar | ||
2021/04/01 | Haoran Sun | Zeus code regularization | 媒体文件:代码规范.pdf |
2021/05/20 | Chen Chen | Overview of speech enhancement | 媒体文件:Speech_enhancement.pdf |
2021/05/27 | Di Wang | Secret of 'hard trials' | 媒体文件:Secret_of_hard_trials.pdf |
2021/06/10 | Jingxin Shen | Expriments about thermal to RGB face synthesis with cycleGan and pix2pix | 媒体文件:Expriments about thermal to RGB face synthesis with cycleGan and pix2pix.pdf |
2021/06/17 | Yang Zhang | NIPS2020: Long-Tailed Classification by Keeping the Good and Removing the Bad Momentum Causal Effect | 媒体文件:long-tail.pdf |
2021/07/08 | Tiankai Zhi | Some experiments on stargan | 媒体文件:Some experiments on stargan.pdf |
2021/07/15 | Jiao Han | MG experiments based on ASV system | 媒体文件:MG experiments based on ASV system..pptx |
2021/07/22 | Zixi Yan & Sirui Li | Unsupervised Speech Recognition | 媒体文件:Unsupervised_Speech_Recognition.pdf |
2021/07/29 | Pengqi Li | A Simulation Study on Robust MAML | 媒体文件:A Simulation Study on Robust MAML.pdf |
2021/08/12 | Qingyang Zhu | Noise-aware method for Speech Enhancement | 媒体文件:Noise-aware method for Speech Enhancement.pdf |
2021/08/12 | Weida Liang | Unsupervised Audio-Visual Synthesis via Exemplar Autoencoders | 媒体文件:Bi-weekly_report_Liangwd.pdf |
2021/08/19 | Di Wang | Inter Dataset Variability Compensation | 媒体文件:Inter_dataset_variability_compensation.pdf |
2021/09/02 | Tiankai Zhi | One Shot VC | 媒体文件:One_shot_VC.pdf |
2021/09/09 | Jingxin Shen | Thermal Speaking | 媒体文件:Thermal_Speaking_2021.pdf |
2021/09/23 | Sirui Li & Zixi Yan | Wav2vec-U Experimental Report | 媒体文件:Wav2vec-U_experimental_report.pdf |
2021/10/20 | Renmiao Chen | Is Someone Speaking? | 媒体文件:Is_Someone_Speaking_Exploring_Long-term_Temporal_Features.pdf |
2021/10/28 | Chen Chen | WenetSpeech Introduction | 媒体文件:WenetSpeech_Dataset_Introduction.pdf |
2021/11/10 | Weida Liang | Cycle-loss Exemplar Autoencoder | 媒体文件:Cycle-loss_Exemplar_Autoencoder.pdf |
2021/11/17 | 吾买尔江 | Modulation Spectrum | 媒体文件:Modulation_Spectrum.pdf |
2021/11/24 | Chen Chen | S-DCCRN | 媒体文件:S-DCCRN_pdf.pdf |
2021/12/01 | Pengqi Li | GuidedMix: An on-the-fly data augmentation approach for robust speaker recognition system | 媒体文件:201201-GuidedMix-LPQ.pdf |
2021/12/08 | Renmiao Chen | Multimodal preson verification | 媒体文件:Multimodal_preson_verification.pdf |
2021/12/15 | Ruihai Hou | Crossmodal clustered contrastive learning: Grounding of spoken language to gesture | 媒体文件:Crossmodal_clustered_contrasti.pdf |
2021/12/29 | Zixi Yan | Capsules Network | 媒体文件:Capsules_Network.pdf |
2022/01/05 | Sirui Li | Self-Supervised Learning for speech recognition with Intermediate layer supervision | 媒体文件:SSL with Intermediate layer supervision.pdf |
2022/01/12 | Weida Liang | FragmentVC | 媒体文件:FragmentVC.pdf |
2022/01/19 | Haoyu Jiang | Multi-modality Associative Bridging through Memory: Speech Sound Recollected from Face Video | 媒体文件:Multi-modality_Associative_Bridging_through_Memory.pdf |
2022/02/14 | Interspeech 2021 Review | 媒体文件:Interspeech_paper_review_min.pdf | |
2022/02/16 | Chen Chen | Audio Visual HuBERT | 媒体文件:AVHuBERT.pdf |
2022/03/04 | Pengqi Li | Study of Visualization | 媒体文件:Visualization.pdf |
2022/03/11 | Renmiao Chen | Can audio-visual integration strengthen robustness under multimodal attacks? | 媒体文件:Audio-Visual_Robustness_Under_Multimodal_Attacks.pdf |
2022/03/11 | 吾买尔江 | Signal Separation | 媒体文件:Signal_Separation.pdf |
2022/03/18 | Chen Chen | Overview on Lip Reading and Audio-visual Speech Recognition | 媒体文件:LipReadingAndAVSR.pdf |
2022/04/01 | Ruihai Hou | Scalable Identity-Oriented Speech Retrieval | 媒体文件:Scalable_Identity-Oriented_Speech_Retrieval.pdf |
2022/04/08 | Zixi Yan | Wav2vec related papers share | 媒体文件:Wav2vec_related_papers.pdf |
2022/04/22 | Sirui Li | Speech-Based Language Modelling | 媒体文件:Speech-Based Language Modelling.pdf |
2022/04/29 | Haoyu Jiang | Models of Speaker Recognition | 媒体文件:Models_of_Speaker_Recognition.pdf |
2022/05/13 | Chen Chen | Audio-visual Representation Learning | 媒体文件:Audio_visual_representation_learning.pdf |
2022/05/20 | Haoran Sun | ||
2022/05/27 | Pengqi Li | The important ”feature” for speaker recognition | 媒体文件:The important ”feature” for speaker recognition.pdf |
2022/06/10 | Zixi Yan | Paper Share | 媒体文件:Paper_share_yzx0610.pdf |
2022/06/24 | Renmiao Chen | Transformer in multimodal | 媒体文件:Transformer_in_multimodal.pdf |
ICASSP 2022 review | 媒体文件:ICASSP2022_review.pdf 媒体文件:ICASSP-2022-readinglist.pdf | ||
2022/07/04 | Chen Chen | Video to Speech papers | 媒体文件:VTS_cc.pdf |
2022/07/08 | Ruihai Hou | ICASSP 2022 review (part) | 媒体文件:Weeklyreading_hrh.pdf |
2022/07/15 | Sirui Li | Towards End-to-end Unsupervised Speech Recognition | 媒体文件:Towards_End_to_end_Unsupervised_Speech_Recognition.pdf |
2022/07/22 | Wan Lin | AutoED: Text-independent unsupervised speaker recognition Model | 媒体文件:AutoED_spk_reg.pdf |
2022/07/29 | Haoyu Jiang | ArcFace_iQIYI-VID | 媒体文件:ArcFace_iQIYI-VID.pdf |
2022/08/05 | Chen Chen | Recent advance in VTS task | 媒体文件:RecentVTS.pdf |
2022/08/12 | Tianhao Wang | Extremal Perturbations | 媒体文件:Extremal_perturbations.pdf |
2022/08/19 | Renmiao Chen | The correlation of face and vioce | 媒体文件:The_correlation_of_face_and_vioce_CRM.pdf |
2022/09/02 | Zixi Yan | Non-Contrastive Self-supervised Learning | 媒体文件:Non_contrastive_Self_supervised_Learning.pdf |
2022/09/09 | Sirui Li | Low Resource Speech Recognition | 媒体文件:Low_Resource_Speech_Recognition_lsr_0909.pdf |
2022/09/16 | Xipin Wei | Controllable Multi-style Music Generation Model based on simple Contrastive Learning | 媒体文件:Controllable_Multi_style_Music_Generation_Model_based_on_simple_Contrastive_learning.pdf |
2022/09/23 | Haoyu Jiang | Audio Visual Learning | 媒体文件:Audio_Visual_Learning.pdf |
2022/09/30 | Chen Chen | Speech Quality Assessment | 媒体文件:220930_cchen_SpeechQualityAssessment.pdf |
2022/10/07 | Wan Lin | Cross-Domain Speaker Recognition | 媒体文件:Cross_Domain_Speaker_Recognition.pdf |
2022/10/14 | Tianhao Wang | How do deep speaker models treat silence and noises | 媒体文件:20221014_wth.pdf |
2022/10/31 | Pengqi Li | Visualization of a specific filter in CNN | 媒体文件:Visualization of a specific filter in CNN.pdf |
2022/11/04 | Zhenyu Zhou | Acoustic-aware Training for Multi-genre Speaker Recognition | 媒体文件:20221104_acoustic_training.pdf |
2022/11/07 | Chen Chen & Renmiao Chen | Experience and perceptions of collecting Audio-Visual dataset | 媒体文件:20221107_cc_crm.pdf |
2022/12/23 | Renmiao Chen | IS22 and Perceiver IO | 媒体文件:221223CRM.pdf |
2022/12/23 | Dong Wang | NIPS2022 | 媒体文件:NIPS2022.pdf |
2022/12/30 | Chen Chen | Perceptual in Generative Audio Models | 媒体文件:221230_cc.pdf |
IS22_review | 媒体文件:IS22_review_all.pdf | ||
2023/02/10 | Jiaying Wang | Ordered binary speaker embedding | 媒体文件:230210wjy.pdf |
2023/02/17 | Xipin Wei | MSAT: A Multi-Scale Attentive Transformer for Multi-Instrument Symbolic Music Generation | 媒体文件:MSAT_wxp.pdf |
2023/03/10 | Zhenyu Zhou | consistence_loss&BCE_loss | 媒体文件:consistence_loss&BCE_loss.pdf |
2023/03/17 | Tianhao Wang | Score calibration in speaker verification | 媒体文件:Score_calibration_in_speaker_verification.pdf |
2023/03/31 | Wan Lin | Understand contrast and non-contrast in self-supervised learning | 媒体文件:Understand contrast and non-contrast in self-supervised learning.pdf |
2023/04/14 | Pengqi Li | Towards Attribution Methods in Deep Speaker Recognition | 媒体文件:Towards_Attribution_Methods_in_Deep_Speaker_Recognition_230414_lpq.pdf |
2023/04/21 | Chen Chen | Masked Prediction Task Based Self-supervised Multimodal Learning | 媒体文件:Masked_prediction_task_based_self-supervised_multimodal_learning.pdf |
2023/04/28 | Xiaolou Li | Incomplete Multimodal Method Exploration | 媒体文件:Incomplete_Multimodal_Method_Exploration.pdf |
2023/05/04 | Renmiao Chen | Applications of Diffusion Model | 媒体文件:230505_Applications_of_Diffusion_Model_CRM.pdf |
2023/05/19 | Jiaying Wang | DSH based method | 媒体文件:230519_DSH_based_paper.pptx |
2023/05/26 | Zhenyu Zhou | representation learning approach for domain adaptation | 媒体文件:Representation_learning_approach_for_domain_adaptation.pptx |
2023/06/02 | Pengqi Li | ||
2023/06/30 | Tianhao Wang | Robust Speaker Verification ICASSP2023 | 媒体文件:20230630_Robust_Speaker_Verification_ICASSP2023.pdf |
2023/10/13 | Xiaolou Li | ||
2023/10/20 | Zehua Liu | ||
2023/10/27 | Junhui Chen |