“Weekly reading”版本间的差异

来自cslt Wiki
跳转至: 导航搜索
 
(23位用户的200个中间修订版本未显示)
第1行: 第1行:
*[[Speech Group Reading|张之勇 2014-12-28 APSIPA paper reading]]
 
  
*[http://www.jmlr.org/proceedings/papers/v32/graves14.pdf 刘超2015-03-11 Towards End-to-End Speech Recognition with Recurrent Neural Networks]
+
'''清华大学语音语言中心内部学习会
  
*[[媒体文件:2015 Building DNN Acoustic Models for Large Vocabulary Speech Recognition.pdf|汤志远2015-3-18 - Building DNN Acoustic Models for Large Vocabulary Speech Recognition]]
+
'''时间: 每周五晚19:30'''
  
*[[媒体文件:CONTRASTIVE AUTO-ENCODER FOR PHONEME RECOGNITION.pdf|林一叶2015-4-1 - CONTRASTIVE AUTO-ENCODER FOR PHONEME RECOGNITION]]
+
'''地点: 1区303'''
  
*[[媒体文件:2014 speech dereverberation using weighted prediction error with laplacian model of the designed signal.pdf|张雪薇2015-4-1 - speech dereverberation using weighted prediction error with laplacian model of the designed signal]]
 
  
*[http://arxiv.org/pdf/1312.6184v7.pdf 王东2015-4-1 - Do Deep Nets Really Need to be Deep?]
+
{| class="wikitable"
 +
! Date !! Speaker!! Title !! Materials
 +
|-
 +
|  ||  || PPT模板 ||[[媒体文件:Weeklyreading_template.rar]]
 +
|-
 +
| 2021/04/01  ||Haoran Sun    || Zeus code regularization ||[[媒体文件:代码规范.pdf]]
 +
|-
 +
| 2021/05/20  ||Chen Chen    || Overview of speech enhancement|| [[媒体文件:Speech_enhancement.pdf]]
 +
|-
 +
| 2021/05/27  ||Di Wang      || Secret of 'hard trials' || [[媒体文件:Secret_of_hard_trials.pdf]]
 +
|-
 +
| 2021/06/10  ||Jingxin Shen  ||Expriments about thermal to RGB face synthesis with cycleGan and pix2pix || [[媒体文件:Expriments about thermal to RGB face synthesis with cycleGan and pix2pix.pdf]]
 +
|-
 +
| 2021/06/17  ||Yang Zhang    || NIPS2020: Long-Tailed Classification by Keeping the Good and Removing the Bad Momentum Causal Effect || [[媒体文件:long-tail.pdf]]
 +
|-
 +
| 2021/07/08  ||Tiankai Zhi  || Some experiments on stargan ||[[媒体文件:Some experiments on stargan.pdf]]
 +
|-
 +
| 2021/07/15  ||Jiao Han      || MG experiments based on ASV system || [[媒体文件:MG experiments based on ASV system..pptx]]
 +
|-
 +
| 2021/07/22  ||Zixi Yan & Sirui Li || Unsupervised Speech Recognition || [[媒体文件:Unsupervised_Speech_Recognition.pdf]]
 +
|-
 +
| 2021/07/29  ||Pengqi Li    || A Simulation Study on 􏰛􏰜 Ro􏰛bust MAML || [[媒体文件:A Simulation Study on 􏰛􏰜 Ro􏰛bust MAML.pdf]]
 +
|-
 +
| 2021/08/12  ||Qingyang Zhu || Noise-aware method for Speech Enhancement || [[媒体文件:Noise-aware method for Speech Enhancement.pdf]]
 +
|-
 +
| 2021/08/12  ||Weida Liang  ||  Unsupervised Audio-Visual Synthesis via Exemplar Autoencoders  ||  [[媒体文件:Bi-weekly_report_Liangwd.pdf]]
 +
|-
 +
| 2021/08/19  ||Di Wang      || Inter Dataset Variability Compensation ||  [[媒体文件:Inter_dataset_variability_compensation.pdf]]
 +
|-
 +
| 2021/09/02  ||Tiankai Zhi  || One Shot VC || [[媒体文件:One_shot_VC.pdf]]
 +
|-
 +
| 2021/09/09  ||Jingxin Shen || Thermal Speaking || [[媒体文件:Thermal_Speaking_2021.pdf]]
 +
|-
 +
| 2021/09/23  ||Sirui Li & Zixi Yan || Wav2vec-U Experimental Report || [[媒体文件:Wav2vec-U_experimental_report.pdf ‎]]
 +
|-
 +
| 2021/10/20  ||Renmiao Chen || Is Someone Speaking? || [[媒体文件:Is_Someone_Speaking_Exploring_Long-term_Temporal_Features.pdf ‎]]
 +
|-
 +
| 2021/10/28  ||Chen Chen    || WenetSpeech Introduction || [[媒体文件:WenetSpeech_Dataset_Introduction.pdf ‎]]
 +
|-
 +
| 2021/11/10  ||Weida Liang  || Cycle-loss Exemplar Autoencoder || [[媒体文件:Cycle-loss_Exemplar_Autoencoder.pdf ‎]]
 +
|-
 +
| 2021/11/17  ||吾买尔江      || Modulation Spectrum || [[媒体文件:Modulation_Spectrum.pdf ‎]]
 +
|-
 +
| 2021/11/24  ||Chen Chen    || S-DCCRN || [[媒体文件:S-DCCRN_pdf.pdf ‎]]
 +
|-
 +
| 2021/12/01  ||Pengqi Li    || GuidedMix: An on-the-fly data augmentation approach for robust speaker recognition system || [[媒体文件:201201-GuidedMix-LPQ.pdf ‎]]
 +
|-
 +
| 2021/12/08  ||Renmiao Chen || Multimodal preson verification ||  [[媒体文件:Multimodal_preson_verification.pdf]]
 +
|-
 +
| 2021/12/15  ||Ruihai Hou  || Crossmodal clustered contrastive learning: Grounding of spoken language to gesture || [[媒体文件:Crossmodal_clustered_contrasti.pdf]]
 +
|-
 +
| 2021/12/29  ||Zixi Yan    || Capsules Network || [[媒体文件:Capsules_Network.pdf]]
 +
|-
 +
| 2022/01/05  ||Sirui Li    || Self-Supervised Learning for speech recognition with Intermediate layer supervision || [[媒体文件:SSL with Intermediate layer supervision.pdf]]
 +
|-
 +
| 2022/01/12  ||Weida Liang  || FragmentVC || [[媒体文件:FragmentVC.pdf]]
 +
|-
 +
| 2022/01/19  ||Haoyu Jiang  || Multi-modality Associative Bridging through Memory: Speech Sound Recollected from Face Video || [[媒体文件:Multi-modality_Associative_Bridging_through_Memory.pdf]]
 +
|-
 +
| 2022/02/14  ||            || Interspeech 2021 Review || [[媒体文件:Interspeech_paper_review_min.pdf]]
 +
|-
 +
| 2022/02/16  ||Chen Chen    || Audio Visual HuBERT || [[媒体文件:AVHuBERT.pdf]]
 +
|-
 +
| 2022/03/04  ||Pengqi Li    || Study of Visualization || [[媒体文件:Visualization.pdf]]
 +
|-
 +
| 2022/03/11  ||Renmiao Chen || Can audio-visual integration strengthen robustness under multimodal attacks? || [[媒体文件:Audio-Visual_Robustness_Under_Multimodal_Attacks.pdf]]
 +
|-
 +
| 2022/03/11  ||吾买尔江      || Signal Separation || [[媒体文件:Signal_Separation.pdf]]
 +
|-
 +
| 2022/03/18  ||Chen Chen    || Overview on Lip Reading and Audio-visual Speech Recognition || [[媒体文件:LipReadingAndAVSR.pdf]]
 +
|-
 +
| 2022/04/01  ||Ruihai Hou  || Scalable Identity-Oriented Speech Retrieval || [[媒体文件:Scalable_Identity-Oriented_Speech_Retrieval.pdf]]
 +
|-
 +
| 2022/04/08  ||Zixi Yan    || Wav2vec related papers share || [[媒体文件:Wav2vec_related_papers.pdf]]
 +
|-
 +
| 2022/04/22  ||Sirui Li    || Speech-Based Language Modelling || [[媒体文件:Speech-Based Language Modelling.pdf]]
 +
|-
 +
| 2022/04/29  ||Haoyu Jiang  || Models of Speaker Recognition || [[媒体文件:Models_of_Speaker_Recognition.pdf]]
 +
|-
 +
| 2022/05/13  ||Chen Chen    || Audio-visual Representation Learning  || [[媒体文件:Audio_visual_representation_learning.pdf]]
 +
|-
 +
| 2022/05/20  ||Haoran Sun  ||  ||
 +
|-
 +
| 2022/05/27  ||Pengqi Li    || The important ”feature” for speaker recognition || [[媒体文件:The important ”feature” for speaker recognition.pdf]]
 +
|-
 +
| 2022/06/10  ||Zixi Yan    || Paper Share || [[媒体文件:Paper_share_yzx0610.pdf]]
 +
|-
 +
| 2022/06/24  ||Renmiao Chen || Transformer in multimodal || [[媒体文件:Transformer_in_multimodal.pdf]]
 +
|-
 +
|            ||            || ICASSP 2022 review || [[媒体文件:ICASSP2022_review.pdf]]  [[媒体文件:ICASSP-2022-readinglist.pdf]]
 +
|-
 +
| 2022/07/04  ||Chen Chen    || Video to Speech papers || [[媒体文件:VTS_cc.pdf]]
 +
|-
 +
| 2022/07/08  ||Ruihai Hou  || ICASSP 2022 review (part) || [[媒体文件:Weeklyreading_hrh.pdf]]
 +
|-
 +
| 2022/07/15  ||Sirui Li    || Towards End-to-end Unsupervised Speech Recognition || [[媒体文件:Towards_End_to_end_Unsupervised_Speech_Recognition.pdf]]
 +
|-
 +
| 2022/07/22  ||Wan Lin      || AutoED: Text-independent unsupervised speaker recognition Model|| [[媒体文件:AutoED_spk_reg.pdf]]
 +
|-
 +
| 2022/07/29  ||Haoyu Jiang  || ArcFace_iQIYI-VID || [[媒体文件:ArcFace_iQIYI-VID.pdf]]
 +
|-
 +
| 2022/08/05  ||Chen Chen    || Recent advance in VTS task || [[媒体文件:RecentVTS.pdf]]
 +
|-
 +
| 2022/08/12  ||Tianhao Wang || Extremal Perturbations || [[媒体文件:Extremal_perturbations.pdf]]
 +
|-
 +
| 2022/08/19  ||Renmiao Chen || The correlation of face and vioce || [[媒体文件:The_correlation_of_face_and_vioce_CRM.pdf]]
 +
|-
 +
| 2022/09/02  ||Zixi Yan    || Non-Contrastive Self-supervised Learning || [[媒体文件:Non_contrastive_Self_supervised_Learning.pdf]]
 +
|-
 +
| 2022/09/09  ||Sirui Li    || Low Resource Speech Recognition || [[媒体文件:Low_Resource_Speech_Recognition_lsr_0909.pdf]]
 +
|-
 +
| 2022/09/16  ||Xipin Wei    || Controllable Multi-style Music Generation Model based on simple Contrastive Learning || [[媒体文件:Controllable_Multi_style_Music_Generation_Model_based_on_simple_Contrastive_learning.pdf]]
 +
|-
 +
| 2022/09/23  ||Haoyu Jiang  || Audio Visual Learning || [[媒体文件:Audio_Visual_Learning.pdf]]
 +
|-
 +
| 2022/09/30  ||Chen Chen    || Speech Quality Assessment || [[媒体文件:220930_cchen_SpeechQualityAssessment.pdf]]
 +
|-
 +
| 2022/10/07  ||Wan Lin      || Cross-Domain Speaker Recognition || [[媒体文件:Cross_Domain_Speaker_Recognition.pdf]]
 +
|-
 +
| 2022/10/14  ||Tianhao Wang || How do deep speaker models treat silence and noises || [[媒体文件:20221014_wth.pdf]]
 +
|-
 +
| 2022/10/31  ||Pengqi Li    || Visualization of a specific filter in CNN || [[媒体文件:Visualization of a specific filter in CNN.pdf]]
 +
|-
 +
| 2022/11/04  ||Zhenyu Zhou  || Acoustic-aware Training for Multi-genre Speaker Recognition || [[媒体文件:20221104_acoustic_training.pdf]]
 +
|-
 +
| 2022/11/07  ||Chen Chen & Renmiao Chen || Experience and perceptions of collecting Audio-Visual dataset || [[媒体文件:20221107_cc_crm.pdf]]
 +
|-
 +
| 2022/12/23  ||Renmiao Chen || IS22 and Perceiver IO|| [[媒体文件:221223CRM.pdf]]
 +
|-
 +
| 2022/12/23  ||Dong Wang    || NIPS2022 || [[媒体文件:NIPS2022.pdf]]
 +
|-
 +
| 2022/12/30  ||Chen Chen    || Perceptual in Generative Audio Models || [[媒体文件:221230_cc.pdf]]
 +
|-
 +
|            ||            || IS22_review || [[媒体文件:IS22_review_all.pdf]]
 +
|-
 +
| 2023/02/10  ||Jiaying Wang || Ordered binary speaker embedding || [[媒体文件:230210wjy.pdf]]
 +
|-
 +
| 2023/02/17  ||Xipin Wei    || MSAT: A Multi-Scale Attentive Transformer for Multi-Instrument Symbolic Music Generation || [[媒体文件:MSAT_wxp.pdf]]
 +
|-
 +
| 2023/03/10  ||Zhenyu Zhou  || consistence_loss&BCE_loss ||  [[媒体文件:consistence_loss&BCE_loss.pdf]]
 +
|-
 +
| 2023/03/17  ||Tianhao Wang || Score calibration in speaker verification || [[媒体文件:Score_calibration_in_speaker_verification.pdf]]
 +
|-
 +
| 2023/03/31  ||Wan Lin      || Understand contrast and non-contrast in self-supervised learning || [[媒体文件:Understand contrast and non-contrast in self-supervised learning.pdf]]
 +
|-
 +
| 2023/04/14  ||Pengqi Li    || Towards Attribution Methods in Deep Speaker Recognition || [[媒体文件:Towards_Attribution_Methods_in_Deep_Speaker_Recognition_230414_lpq.pdf]]
 +
|-
 +
| 2023/04/21  ||Chen Chen    || Masked Prediction Task Based Self-supervised Multimodal Learning || [[媒体文件:Masked_prediction_task_based_self-supervised_multimodal_learning.pdf]]
 +
|-
 +
| 2023/04/28  ||Xiaolou Li  || Incomplete Multimodal Method Exploration || [[媒体文件:Incomplete_Multimodal_Method_Exploration.pdf]]
 +
|-
 +
| 2023/05/04  ||Renmiao Chen || Applications of Diffusion Model || [[媒体文件:230505_Applications_of_Diffusion_Model_CRM.pdf]]
 +
|-
 +
| 2023/05/19  ||Jiaying Wang ||  DSH based method||[[媒体文件:230519_DSH_based_paper.pptx]]
 +
|-
 +
| 2023/05/26  ||Zhenyu Zhou  || representation learning approach for domain adaptation || [[媒体文件:Representation_learning_approach_for_domain_adaptation.pptx]]
 +
|-
 +
| 2023/06/02  ||Pengqi Li    ||  ||
 +
|-
 +
| 2023/06/30  ||Tianhao Wang || Robust Speaker Verification ICASSP2023 || [[媒体文件:20230630_Robust_Speaker_Verification_ICASSP2023.pdf]]
 +
|-
 +
| 2023/10/13  ||Xiaolou Li  ||  ||
 +
|-
 +
| 2023/10/20  ||Zehua Liu    ||  ||
 +
|-
 +
| 2023/10/27  ||Junhui Chen  ||  ||
 +
|-
 +
|}
  
*[http://arxiv.org/pdf/1503.02531v1.pdf 王东2015-4-1 - Distilling the Knowledge in a Neural Network]
 
  
*[[媒体文件:Neural_Network_Acoustic_Models_with_Superviesed_Hidden_Layers_for_Automatic_Speech_Recognition.pdf|殷实2015-4-8 - Neural_Network_Acoustic_Models_with_Superviesed_Hidden_Layers_for_Automatic_Speech_Recognition]]
 
  
*[[媒体文件:Ensemble_Deep_Learning_for_Speech_Recognition.pdf|赵梦原2015-4-8 - Ensemble_Deep_Learning_for_Speech_Recognition]]
+
[[Old readings|Past Events]]
 
+
*[[媒体文件:An evaluation of target speech for a nonaudible murmur enhancement system in noisy enviroment.pdf|曾翔宇2015-3-11 - An evaluation of target speech for a nonaudible murmur enhancement system in noisy enviroment]]
+
 
+
*[http://www.kecl.ntt.co.jp/icl/signal/hori/publications/thori_icassp2014.pdf 刘超2015-04-14 Real-time one-pass decoding with recurrent neural network language model for speech recognition]
+
 
+
*[[媒体文件:2014 Reshaping deep neural network for fast decoding by node-pruning.pdf|汤志远 2015-04-29 Reshaping deep neural network for fast decoding by node-pruning]]
+
 
+
*[[媒体文件:06843244.pdf|张雪薇2015-4-29 SPEECH DEREVERBERATION WITH MULTI-CHANNEL LINEAR PREDICTION AND SPARSE PRIORS FOR THE DESIRED SIGNAL]]
+
 
+
*[[媒体文件:0000096.pdf|张雪薇2015-4-29 MULTI-CHANNEL LINEAR PREDICTION-BASED SPEECH DEREVERBERATION WITH LOW-RANK POWER SPECTROGRAM APPROXIMATION]]
+
 
+
*[[媒体文件:0000096.pdf|张之勇2015-5-6 On the importance of initialization and momentum in deep learning.pdf]]
+
 
+
*[[媒体文件:ICASSP.rar|张雪薇2015-5-20 ICASSP_dereveration]]
+
 
+
*[[媒体文件:ICASSP Selected Readings.rar|汤志远2015-5-27 ICASSP selected readings]]
+
 
+
*[[媒体文件:2015_Submodular data selection with acoustic and phonetic features for automatic speech recognition.pdf|张之勇2015-6-24 2015_Submodular data selection with acoustic and phonetic features for automatic speech recognition.pdf]]
+
 
+
*[[媒体文件:SPEECH DEREVERBERATION USING A LEARNED SPEECH MODEL.pdf|张雪薇2015-7-1 SPEECH DEREVERBERATION USING A LEARNED SPEECH MODEL.pdf]]
+
 
+
*[http://arxiv.org/abs/1504.01482 汤志远2015-7-8 Deep Recurrent Neural Networks for Acoustic Modelling]
+
 
+
*[http://arxiv.org/abs/1503.04069 汤志远2015-7-8 LSTM: A Search Space Odyssey]
+
 
+
*[http://arxiv.org/abs/1507.01526 汤志远2015-7-8 Grid Long Short-Term Memory]
+
 
+
*[http://arxiv.org/abs/1506.02078 汤志远2015-7-8 Visualizing and Understanding Recurrent Networks]
+
 
+
*[[媒体文件:annealed_dropout_trained_maxout_networks_for_improved_lvcsr.pdf|赵梦原2015-7-8 Annealed Dropout_trained Maxout Networks for Improved LVCSR]]
+
 
+
*[[媒体文件:automatic_pronunciation_verification_for_speech_recognition.pdf|赵梦原2015-7-8 Automatic Pronunciation Verification for Speech Recognition]]
+
 
+
*[[媒体文件:2015_Frame-by-frame_language_identification_in_short_utterances_using_deep_neural_networks.pdf|张雪薇2015-7-16 2015_Frame-by-frame_language_identification_in_short_utterances_using_deep_neural_networks]]
+
 
+
*[[媒体文件:SEQUENCE_CLASSIFICATION_USING_THE_HIGH-LEVEL_FEATURES_EXTRACTED_.pdf|张雪薇2015-7-16 SEQUENCE_CLASSIFICATION_USING_THE_HIGH-LEVEL_FEATURES_EXTRACTED]]
+
 
+
*[http://www.pamitc.org/cvpr15/files/lecun-20150610-cvpr-keynote.pdf 王东2015-7-22 What is Wrong with Deep Learning (Yann Lecun at CVPR 2015)]
+
 
+
*[http://videolectures.net/icml09_bengio_lecun_tldar/ 王东2015-7-22 A tutorial from Bengio and Lecun]
+
 
+
*[[asr-read-icml|王东2015-7-22 ICML 2015 reading list]]
+
 
+
*[http://jmlr.org/proceedings/papers/v37/ioffe15.pdf 王东2015-7-22 Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift]
+
 
+
*[http://cslt.riit.tsinghua.edu.cn/cgi-bin/cvss/cvss_request.pl 王东2015-7-24 No initial learning rate anymore]
+
 
+
*[http://jmlr.org/proceedings/papers/v37/long15.pdf 王东2015-7-29 Learning Transferable Features with Deep Adaptation Networks]
+
 
+
*[https://personalrobotics.ri.cmu.edu/courses/papers/Amari1998a.pdf 王东2015-7-29 Natural Gradient Works Efficiently in Learning]
+
 
+
*[http://arxiv.org/abs/1206.5533 王东2015-7-29 Practical Recommendations for Gradient-Based Training of Deep Architectures]
+
 
+
*[http://link.springer.com/chapter/10.1007%2F978-3-642-35289-8_3 王东2015-7-29 Efficient Backprop]
+
*[[媒体文件:2015_Batch normalization Accelerating deep network training by reducing internal covariate shift.pdf|张之勇2015-7-31 2015_Batch normalization Accelerating deep network training by reducing internal covariate shift.pdf]]
+
 
+
*[[媒体文件:A_time_delay_neural_network_architecture_for_efficient_modeling_of_long_temporal_contexts.pdf|赵梦原2015-7-29 A time delay neural network architecture for efficient modeling of long temporal contexts.pdf]]
+
 
+
*[[媒体文件:1-s2.0-S0885230814000114-main.pdf|张雪薇2015-8-05 Feature enhancement by deep LSTM networks for ASRin reverberant multisource environments]]
+
 
+
*[[媒体文件:Recurrent Neural Networks for Noise Reduction in Robust ASR.pdf|张雪薇2015-8-05 Recurrent Neural Networks for Noise Reduction in Robust ASR]]
+
 
+
*[[媒体文件:2015 Fast and Accurate Recurrent Neural Network Acoustic Models for Speech Recognition.pdf|汤志远 2015-8-05 Fast and Accurate Recurrent Neural Network Acoustic Models for Speech Recognition]]
+
 
+
*[[媒体文件:CUDA_C_Programming_Guide.pdf|苏圣 2015-8-012 CUDA_C_Programming_Guide.pdf]]
+
 
+
*[[媒体文件:CUBLAS_Library.pdf|苏圣 2015-8-012 CUBLAS_Library.pdf]]
+
 
+
*[[媒体文件:2015_Deep learning with Elastic Averaging SGD.pdf|张之勇 2015-08-12 2015_Deep learning with Elastic Averaging SGD.pdf]]
+
 
+
*[[媒体文件:宁可_2015-08-21_2010_Bed-tree-_an_all-purpose_index_structure_for_string_similarity_search_based_on_edit_distance.pdf |宁可 2015-08-21 2010_Bed-tree: an all-purpose index structure for string similarity search based on edit distance.pdf]]
+

2023年10月10日 (二) 02:22的最后版本

清华大学语音语言中心内部学习会

时间: 每周五晚19:30

地点: 1区303


Date Speaker Title Materials
PPT模板 媒体文件:Weeklyreading_template.rar
2021/04/01 Haoran Sun Zeus code regularization 媒体文件:代码规范.pdf
2021/05/20 Chen Chen Overview of speech enhancement 媒体文件:Speech_enhancement.pdf
2021/05/27 Di Wang Secret of 'hard trials' 媒体文件:Secret_of_hard_trials.pdf
2021/06/10 Jingxin Shen Expriments about thermal to RGB face synthesis with cycleGan and pix2pix 媒体文件:Expriments about thermal to RGB face synthesis with cycleGan and pix2pix.pdf
2021/06/17 Yang Zhang NIPS2020: Long-Tailed Classification by Keeping the Good and Removing the Bad Momentum Causal Effect 媒体文件:long-tail.pdf
2021/07/08 Tiankai Zhi Some experiments on stargan 媒体文件:Some experiments on stargan.pdf
2021/07/15 Jiao Han MG experiments based on ASV system 媒体文件:MG experiments based on ASV system..pptx
2021/07/22 Zixi Yan & Sirui Li Unsupervised Speech Recognition 媒体文件:Unsupervised_Speech_Recognition.pdf
2021/07/29 Pengqi Li A Simulation Study on 􏰛􏰜 Ro􏰛bust MAML 媒体文件:A Simulation Study on 􏰛􏰜 Ro􏰛bust MAML.pdf
2021/08/12 Qingyang Zhu Noise-aware method for Speech Enhancement 媒体文件:Noise-aware method for Speech Enhancement.pdf
2021/08/12 Weida Liang Unsupervised Audio-Visual Synthesis via Exemplar Autoencoders 媒体文件:Bi-weekly_report_Liangwd.pdf
2021/08/19 Di Wang Inter Dataset Variability Compensation 媒体文件:Inter_dataset_variability_compensation.pdf
2021/09/02 Tiankai Zhi One Shot VC 媒体文件:One_shot_VC.pdf
2021/09/09 Jingxin Shen Thermal Speaking 媒体文件:Thermal_Speaking_2021.pdf
2021/09/23 Sirui Li & Zixi Yan Wav2vec-U Experimental Report 媒体文件:Wav2vec-U_experimental_report.pdf ‎
2021/10/20 Renmiao Chen Is Someone Speaking? 媒体文件:Is_Someone_Speaking_Exploring_Long-term_Temporal_Features.pdf ‎
2021/10/28 Chen Chen WenetSpeech Introduction 媒体文件:WenetSpeech_Dataset_Introduction.pdf ‎
2021/11/10 Weida Liang Cycle-loss Exemplar Autoencoder 媒体文件:Cycle-loss_Exemplar_Autoencoder.pdf ‎
2021/11/17 吾买尔江 Modulation Spectrum 媒体文件:Modulation_Spectrum.pdf ‎
2021/11/24 Chen Chen S-DCCRN 媒体文件:S-DCCRN_pdf.pdf ‎
2021/12/01 Pengqi Li GuidedMix: An on-the-fly data augmentation approach for robust speaker recognition system 媒体文件:201201-GuidedMix-LPQ.pdf ‎
2021/12/08 Renmiao Chen Multimodal preson verification 媒体文件:Multimodal_preson_verification.pdf
2021/12/15 Ruihai Hou Crossmodal clustered contrastive learning: Grounding of spoken language to gesture 媒体文件:Crossmodal_clustered_contrasti.pdf
2021/12/29 Zixi Yan Capsules Network 媒体文件:Capsules_Network.pdf
2022/01/05 Sirui Li Self-Supervised Learning for speech recognition with Intermediate layer supervision 媒体文件:SSL with Intermediate layer supervision.pdf
2022/01/12 Weida Liang FragmentVC 媒体文件:FragmentVC.pdf
2022/01/19 Haoyu Jiang Multi-modality Associative Bridging through Memory: Speech Sound Recollected from Face Video 媒体文件:Multi-modality_Associative_Bridging_through_Memory.pdf
2022/02/14 Interspeech 2021 Review 媒体文件:Interspeech_paper_review_min.pdf
2022/02/16 Chen Chen Audio Visual HuBERT 媒体文件:AVHuBERT.pdf
2022/03/04 Pengqi Li Study of Visualization 媒体文件:Visualization.pdf
2022/03/11 Renmiao Chen Can audio-visual integration strengthen robustness under multimodal attacks? 媒体文件:Audio-Visual_Robustness_Under_Multimodal_Attacks.pdf
2022/03/11 吾买尔江 Signal Separation 媒体文件:Signal_Separation.pdf
2022/03/18 Chen Chen Overview on Lip Reading and Audio-visual Speech Recognition 媒体文件:LipReadingAndAVSR.pdf
2022/04/01 Ruihai Hou Scalable Identity-Oriented Speech Retrieval 媒体文件:Scalable_Identity-Oriented_Speech_Retrieval.pdf
2022/04/08 Zixi Yan Wav2vec related papers share 媒体文件:Wav2vec_related_papers.pdf
2022/04/22 Sirui Li Speech-Based Language Modelling 媒体文件:Speech-Based Language Modelling.pdf
2022/04/29 Haoyu Jiang Models of Speaker Recognition 媒体文件:Models_of_Speaker_Recognition.pdf
2022/05/13 Chen Chen Audio-visual Representation Learning 媒体文件:Audio_visual_representation_learning.pdf
2022/05/20 Haoran Sun
2022/05/27 Pengqi Li The important ”feature” for speaker recognition 媒体文件:The important ”feature” for speaker recognition.pdf
2022/06/10 Zixi Yan Paper Share 媒体文件:Paper_share_yzx0610.pdf
2022/06/24 Renmiao Chen Transformer in multimodal 媒体文件:Transformer_in_multimodal.pdf
ICASSP 2022 review 媒体文件:ICASSP2022_review.pdf 媒体文件:ICASSP-2022-readinglist.pdf
2022/07/04 Chen Chen Video to Speech papers 媒体文件:VTS_cc.pdf
2022/07/08 Ruihai Hou ICASSP 2022 review (part) 媒体文件:Weeklyreading_hrh.pdf
2022/07/15 Sirui Li Towards End-to-end Unsupervised Speech Recognition 媒体文件:Towards_End_to_end_Unsupervised_Speech_Recognition.pdf
2022/07/22 Wan Lin AutoED: Text-independent unsupervised speaker recognition Model 媒体文件:AutoED_spk_reg.pdf
2022/07/29 Haoyu Jiang ArcFace_iQIYI-VID 媒体文件:ArcFace_iQIYI-VID.pdf
2022/08/05 Chen Chen Recent advance in VTS task 媒体文件:RecentVTS.pdf
2022/08/12 Tianhao Wang Extremal Perturbations 媒体文件:Extremal_perturbations.pdf
2022/08/19 Renmiao Chen The correlation of face and vioce 媒体文件:The_correlation_of_face_and_vioce_CRM.pdf
2022/09/02 Zixi Yan Non-Contrastive Self-supervised Learning 媒体文件:Non_contrastive_Self_supervised_Learning.pdf
2022/09/09 Sirui Li Low Resource Speech Recognition 媒体文件:Low_Resource_Speech_Recognition_lsr_0909.pdf
2022/09/16 Xipin Wei Controllable Multi-style Music Generation Model based on simple Contrastive Learning 媒体文件:Controllable_Multi_style_Music_Generation_Model_based_on_simple_Contrastive_learning.pdf
2022/09/23 Haoyu Jiang Audio Visual Learning 媒体文件:Audio_Visual_Learning.pdf
2022/09/30 Chen Chen Speech Quality Assessment 媒体文件:220930_cchen_SpeechQualityAssessment.pdf
2022/10/07 Wan Lin Cross-Domain Speaker Recognition 媒体文件:Cross_Domain_Speaker_Recognition.pdf
2022/10/14 Tianhao Wang How do deep speaker models treat silence and noises 媒体文件:20221014_wth.pdf
2022/10/31 Pengqi Li Visualization of a specific filter in CNN 媒体文件:Visualization of a specific filter in CNN.pdf
2022/11/04 Zhenyu Zhou Acoustic-aware Training for Multi-genre Speaker Recognition 媒体文件:20221104_acoustic_training.pdf
2022/11/07 Chen Chen & Renmiao Chen Experience and perceptions of collecting Audio-Visual dataset 媒体文件:20221107_cc_crm.pdf
2022/12/23 Renmiao Chen IS22 and Perceiver IO 媒体文件:221223CRM.pdf
2022/12/23 Dong Wang NIPS2022 媒体文件:NIPS2022.pdf
2022/12/30 Chen Chen Perceptual in Generative Audio Models 媒体文件:221230_cc.pdf
IS22_review 媒体文件:IS22_review_all.pdf
2023/02/10 Jiaying Wang Ordered binary speaker embedding 媒体文件:230210wjy.pdf
2023/02/17 Xipin Wei MSAT: A Multi-Scale Attentive Transformer for Multi-Instrument Symbolic Music Generation 媒体文件:MSAT_wxp.pdf
2023/03/10 Zhenyu Zhou consistence_loss&BCE_loss 媒体文件:consistence_loss&BCE_loss.pdf
2023/03/17 Tianhao Wang Score calibration in speaker verification 媒体文件:Score_calibration_in_speaker_verification.pdf
2023/03/31 Wan Lin Understand contrast and non-contrast in self-supervised learning 媒体文件:Understand contrast and non-contrast in self-supervised learning.pdf
2023/04/14 Pengqi Li Towards Attribution Methods in Deep Speaker Recognition 媒体文件:Towards_Attribution_Methods_in_Deep_Speaker_Recognition_230414_lpq.pdf
2023/04/21 Chen Chen Masked Prediction Task Based Self-supervised Multimodal Learning 媒体文件:Masked_prediction_task_based_self-supervised_multimodal_learning.pdf
2023/04/28 Xiaolou Li Incomplete Multimodal Method Exploration 媒体文件:Incomplete_Multimodal_Method_Exploration.pdf
2023/05/04 Renmiao Chen Applications of Diffusion Model 媒体文件:230505_Applications_of_Diffusion_Model_CRM.pdf
2023/05/19 Jiaying Wang DSH based method 媒体文件:230519_DSH_based_paper.pptx
2023/05/26 Zhenyu Zhou representation learning approach for domain adaptation 媒体文件:Representation_learning_approach_for_domain_adaptation.pptx
2023/06/02 Pengqi Li
2023/06/30 Tianhao Wang Robust Speaker Verification ICASSP2023 媒体文件:20230630_Robust_Speaker_Verification_ICASSP2023.pdf
2023/10/13 Xiaolou Li
2023/10/20 Zehua Liu
2023/10/27 Junhui Chen


Past Events