“ASR-nsfc-publication”版本间的差异
来自cslt Wiki
(相同用户的4个中间修订版本未显示) | |||
第1行: | 第1行: | ||
==Journal papers (SCI)== | ==Journal papers (SCI)== | ||
− | # Lantian Li, etc., A Principle Solution for Enroll-Test Mismatch, IEEE Transaction on Audio, Speech and Language Processing [https://arxiv.org/pdf/2012.12471.pdf] | + | # Lantian Li, Ruiqi Liu, Jiawen Kang, Yue Fa, Hao Cui, Yunqi Cai, Ravichander Vipperla, Thomas Fang Zheng and Dong Wang. "CN-Celeb: multi-genre speaker recognition", Speech Communication, 2022. [https://arxiv.org/pdf/2012.12468 pdf] |
+ | # Lantian Li, Dong Wang etc., A Principle Solution for Enroll-Test Mismatch, IEEE Transaction on Audio, Speech and Language Processing, 2021 [https://arxiv.org/pdf/2012.12471.pdf pdf] | ||
# Yunqi Cai, Lantian Li, Andrew Abel, Xiaoyan Zhu, Dong Wang, "Deep Normalization for Speaker Vectors", IEEE Transactions on Audio, Speech and Language Processing, 2020. [https://arxiv.org/pdf/2004.04095.pdf pdf] | # Yunqi Cai, Lantian Li, Andrew Abel, Xiaoyan Zhu, Dong Wang, "Deep Normalization for Speaker Vectors", IEEE Transactions on Audio, Speech and Language Processing, 2020. [https://arxiv.org/pdf/2004.04095.pdf pdf] | ||
# Dong Wang, "A Simulation Study on Optimal Scores for Speaker Recognition", EURASIP Journal on Audio, Speech, and Music Processing, 2020. [http://wangd.cslt.org/public/pdf/nl-eurosip.pdf pdf] | # Dong Wang, "A Simulation Study on Optimal Scores for Speaker Recognition", EURASIP Journal on Audio, Speech, and Music Processing, 2020. [http://wangd.cslt.org/public/pdf/nl-eurosip.pdf pdf] | ||
# Gulnur Arkin, Askar Hamdulla and Mijit Ablimit , Analysis of phonemes and tones confusion rules obtained by ASR,Wireless Networks,2020.[https://link.springer.com/article/10.1007%2Fs11276-019-02220-2 link] | # Gulnur Arkin, Askar Hamdulla and Mijit Ablimit , Analysis of phonemes and tones confusion rules obtained by ASR,Wireless Networks,2020.[https://link.springer.com/article/10.1007%2Fs11276-019-02220-2 link] | ||
# Zhiyuan Tang, Lantian Li, Dong Wang, Ravichander Vipperla, "Collaborative Joint Training With Multitask Recurrent Model for Speech and Speaker Recognition", IEEE Transactions on Audio, Speech and Language Processing 2018, vol 25, no.3. [http://ieeexplore.ieee.org/document/7782371 online] | # Zhiyuan Tang, Lantian Li, Dong Wang, Ravichander Vipperla, "Collaborative Joint Training With Multitask Recurrent Model for Speech and Speaker Recognition", IEEE Transactions on Audio, Speech and Language Processing 2018, vol 25, no.3. [http://ieeexplore.ieee.org/document/7782371 online] | ||
− | # Zhiyuan Tang,Dong Wang,Yixiang Chen,Lantian Li,Andrew Abel, "Phonetic Temporal Neural Model for Language Identification", IEEE Transactions on Audio, Speech and Language Processing 2017. [http://ieeexplore.ieee.org/document/8070977 online] | + | # Zhiyuan Tang,Dong Wang,Yixiang Chen,Lantian Li,Andrew Abel, "Phonetic Temporal Neural Model for Language Identification", IEEE Transactions on Audio, Speech and Language Processing 2017. [http://ieeexplore.ieee.org/document/8070977 online] |
==Journal papers (EI)== | ==Journal papers (EI)== | ||
第14行: | 第15行: | ||
==Conference papers (EI)== | ==Conference papers (EI)== | ||
+ | # Tiankai Zhi, Ying Shi, Wenqiang Du, Guanyu Li and Dong Wang, "A Free Mongolian Speech Database and Accompanied Baselines", O-COCOSDA 2021.[] | ||
+ | # Jiao Han, Yunqi Cai, Lantian Li, Guanyu Li, Dong Wang, "An MAP Estimation for Between-Class Variance", APSIPA 2021. [] | ||
+ | # Di Wang, Lantian Li, Hongzhi Yu, Dong Wang,A STUDY ON DECOUPLED PROBABILISTIC LINEAR DISCRIMINANT ANALYSIS, APSIPA 2021. | ||
+ | # Haoran Sun, Lantian Li, Thomas Fang Zheng, Dong Wang, HOW SPEECH IS RECOGNIZED TO BE EMOTIONAL - A STUDY BASED ON INFORMATION DECOMPOSITION, APSIPA 2021. | ||
+ | # Lantian Li, Yang Zhang, Jiawen Kang, Thomas Fang Zheng, Dong Wang, "SQUEEZING VALUE OF CROSS-DOMAIN LABELS: A DECOUPLED SCORING APPROACH FOR SPEAKER VERIFICATION", ICASSP 2021. [pdf] | ||
# Ying Shi, Haolin Chen, Zhiyuan Tang, Lantian Li, Dong Wang, Jiqing Han, Can We Trust Deep Speech Prior?, SLT 2021[https://arxiv.org/abs/2011.02110 pdf pdf] | # Ying Shi, Haolin Chen, Zhiyuan Tang, Lantian Li, Dong Wang, Jiqing Han, Can We Trust Deep Speech Prior?, SLT 2021[https://arxiv.org/abs/2011.02110 pdf pdf] | ||
# Zheng Li, Miao Zhao, Qingyang Hong, Lin Li, Zhiyuan Tang, Dong Wang, Liming Song, Cheng Yang, "AP20-OLR Challenge: Three Tasks and TheirBaselines", APSIPA 2020. [https://arxiv.org/pdf/2006.03473.pdf pdf] | # Zheng Li, Miao Zhao, Qingyang Hong, Lin Li, Zhiyuan Tang, Dong Wang, Liming Song, Cheng Yang, "AP20-OLR Challenge: Three Tasks and TheirBaselines", APSIPA 2020. [https://arxiv.org/pdf/2006.03473.pdf pdf] |
2022年1月7日 (五) 13:10的最后版本
Journal papers (SCI)
- Lantian Li, Ruiqi Liu, Jiawen Kang, Yue Fa, Hao Cui, Yunqi Cai, Ravichander Vipperla, Thomas Fang Zheng and Dong Wang. "CN-Celeb: multi-genre speaker recognition", Speech Communication, 2022. pdf
- Lantian Li, Dong Wang etc., A Principle Solution for Enroll-Test Mismatch, IEEE Transaction on Audio, Speech and Language Processing, 2021 pdf
- Yunqi Cai, Lantian Li, Andrew Abel, Xiaoyan Zhu, Dong Wang, "Deep Normalization for Speaker Vectors", IEEE Transactions on Audio, Speech and Language Processing, 2020. pdf
- Dong Wang, "A Simulation Study on Optimal Scores for Speaker Recognition", EURASIP Journal on Audio, Speech, and Music Processing, 2020. pdf
- Gulnur Arkin, Askar Hamdulla and Mijit Ablimit , Analysis of phonemes and tones confusion rules obtained by ASR,Wireless Networks,2020.link
- Zhiyuan Tang, Lantian Li, Dong Wang, Ravichander Vipperla, "Collaborative Joint Training With Multitask Recurrent Model for Speech and Speaker Recognition", IEEE Transactions on Audio, Speech and Language Processing 2018, vol 25, no.3. online
- Zhiyuan Tang,Dong Wang,Yixiang Chen,Lantian Li,Andrew Abel, "Phonetic Temporal Neural Model for Language Identification", IEEE Transactions on Audio, Speech and Language Processing 2017. online
Journal papers (EI)
- Siamese Attention-based LSTM for Speech Emotion Recognition,IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences, v E103A, n 7, p 937-941, July 1, 2020 link
- Uyghur short-text classification based on reliable sub-word morphology, International Journal of Reasoning-based Intelligent Systems,v 11, n 3, p 250-255, 2019 link
- A Robust Morpheme Sequence and Convolutional Neural Network-Based Uyghur and Kazakh Short Text Classification, Information (Switzerland), v 10, n 12, December 1, 2019 pdf
Conference papers (EI)
- Tiankai Zhi, Ying Shi, Wenqiang Du, Guanyu Li and Dong Wang, "A Free Mongolian Speech Database and Accompanied Baselines", O-COCOSDA 2021.[]
- Jiao Han, Yunqi Cai, Lantian Li, Guanyu Li, Dong Wang, "An MAP Estimation for Between-Class Variance", APSIPA 2021. []
- Di Wang, Lantian Li, Hongzhi Yu, Dong Wang,A STUDY ON DECOUPLED PROBABILISTIC LINEAR DISCRIMINANT ANALYSIS, APSIPA 2021.
- Haoran Sun, Lantian Li, Thomas Fang Zheng, Dong Wang, HOW SPEECH IS RECOGNIZED TO BE EMOTIONAL - A STUDY BASED ON INFORMATION DECOMPOSITION, APSIPA 2021.
- Lantian Li, Yang Zhang, Jiawen Kang, Thomas Fang Zheng, Dong Wang, "SQUEEZING VALUE OF CROSS-DOMAIN LABELS: A DECOUPLED SCORING APPROACH FOR SPEAKER VERIFICATION", ICASSP 2021. [pdf]
- Ying Shi, Haolin Chen, Zhiyuan Tang, Lantian Li, Dong Wang, Jiqing Han, Can We Trust Deep Speech Prior?, SLT 2021pdf pdf
- Zheng Li, Miao Zhao, Qingyang Hong, Lin Li, Zhiyuan Tang, Dong Wang, Liming Song, Cheng Yang, "AP20-OLR Challenge: Three Tasks and TheirBaselines", APSIPA 2020. pdf
- Jiawen Kang,Ruiqi Liu,Lantian Li,Yunqi Cai,Dong Wang,Thomas Fang Zheng, "Domain-Invariant Speaker Vector Projection by Model-Agnostic Meta-Learning", Interspeech 2020. pdf
- Sitong Cheng,Zhixin Liu,Lantian Li,Zhiyuan Tang,Dong Wang,Thomas Fang Zheng, "ASR-Free Pronunciation Assessment", Interspeech 2020. pdf
- Lantian Li,Dong Wang,Thomas Fang Zheng, "Neural Discriminant Analysis for Deep Speaker Embedding", Interspeech 2020. pdf
- Yongmin Li, Guanyu Lia, Pengqi Lia, Sixuan Lia, Xinyu Yuan, ”A Survey of Multimodal Fusion for Identity Verification”, International Symposium on Electronic Information Technology and Communication Engineering(ISEITCE), 2020 pdf
- Yue Fan, Jiawen Kang, Lantian Li, Kaicheng Li, Haolin Chen, Sitong Cheng, Pengyuan Zhang, Ziya Zhou, Yunqi Cai, Dong Wang, "CN-CELEB: A Challenging Chinese Speaker Recognition Dataset", ICASSP 2020. pdf
- Wupeng Wang, Chao Xing, Dong Wang, Xiao Chen, Fengyu Sun, "A ROBUST AUDIO-VISUAL SPEECH ENHANCEMENT MODEL", ICASSP 2020, pdf
- Yang Zhang and Lantian Li and Dong Wang, "VAE-based regularization for deep speaker embedding", Interspeech 2019 pdf
- Lantian Li,Zhiyuan Tang,Ying Shi,Dong Wang, "Gaussian-Constrained Training for Speaker Verification", ICASSP 2019pdf
- Sardar Parhat, Gao Ting, Mijit Ablimit, Askar Hamdulla, A morpheme sequence and convolutional neural network based Kazakh text classification,2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2019, p 1903-1906, November 2019 pdf
- Arkin G, Alijan G, Hamdulla A, A Comparative Analysis of Acoustic Characteristics between Kazak Uyghur Mandarin Learners and Standard Mandarin Speakers,Proceedings of the 2019 International Conference on Asian Language Processing, IALP 2019, p 474-479, November 2019 link
- Statistical Analysis of Syllable Duration of Uyghur Language,Proceedings of the 2019 International Conference on Asian Language Processing, IALP 2019, p 468-473, November 2019 [1]
- Lantian Li,Zhiyuan Tang,Ying Shi,Dong Wang, "Phonetic-Attention Scoring for Deep Speaker Features in Speaker Verification", APSIPA 2019 pdf
- Lantian Li*,Xueyi Wang*,Dong Wang, "VAE-based Domain Adaptation for Speaker Verification", APSIPA 2019. pdf
- Jiayao Wu, Zhiyuan Tang and Dong Wang, "Structure Growth for Small-Footprint Speech Recognition", APSIPA 2019. pdf
- Zhiyuan Tang, Dong Wang, Liming Song, "AP19-OLR Challenge: Three Tasks and Their Baselines", APSIPA 2019. pdf
- Yunqi Cai, Dong Wang, "Question Mark Prediction By Bert", APSIPA 2019 pdf
- Guanyu Li, Lisai Luo, Chunwei Gong, Shiliang Lv, “End-to-end Tibetan Speech Synthesis Based on Phones and Semi-syllables”, Proceedings of APSIPA Annual Summit and Conference(APSIPA) 2019 pdf
- Duo Ma, Guanyu Li, Haihua Xu, Eng Siong Chng, “Improving code-switching speech recognition with data augmentation and system combination”, Proceedings of APSIPA Annual Summit and Conference(APSIPA) 2019 [2]
- Jiyuan Zhang,Dong Wang, "Chinese Poetry Generation with Flexible Styles", ISCSLP 2018pdf.
- Jiyuan Zhang,Zheling Zhang,Shiyue Zhang, Dong Wang,"VV-COUPLET: AN OPEN SOURCE CHINESE COUPLET GENERATION SYSTEM", APSIPA 2018. pdf
- Zhiyuan Tang,Dong Wang,Qing Chen, "AP18-OLR CHALLENGE: THREE TASKS AND THEIR BASELINES",APSIPA 2018.pdf
- Ying Shi,Zhiyuan Tang, Lantian Li,Zheling Zhang,Dong Wang, "MAP AND RELABEL: TOWARDS ALMOST-ZERO RESOURCE SPEECH RECOGNITION",APSIPA 2018.pdf
- Miao Zhang, Xiaofei Kang, Yanqing Wang, Lantian Li, Zhiyuan Tang, Haisheng Dai, Dong Wang*, HUMAN AND MACHINE SPEAKER RECOGNITION BASED ON SHORT TRIVIAL EVENT, ICASSP 2018 arXiv
- Jinghao Yan, Hongzhi Yu, Guanyu Li,"Tibetan acoustic model research based on TDNN", APSIPA ASC 2018 pdf
- Yultuz Rapkat; Gulnur Arkin; Mijit Ablimit; Askar Hamdulla, Acoustic Features of Mandarin Diphthongs by Uyghur Learners at Primary Level,2018 Oriental COCOSDA - International Conference on Speech Database and Assessments, ICSDA 2018 - Proceedings, p 60-66, July 2, 2018, link
- Mijit Ablimit*, Sardar Parhat*, Askar Hamdulla*, Thomas Fang Zheng, Multilingual Stemming and Term extraction for Uyghur, Kazak and Kirghiz,2018 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2018 - Proceedings, p 587-590, July 2, 2018 pdf
- Lisai Luo, Guanyu Li1, Chunwei Gong, Hailan Ding, “End-to-end Speech Synthesis for Tibetan Lhasa Dialect”, International Symposium on Power Electronics and Control Engineering (ISPECE), 2018 pdf
- Ning Yang, Guanyu Li, Hailan Ding, Chunwei Gong, Study on Tibetan Word Vector based on Word2vec, International Symposium on Power Electronics and Control Engineering (ISPECE), 2018 pdf
- Lantian Li, Zhiyuan Tang, Dong Wang, Thomas Fang Zheng, FULL-INFO TRAINING FOR DEEP SPEAKER FEATURE LEARNING, ICASSP 2018.arXiv
- Lantian Li, Dong Wang*, Yixiang Chen, Ying Shing, Zhiyuan Tang, Thomas Fang Zheng, DEEP FACTORIZATION FOR SPEECH SIGNAL, ICASSP 2018 arXiv
- Dong Wang, Thomas Fang Zheng, Zhiyuan Tang, Ying Shi, Lantian Li, Shiyue Zhang Hongzhi Yu, Guanyu Li, Shipeng Xu, Askar Hummdulla, Mijit Ablimit, Gulnigar Mahmut, M2ASR: AMBITIONS AND FIRST YEAR PROGRESS, O-COCOSDA 2017. pdf
- Yang Feng, Shiyue Zhang, Andy Zhang, Dong Wang and Andrew Abel, Memory-augmented Neural Machine Translation, EMNLP 2017 pdf
- Lantian Li, Yixiang Chen, Dong Wang, Thomas Fang Zheng, A Study on Replay Attack and Anti-Spoofing for Automatic Speaker Verification, Interspeech 2017 pdf
- Lantian Li, Yixiang Chen, Ying Shi, Zhiyuan Tang, Dong Wang, "Deep Speaker Feature Learning for Text-independent Speaker Verification", Interspeech 2017pdf
- Jiyuan Zhang, Yang Feng, Dong Wang, Yang Wang, Andrw Abel, Shiyue Zhang, Andi Zhangi, "Flexible and Creative Chinese Poetry Generation Using Neural Memory", ACL 2017 link
- Zhiyuan Tang, Ying Shi, Dong Wang, Yang Feng, and Shiyue Zhang, "Memory Visualization for Gated Recurrent Neural Networks in Speech Recognition", ICASSP 2017.link
- Zhiyuan Tang, Dong Wang, Yixiang Chen, Qing Chen, AP17-OLR Challenge: Data, Plan, and Baseline, APSIPA 2017, link: arXiv
- Shiyue Zhang, Gulnigar Mahmut, Dong Wang, Askar Hamdulla, Memory-augmented Chinese-Uyghur Neural Machine Translation, APSIPA 2017, link: arXiv
- Shipeng Xu , Hongzhi Yu, Thomas Fang Zheng and Jinghao Yan, Language Resource Construction for Mongolian, APSIPA 2017, pdf
- Guanyu Li, Hongzhi Yu, Thomas Fang Zheng, Jinghao Yan, Free Linguistic and Speech Resources for Tibetan, APSIPA 2017, link: pdf
- Ying Shi, Askar Hamdulla, Zhiyuan Tang, Dong Wang, Thomas Fang Zheng, A Free Kazak Speech Database and a Speech Recognition Baseline, APSIPA 2017, link: pdf
- Mijit Ablimit, Sardar Parhat, Askar Hamdulla, Thomas Fang Zheng , A Multilingual Language Processing Tool for Uyghur, Kazak and Kirghiz, APSIPA 2017, link: pdf
- Aodong Li, Shiyue Zhangy, Dong Wangz and Thomas Fang Zheng, Enhanced Neural Machine Translation by Learning from Draft, APSIPA 2017, link: pdf
- Lantian Li, Dong Wang, Askar Rozi, Thomas Fang Zheng, Cross-lingual Speaker Verification with Deep Feature Learning, APSIPA 2017, link: arXiv
- Dong Wang, Lantian Li, Zhiyuan Tang, Thomas Fang Zheng, Deep Speaker Verification: Do We Need End to End?, APSIPA 2017, link: arXiv
- Miao Zhang, Yixiang Chen, Lantian Li and Dong Wang, Speaker Recognition with Cough, Laugh and “Wei”, APSIPA 2017, link: arXiv
- Rehmutulla Memet; Mewlude Nijat; Gulnigar Mahmut; Askar Hamdulla, A rule and statistical modeling based stem extraction method for Kazakh words,Proceedings of the 2017 International Conference on Asian Language Processing, IALP 2017, v 2018-January, p 231-234, July 2, 2017 link
- Shipeng Xu, Hongzhi Yu, Thomas Fang Zheng, Guanyu Li, Gegeentana. “Language Resource Construction for Mongolian”, Proceedings of APSIPA Annual Summit and Conference(APSIPA), 2017 pdf