Difference between revisions of "ASR-nsfc-publication"
From cslt Wiki
Revision as of 12:53, 18 December 2020 (Friday)
Journal papers (SCI)
- Yunqi Cai, Lantian Li, Andrew Abel, Xiaoyan Zhu, Dong Wang, "Deep Normalization for Speaker Vectors", IEEE Transactions on Audio, Speech and Language Processing, 2020. [1]
- Dong Wang, "A Simulation Study on Optimal Scores for Speaker Recognition", EURASIP Journal on Audio, Speech, and Music Processing, 2020. [2]
- Analysis of phonemes and tones confusion rules obtained by ASR, Wireless Networks, 2020. link
- Zhiyuan Tang, Lantian Li, Dong Wang, Ravichander Vipperla, "Collaborative Joint Training With Multitask Recurrent Model for Speech and Speaker Recognition", IEEE TASLP 2018, vol 25, no.3. online
- Zhiyuan Tang,Dong Wang,Yixiang Chen,Lantian Li,Andrew Abel, "Phonetic Temporal Neural Model for Language Identification", IEEE TASLP 2017. online
Journal papers (EI)
- Siamese Attention-based LSTM for Speech Emotion Recognition, IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences, v E103A, n 7, p 937-941, July 1, 2020
- Uyghur short-text classification based on reliable sub-word morphology, International Journal of Reasoning-based Intelligent Systems, v 11, n 3, p 250-255, 2019
- A Robust Morpheme Sequence and Convolutional Neural Network-Based Uyghur and Kazakh Short Text Classification, Information (Switzerland), v 10, n 12, December 1, 2019
- Investigation of the phonological error rules of Mandarin by Uyghur second language learners, Quarterly Journal of Indian Pulp and Paper Technical Association, v 30, n 1, p 492-500, March 1, 2018
Conference papers (EI)
- Ying Shi, Haolin Chen, Zhiyuan Tang, Lantian Li, Dong Wang, Jiqing Han, Can We Trust Deep Speech Prior?, SLT 2021[3]
- Zheng Li, Miao Zhao, Qingyang Hong, Lin Li, Zhiyuan Tang, Dong Wang, Liming Song, Cheng Yang, "AP20-OLR Challenge: Three Tasks and Their Baselines", APSIPA 2020. [4]
- Jiawen Kang,Ruiqi Liu,Lantian Li,Yunqi Cai,Dong Wang,Thomas Fang Zheng, "Domain-Invariant Speaker Vector Projection by Model-Agnostic Meta-Learning", Interspeech 2020. [5]
- Sitong Cheng, Zhixin Liu, Lantian Li, Zhiyuan Tang, Dong Wang, Thomas Fang Zheng, "ASR-Free Pronunciation Assessment", Interspeech 2020. [https://arxiv.org/pdf/2005.11902.pdf]
- Lantian Li, Dong Wang, Thomas Fang Zheng, "Neural Discriminant Analysis for Deep Speaker Embedding", Interspeech 2020. [https://arxiv.org/pdf/2005.11905.pdf]
- Yongmin Li, Guanyu Li, Pengqi Li, Sixuan Li, Xinyu Yuan, "A Survey of Multimodal Fusion for Identity Verification", International Symposium on Electronic Information Technology and Communication Engineering (ISEITCE), 2020
- Yue Fan, Jiawen Kang, Lantian Li, Kaicheng Li, Haolin Chen, Sitong Cheng, Pengyuan Zhang, Ziya Zhou, Yunqi Cai, Dong Wang, "CN-CELEB: A Challenging Chinese Speaker Recognition Dataset", ICASSP 2020. [https://arxiv.org/pdf/1911.01799.pdf]
- Wupeng Wang, Chao Xing, Dong Wang, Xiao Chen, Fengyu Sun, "A ROBUST AUDIO-VISUAL SPEECH ENHANCEMENT MODEL", ICASSP 2020. [http://166.111.134.19:7777/wangd/public/pdf/visual.pdf]
- Yang Zhang and Lantian Li and Dong Wang, "VAE-based regularization for deep speaker embedding", Interspeech 2019 [10].
- Lantian Li,Zhiyuan Tang,Ying Shi,Dong Wang, "Gaussian-Constrained Training for Speaker Verification", ICASSP 2019[11]
- A morpheme sequence and convolutional neural network based Kazakh text classification, 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2019, p 1903-1906, November 2019
- A Comparative Analysis of Acoustic Characteristics between Kazak Uyghur Mandarin Learners and Standard Mandarin Speakers, Proceedings of the 2019 International Conference on Asian Language Processing, IALP 2019, p 474-479, November 2019
- Statistical Analysis of Syllable Duration of Uyghur Language, Proceedings of the 2019 International Conference on Asian Language Processing, IALP 2019, p 468-473, November 2019
- Lantian Li,Zhiyuan Tang,Ying Shi,Dong Wang, "Phonetic-Attention Scoring for Deep Speaker Features in Speaker Verification", APSIPA 2019 [12]
- Lantian Li*,Xueyi Wang*,Dong Wang, "VAE-based Domain Adaptation for Speaker Verification", APSIPA 2019. [13]
- Jiayao Wu, Zhiyuan Tang and Dong Wang, "Structure Growth for Small-Footprint Speech Recognition", APSIPA 2019. [14]
- Zhiyuan Tang, Dong Wang, Liming Song, "AP19-OLR Challenge: Three Tasks and Their Baselines", APSIPA 2019. [https://arxiv.org/pdf/1907.07626.pdf]
- Yunqi Cai, Dong Wang, "Question Mark Prediction By Bert", APSIPA 2019. [http://www.apsipa.org/proceedings/2019/pdfs/180.pdf]
- Guanyu Li, Lisai Luo, Chunwei Gong, Shiliang Lv, “End-to-end Tibetan Speech Synthesis Based on Phones and Semi-syllables”, Proceedings of APSIPA Annual Summit and Conference(APSIPA) 2019
- Duo Ma, Guanyu Li, Haihua Xu, Eng Siong Chng, “Improving code-switching speech recognition with data augmentation and system combination”, Proceedings of APSIPA Annual Summit and Conference(APSIPA) 2019
- Jiyuan Zhang, Dong Wang, "Chinese Poetry Generation with Flexible Styles", ISCSLP 2018. [https://arxiv.org/abs/1807.06500]
- Jiyuan Zhang, Zheling Zhang, Shiyue Zhang, Dong Wang, "VV-COUPLET: AN OPEN SOURCE CHINESE COUPLET GENERATION SYSTEM", APSIPA 2018. [http://www.apsipa.org/proceedings/2018/pdfs/0001756.pdf]
- Zhiyuan Tang,Dong Wang,Qing Chen, "AP18-OLR CHALLENGE: THREE TASKS AND THEIR BASELINES",APSIPA 2018.[19]
- Ying Shi,Zhiyuan Tang, Lantian Li,Zheling Zhang,Dong Wang, "MAP AND RELABEL: TOWARDS ALMOST-ZERO RESOURCE SPEECH RECOGNITION",APSIPA 2018.[20]
- Miao Zhang, Xiaofei Kang, Yanqing Wang, Lantian Li, Zhiyuan Tang, Haisheng Dai, Dong Wang*, HUMAN AND MACHINE SPEAKER RECOGNITION BASED ON SHORT TRIVIAL EVENT, ICASSP 2018 arXiv
- Jinghao Yan, Hongzhi Yu, Guanyu Li,"Tibetan acoustic model research based on TDNN", APSIPA ASC 2018
- Acoustic Features of Mandarin Diphthongs by Uyghur Learners at Primary Level, 2018 Oriental COCOSDA - International Conference on Speech Database and Assessments, ICSDA 2018 - Proceedings, p 60-66, July 2, 2018
- Multilingual Stemming and Term extraction for Uyghur, Kazak and Kirghiz, 2018 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2018 - Proceedings, p 587-590, July 2, 2018
- Jinghao Yan, Hongzhi Yu, Guanyu Li, "Tibetan acoustic model research based on TDNN", Proceedings of APSIPA Annual Summit and Conference (APSIPA), 2018, pp. 601-604
- Lisai Luo, Guanyu Li, Chunwei Gong, Hailan Ding, "End-to-end Speech Synthesis for Tibetan Lhasa Dialect", International Symposium on Power Electronics and Control Engineering (ISPECE), 2018
- Ning Yang, Guanyu Li, Hailan Ding, Chunwei Gong, Study on Tibetan Word Vector based on Word2vec, International Symposium on Power Electronics and Control Engineering (ISPECE), 2018
- Lantian Li, Zhiyuan Tang, Dong Wang, Thomas Fang Zheng, FULL-INFO TRAINING FOR DEEP SPEAKER FEATURE LEARNING, ICASSP 2018. [https://arxiv.org/pdf/1711.00366 arXiv]
- Lantian Li, Dong Wang*, Yixiang Chen, Ying Shi, Zhiyuan Tang, Thomas Fang Zheng, DEEP FACTORIZATION FOR SPEECH SIGNAL, ICASSP 2018. [https://arxiv.org/pdf/1803.00886 arXiv]
- Dong Wang, Thomas Fang Zheng, Zhiyuan Tang, Ying Shi, Lantian Li, Shiyue Zhang, Hongzhi Yu, Guanyu Li, Shipeng Xu, Askar Hamdulla, Mijit Ablimit, Gulnigar Mahmut, M2ASR: AMBITIONS AND FIRST YEAR PROGRESS, O-COCOSDA 2017. pdf
- Yang Feng, Shiyue Zhang, Andi Zhang, Dong Wang and Andrew Abel, Memory-augmented Neural Machine Translation, EMNLP 2017 [21].
- Lantian Li, Yixiang Chen, Dong Wang, Thomas Fang Zheng, A Study on Replay Attack and Anti-Spoofing for Automatic Speaker Verification, Interspeech 2017 [22].
- Lantian Li, Yixiang Chen, Ying Shi, Zhiyuan Tang, Dong Wang, "Deep Speaker Feature Learning for Text-independent Speaker Verification", Interspeech 2017[23].
- Jiyuan Zhang, Yang Feng, Dong Wang, Yang Wang, Andrew Abel, Shiyue Zhang, Andi Zhang, "Flexible and Creative Chinese Poetry Generation Using Neural Memory", ACL 2017 [24]
- Zhiyuan Tang, Ying Shi, Dong Wang, Yang Feng, and Shiyue Zhang, "Memory Visualization for Gated Recurrent Neural Networks in Speech Recognition", ICASSP 2017.[25]
- Zhiyuan Tang, Dong Wang, Yixiang Chen, Qing Chen, AP17-OLR Challenge: Data, Plan, and Baseline, APSIPA 2017, link: arXiv
- Shiyue Zhang, Gulnigar Mahmut, Dong Wang, Askar Hamdulla, Memory-augmented Chinese-Uyghur Neural Machine Translation, APSIPA 2017, link: arXiv
- Shipeng Xu, Hongzhi Yu, Thomas Fang Zheng and Jinghao Yan, Language Resource Construction for Mongolian, APSIPA 2017, pdf
- Guanyu Li, Hongzhi Yu, Thomas Fang Zheng, Jinghao Yan, Free Linguistic and Speech Resources for Tibetan, APSIPA 2017, link: pdf
- Ying Shi, Askar Hamdulla, Zhiyuan Tang, Dong Wang, Thomas Fang Zheng, A Free Kazak Speech Database and a Speech Recognition Baseline, APSIPA 2017, link: pdf
- Mijit Ablimit, Sardar Parhat, Askar Hamdulla, Thomas Fang Zheng, A Multilingual Language Processing Tool for Uyghur, Kazak and Kirghiz, APSIPA 2017, link: pdf
- Aodong Li, Shiyue Zhang, Dong Wang and Thomas Fang Zheng, Enhanced Neural Machine Translation by Learning from Draft, APSIPA 2017, link: pdf
- Lantian Li, Dong Wang, Askar Rozi, Thomas Fang Zheng, Cross-lingual Speaker Verification with Deep Feature Learning, APSIPA 2017, link: arXiv
- Dong Wang, Lantian Li, Zhiyuan Tang, Thomas Fang Zheng, Deep Speaker Verification: Do We Need End to End?, APSIPA 2017, link: arXiv
- Miao Zhang, Yixiang Chen, Lantian Li and Dong Wang, Speaker Recognition with Cough, Laugh and "Wei", APSIPA 2017, link: [https://arxiv.org/abs/1706.07860 arXiv]
- A rule and statistical modeling based stem extraction method for Kazakh words, Proceedings of the 2017 International Conference on Asian Language Processing, IALP 2017, v 2018-January, p 231-234, July 2, 2017
- Guanyu Li, Hongzhi Yu, Thomas Fang Zheng, Jinghao Yan, Shipeng Xu. “Free Linguistic and Speech Resources for Tibetan”, Proceedings of APSIPA Annual Summit and Conference(APSIPA), 2017
- Shipeng Xu, Hongzhi Yu, Thomas Fang Zheng, Guanyu Li, Gegeentana. “Language Resource Construction for Mongolian”, Proceedings of APSIPA Annual Summit and Conference(APSIPA), 2017
Other papers
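- Sardar Parhat, Mijit Ablimit, Askar Hamdulla. Uyghur short text classification based on robust morpheme sequences and LSTM (基于稳健词素序列和LSTM的维吾尔语短文本分类) [J]. Journal of Chinese Information Processing (中文信息学报), 2020, 34(01): 63-70.
- Sardar Parhat, Mijit Ablimit, Askar Hamdulla. Kazakh short text classification with stem units and convolutional neural networks (词干单元和卷积神经网络的哈萨克短文本分类) [J]. Journal of Chinese Computer Systems (小型微型计算机系统), 2020, 41(08): 1627-1633.
- Research on an integrated environment for Uyghur-Kazakh-Kirghiz multilingual morpheme segmentation (维-哈-柯多语言词素切分集成环境研究) [J]. Video Engineering (电视技术), 2020, 44(06): 46-51+63.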