“OLR Challenge 2017”版本间的差异
(→Sponsor) |
(→News) |
||
第14行: | 第14行: | ||
Number of teams that already registered the challenge: '''26'''. | Number of teams that already registered the challenge: '''26'''. | ||
− | Number of teams that wait to register the challenge: ''' | + | Number of teams that wait to register the challenge: '''35-26'''. |
==Data== | ==Data== |
2017年8月31日 (四) 08:08的版本
目录
Oriental Language Recognition (OLR) 2017 Challenge
Oriental languages involve interesting specialties. The OLR challenge series aim at boosting language recognition technology for oriental languages. Following the success of OLR Challenge 2016, the new challenge in 2017 follows the same theme, but sets up more challenging tasks in the sense of:
- more languages: OLR 2016 involves 7 languages, OLR 2017 involves 10 languages.
- shorter speech segments. OLR 2017 sets individual tasks for 1 second, 3 second and the original segments separately.
We will publish the results on a special session of APSIPA ASC 2017. See more details for the AP17 special session.
News
Number of teams that already registered the challenge: 26.
Number of teams that wait to register the challenge: 35-26.
Data
The challenge is based on two multilingual databases, AP16-OL7 that was designed for the OLR challenge 2016, and a new complementary AP17-OL3 database.
AP16-OL7 is provided by SpeechOcean (www.speechocean.com), and AP17-OL3 is provided by Tsinghua University, Northwest Minzu University and Xinjiang University, under the M2ASR project supported by NSFC.
The features for AP16-OL7 involve:
- Mobile channel
- 7 languages in total
- 71 hours of speech signals in total
- Transcriptions and lexica are provided
- The data profile is here
- The License for the data is here
The features for AP17-OL3 involve:
- Mobile channel
- 3 languages in total
- Tibetan provided by Prof. Guanyu Li@Northwest Minzu Univ.
- Uyghur and Kazak provided by Prof. Askar Hamdulla@Xinjiang University.
- 35 hours of speech signals in total
- Transcriptions and lexica are provided
- The data profile is here
- The License for the data is here
Evaluation plan
Refer to the scripts/paper following.
Evaluation tools
- The Kaldi-based baseline scripts here
Participation rules
- Participants from both academy and industry are welcome
- Publications based on the data provided by the challenge should cite the following paper:
Dong Wang, Lantian Li, Difei Tang, Qing Chen, AP16-OL7: a multilingual database for oriental languages and a language recognition baseline, APSIPA ASC 2016. pdf
Zhiyuan Tang, Dong Wang, Yixiang Chen, Qing Chen: AP17-OLR Challenge: Data, Plan, and Baseline, submitted to APSIPA ASC 2017. pdf
Important dates
- Jun. 20, AP17-OLR training/dev data release.
- Sep. 20, register deadline.
- Oct. 1, test data release.
- Oct. 2, 12:00 PM, Beijing time, submission deadline.
- APSIPA ASC 2017, results announcement.
Registration procedure
If you intend to participate the challenge, or if you have any questions, comments or suggestions about the challenge, please send email to the organizers:
- Dr. Dong Wang (wangdong99@mails.tsinghua.edu.cn)
- Dr. Zhiyuan Tang (tangzy@cslt.riit.tsinghua.edu.cn)
- Ms. Qing Chen (chenqing@speechocean.com)
Organizers
- Dong Wang, Tsinghua University [home]
Error code: 127
- Zhiyuan Tang, Tsinghua University [home]
- Qing Chen, SpeechOcean