“Public data”版本间的差异
来自cslt Wiki
(以“ [http://www.cccforum.org CCC resource] [http://cslt.riit.tsinghua.edu.cn:8081/download/uygh/zip/data.tar.gzz Uyghur database]”为内容创建页面) |
|||
第1行: | 第1行: | ||
+ | ==CCC data resource== | ||
− | + | CSLT holds a close collaboration with Chinese Corpus Consortium (CCC) to collect and publish databases in China. The aim of the CCC is to provide corpora for Chinese ASR, TTS, NLP, perception analysis, phonetics analysis, linguistic analysis, and other related tasks. The corpora can be speech- or text-based; read or spontaneous; wideband or narrowband; standard or dialectal Chinese; clean or with noise; or of any other kinds which are deemed helpful for the foresaid purposes. | |
+ | [http://www.cccforum.org Visit CCC] | ||
+ | |||
+ | ==Uyghur text database== | ||
[http://cslt.riit.tsinghua.edu.cn:8081/download/uygh/zip/data.tar.gzz Uyghur database] | [http://cslt.riit.tsinghua.edu.cn:8081/download/uygh/zip/data.tar.gzz Uyghur database] |
2014年9月30日 (二) 08:34的版本
CCC data resource
CSLT holds a close collaboration with Chinese Corpus Consortium (CCC) to collect and publish databases in China. The aim of the CCC is to provide corpora for Chinese ASR, TTS, NLP, perception analysis, phonetics analysis, linguistic analysis, and other related tasks. The corpora can be speech- or text-based; read or spontaneous; wideband or narrowband; standard or dialectal Chinese; clean or with noise; or of any other kinds which are deemed helpful for the foresaid purposes.