<?xml version="1.0"?>
<?xml-stylesheet type="text/css" href="http://cslt.org/mediawiki/skins/common/feed.css?303"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="zh-cn">
		<id>http://cslt.org/mediawiki/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=Litc</id>
		<title>cslt Wiki - 用户贡献 [zh-cn]</title>
		<link rel="self" type="application/atom+xml" href="http://cslt.org/mediawiki/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=Litc"/>
		<link rel="alternate" type="text/html" href="http://cslt.org/mediawiki/index.php/%E7%89%B9%E6%AE%8A:%E7%94%A8%E6%88%B7%E8%B4%A1%E7%8C%AE/Litc"/>
		<updated>2026-05-10T10:04:06Z</updated>
		<subtitle>用户贡献</subtitle>
		<generator>MediaWiki 1.23.3</generator>

	<entry>
		<id>http://cslt.org/mediawiki/index.php/Hulan-2013-10-18</id>
		<title>Hulan-2013-10-18</title>
		<link rel="alternate" type="text/html" href="http://cslt.org/mediawiki/index.php/Hulan-2013-10-18"/>
				<updated>2013-10-18T02:58:17Z</updated>
		
		<summary type="html">&lt;p&gt;Litc：&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=ASR=&lt;br /&gt;
&lt;br /&gt;
==ASR Kernel development==&lt;br /&gt;
&lt;br /&gt;
[[http://cslt.riit.tsinghua.edu.cn/mediawiki/index.php/2013-10-18  ASR group weekly report]]&lt;br /&gt;
&lt;br /&gt;
==TTS==&lt;br /&gt;
&lt;br /&gt;
* full-lab training is ready. Trained the first full-lab system with 16k/pseduo 48k data.&lt;br /&gt;
* re-recording 48k data using F00 (500 sentences) and retrain the model. The quality of the signal sounds better, while the quality of pitch is a bit strange. Need more investigation on parameter settings.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Next week:&lt;br /&gt;
&lt;br /&gt;
* Check the signal parameters and solve the problem of pitch.&lt;br /&gt;
* Prepare the large data training with both all-F 863 data.&lt;br /&gt;
* Prepare the large data training with online novel.&lt;br /&gt;
&lt;br /&gt;
=Dialog system=&lt;br /&gt;
&lt;br /&gt;
* The search system migrated to the custom domain, with significant performance reduction&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
  Customs:&lt;br /&gt;
n	TF	TFIDF	&lt;br /&gt;
1	0.496	0.485&lt;br /&gt;
2	0.619	0.615&lt;br /&gt;
3	0.676	0.673&lt;br /&gt;
4	0.713	0.715&lt;br /&gt;
5	0.740	0.738&lt;br /&gt;
&lt;br /&gt;
Agriculture:&lt;br /&gt;
n	TF	TFIDF&lt;br /&gt;
1	0.75	0.8&lt;br /&gt;
2	0.85	0.883&lt;br /&gt;
3	0.867	0.917&lt;br /&gt;
4	0.867	0.95&lt;br /&gt;
5	0.95	0.967&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
* Two problems: &lt;br /&gt;
# short of semantic cluster.&lt;br /&gt;
# limited training data for idf.&lt;br /&gt;
&lt;br /&gt;
* Next week&lt;br /&gt;
# Analyse the QA database, to extract useful domain dependent data&lt;br /&gt;
# Analyse the data to expand the key words &amp;amp; phrases&lt;br /&gt;
# Analyse the data to attain better IDF.&lt;br /&gt;
&lt;br /&gt;
=Summary system=&lt;br /&gt;
&lt;br /&gt;
* Be familiar with the dragon system. Combing the system and extract the summary-only code.&lt;br /&gt;
* Sentence based summary done. But request to migrate to Chinese.&lt;br /&gt;
* Start to build the textrank-based keyword extraction. Re-write the Lexrank code to handle word level similarity matrices. &lt;br /&gt;
&lt;br /&gt;
Next week:&lt;br /&gt;
&lt;br /&gt;
* Test data set: 100 articles&lt;br /&gt;
* TextRank Done&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Template matching==&lt;br /&gt;
&lt;br /&gt;
* Start to work on the self coding, while some requests have not been considered. &lt;br /&gt;
* consider if to use the standard FSM toolkit by next Tuesday.&lt;/div&gt;</summary>
		<author><name>Litc</name></author>	</entry>

	<entry>
		<id>http://cslt.org/mediawiki/index.php/Hulan-2013-10-18</id>
		<title>Hulan-2013-10-18</title>
		<link rel="alternate" type="text/html" href="http://cslt.org/mediawiki/index.php/Hulan-2013-10-18"/>
				<updated>2013-10-18T01:58:21Z</updated>
		
		<summary type="html">&lt;p&gt;Litc：/* TTS */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=ASR=&lt;br /&gt;
&lt;br /&gt;
==ASR Kernel development==&lt;br /&gt;
&lt;br /&gt;
[[http://cslt.riit.tsinghua.edu.cn/mediawiki/index.php/2013-10-18  ASR group weekly report]]&lt;br /&gt;
&lt;br /&gt;
==TTS==&lt;br /&gt;
&lt;br /&gt;
* full-lab training is ready. Trained the first full-lab system with 16k/pseduo 48k data.&lt;br /&gt;
* re-recording 48k data using F00 (500 sentences) and retrain the model. The quality of the signal sounds better, while the quality of pitch is a bit strange. Need more investigation on parameter settings.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Next week:&lt;br /&gt;
&lt;br /&gt;
* Check the signal parameters and solve the problem of pitch.&lt;br /&gt;
* Prepare the large data training with both all-F 863 data.&lt;br /&gt;
* Prepare the large data training with online novel.&lt;br /&gt;
&lt;br /&gt;
=Dialog system=&lt;br /&gt;
&lt;br /&gt;
* The search system migrated to the custom domain, with significant performance reduction&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
  Customs:&lt;br /&gt;
n	TF	TFIDF	&lt;br /&gt;
1	0.496	0.485&lt;br /&gt;
2	0.619	0.615&lt;br /&gt;
3	0.676	0.673&lt;br /&gt;
4	0.713	0.715&lt;br /&gt;
5	0.740	0.738&lt;br /&gt;
&lt;br /&gt;
Agriculture:&lt;br /&gt;
n	TF	TFIDF&lt;br /&gt;
1	0.75	0.8&lt;br /&gt;
2	0.85	0.883&lt;br /&gt;
3	0.867	0.917&lt;br /&gt;
4	0.867	0.95&lt;br /&gt;
5	0.95	0.967&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
* Two problems: &lt;br /&gt;
# short of semantic cluster.&lt;br /&gt;
# limited training data for idf.&lt;br /&gt;
&lt;br /&gt;
* Next week&lt;br /&gt;
# Analyse the QA database, to extract useful domain dependent data&lt;br /&gt;
# Analyse the data to expand the key words &amp;amp; phrases&lt;br /&gt;
# Analyse the data to attain better IDF.&lt;/div&gt;</summary>
		<author><name>Litc</name></author>	</entry>

	<entry>
		<id>http://cslt.org/mediawiki/index.php/Hulan-2013-10-18</id>
		<title>Hulan-2013-10-18</title>
		<link rel="alternate" type="text/html" href="http://cslt.org/mediawiki/index.php/Hulan-2013-10-18"/>
				<updated>2013-10-18T01:40:25Z</updated>
		
		<summary type="html">&lt;p&gt;Litc：/* ASR Kernel development */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=ASR=&lt;br /&gt;
&lt;br /&gt;
==ASR Kernel development==&lt;br /&gt;
&lt;br /&gt;
[[http://cslt.riit.tsinghua.edu.cn/mediawiki/index.php/2013-10-18  ASR group weekly report]]&lt;br /&gt;
&lt;br /&gt;
==TTS==&lt;br /&gt;
&lt;br /&gt;
* CD lab files done. Refining the script. &lt;br /&gt;
* Training toolkit is cleaned up. Now no alignment is required. Parallel training is done.&lt;br /&gt;
* Tried syllable based system instead of phones.&lt;br /&gt;
* Collected an online-novel reading. &lt;br /&gt;
&lt;br /&gt;
Next week:&lt;br /&gt;
&lt;br /&gt;
* Refine the script&lt;br /&gt;
* Clean up the online reading.&lt;br /&gt;
&lt;br /&gt;
=Dialog system=&lt;br /&gt;
&lt;br /&gt;
* The search system migrated to the custom domain, with significant performance reduction&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
  Customs:&lt;br /&gt;
n	TF	TFIDF	&lt;br /&gt;
1	0.496	0.485&lt;br /&gt;
2	0.619	0.615&lt;br /&gt;
3	0.676	0.673&lt;br /&gt;
4	0.713	0.715&lt;br /&gt;
5	0.740	0.738&lt;br /&gt;
&lt;br /&gt;
Agriculture:&lt;br /&gt;
n	TF	TFIDF&lt;br /&gt;
1	0.75	0.8&lt;br /&gt;
2	0.85	0.883&lt;br /&gt;
3	0.867	0.917&lt;br /&gt;
4	0.867	0.95&lt;br /&gt;
5	0.95	0.967&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
* Two problems: &lt;br /&gt;
# short of semantic cluster.&lt;br /&gt;
# limited training data for idf.&lt;br /&gt;
&lt;br /&gt;
* Next week&lt;br /&gt;
# Analyse the QA database, to extract useful domain dependent data&lt;br /&gt;
# Analyse the data to expand the key words &amp;amp; phrases&lt;br /&gt;
# Analyse the data to attain better IDF.&lt;/div&gt;</summary>
		<author><name>Litc</name></author>	</entry>

	<entry>
		<id>http://cslt.org/mediawiki/index.php/Hulan-2013-10-18</id>
		<title>Hulan-2013-10-18</title>
		<link rel="alternate" type="text/html" href="http://cslt.org/mediawiki/index.php/Hulan-2013-10-18"/>
				<updated>2013-10-18T01:40:11Z</updated>
		
		<summary type="html">&lt;p&gt;Litc：/* Dialog system */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=ASR=&lt;br /&gt;
&lt;br /&gt;
==ASR Kernel development==&lt;br /&gt;
&lt;br /&gt;
[[http://cslt.riit.tsinghua.edu.cn/mediawiki/index.php/2013-10-11  ASR group weekly report]]&lt;br /&gt;
&lt;br /&gt;
==TTS==&lt;br /&gt;
&lt;br /&gt;
* CD lab files done. Refining the script. &lt;br /&gt;
* Training toolkit is cleaned up. Now no alignment is required. Parallel training is done.&lt;br /&gt;
* Tried syllable based system instead of phones.&lt;br /&gt;
* Collected an online-novel reading. &lt;br /&gt;
&lt;br /&gt;
Next week:&lt;br /&gt;
&lt;br /&gt;
* Refine the script&lt;br /&gt;
* Clean up the online reading.&lt;br /&gt;
&lt;br /&gt;
=Dialog system=&lt;br /&gt;
&lt;br /&gt;
* The search system migrated to the custom domain, with significant performance reduction&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
  Customs:&lt;br /&gt;
n	TF	TFIDF	&lt;br /&gt;
1	0.496	0.485&lt;br /&gt;
2	0.619	0.615&lt;br /&gt;
3	0.676	0.673&lt;br /&gt;
4	0.713	0.715&lt;br /&gt;
5	0.740	0.738&lt;br /&gt;
&lt;br /&gt;
Agriculture:&lt;br /&gt;
n	TF	TFIDF&lt;br /&gt;
1	0.75	0.8&lt;br /&gt;
2	0.85	0.883&lt;br /&gt;
3	0.867	0.917&lt;br /&gt;
4	0.867	0.95&lt;br /&gt;
5	0.95	0.967&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
* Two problems: &lt;br /&gt;
# short of semantic cluster.&lt;br /&gt;
# limited training data for idf.&lt;br /&gt;
&lt;br /&gt;
* Next week&lt;br /&gt;
# Analyse the QA database, to extract useful domain dependent data&lt;br /&gt;
# Analyse the data to expand the key words &amp;amp; phrases&lt;br /&gt;
# Analyse the data to attain better IDF.&lt;/div&gt;</summary>
		<author><name>Litc</name></author>	</entry>

	<entry>
		<id>http://cslt.org/mediawiki/index.php/Hulan-2013-10-18</id>
		<title>Hulan-2013-10-18</title>
		<link rel="alternate" type="text/html" href="http://cslt.org/mediawiki/index.php/Hulan-2013-10-18"/>
				<updated>2013-10-18T01:39:52Z</updated>
		
		<summary type="html">&lt;p&gt;Litc：以内容“=ASR=  ==ASR Kernel development==  http://cslt.riit.tsinghua.edu.cn/mediawiki/index.php/2013-10-11  ASR group weekly report  ==TTS==  * CD lab files done. Refining ...”创建新页面&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=ASR=&lt;br /&gt;
&lt;br /&gt;
==ASR Kernel development==&lt;br /&gt;
&lt;br /&gt;
[[http://cslt.riit.tsinghua.edu.cn/mediawiki/index.php/2013-10-11  ASR group weekly report]]&lt;br /&gt;
&lt;br /&gt;
==TTS==&lt;br /&gt;
&lt;br /&gt;
* CD lab files done. Refining the script. &lt;br /&gt;
* Training toolkit is cleaned up. Now no alignment is required. Parallel training is done.&lt;br /&gt;
* Tried syllable based system instead of phones.&lt;br /&gt;
* Collected an online-novel reading. &lt;br /&gt;
&lt;br /&gt;
Next week:&lt;br /&gt;
&lt;br /&gt;
* Refine the script&lt;br /&gt;
* Clean up the online reading.&lt;br /&gt;
&lt;br /&gt;
=Dialog system=&lt;br /&gt;
&lt;br /&gt;
* The search system migrated to the custom domain, with significant performance reduction&lt;br /&gt;
&lt;br /&gt;
  Customs:&lt;br /&gt;
n	TF	TFIDF	&lt;br /&gt;
1	0.496	0.485&lt;br /&gt;
2	0.619	0.615&lt;br /&gt;
3	0.676	0.673&lt;br /&gt;
4	0.713	0.715&lt;br /&gt;
5	0.740	0.738&lt;br /&gt;
&lt;br /&gt;
Agriculture:&lt;br /&gt;
n	TF	TFIDF&lt;br /&gt;
1	0.75	0.8&lt;br /&gt;
2	0.85	0.883&lt;br /&gt;
3	0.867	0.917&lt;br /&gt;
4	0.867	0.95&lt;br /&gt;
5	0.95	0.967&lt;br /&gt;
&lt;br /&gt;
* Two problems: &lt;br /&gt;
# short of semantic cluster.&lt;br /&gt;
# limited training data for idf.&lt;br /&gt;
&lt;br /&gt;
* Next week&lt;br /&gt;
# Analyse the QA database, to extract useful domain dependent data&lt;br /&gt;
# Analyse the data to expand the key words &amp;amp; phrases&lt;br /&gt;
# Analyse the data to attain better IDF.&lt;/div&gt;</summary>
		<author><name>Litc</name></author>	</entry>

	<entry>
		<id>http://cslt.org/mediawiki/index.php/Weekly_status</id>
		<title>Weekly status</title>
		<link rel="alternate" type="text/html" href="http://cslt.org/mediawiki/index.php/Weekly_status"/>
				<updated>2013-10-18T01:27:41Z</updated>
		
		<summary type="html">&lt;p&gt;Litc：&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;[[hulan-2013-06-28|2013-06-28]]&lt;br /&gt;
&lt;br /&gt;
[[hulan-2013-07-22|2013-07-22]]&lt;br /&gt;
&lt;br /&gt;
[[hulan-2013-07-26|2013-07-26]]&lt;br /&gt;
&lt;br /&gt;
[[hulan-2013-08-02|2013-08-02]]&lt;br /&gt;
&lt;br /&gt;
[[hulan-2013-08-09|2013-08-09]]&lt;br /&gt;
&lt;br /&gt;
[[hulan-2013-09-06|2013-09-06]]&lt;br /&gt;
&lt;br /&gt;
[[hulan-2013-09-13|2013-09-13]]&lt;br /&gt;
&lt;br /&gt;
[[hulan-2013-09-27|2013-09-27]]&lt;br /&gt;
&lt;br /&gt;
[[hulan-2013-10-11|2013-10-11]]&lt;br /&gt;
&lt;br /&gt;
[[hulan-2013-10-18|2013-10-18]]&lt;/div&gt;</summary>
		<author><name>Litc</name></author>	</entry>

	<entry>
		<id>http://cslt.org/mediawiki/index.php/2013-10-18</id>
		<title>2013-10-18</title>
		<link rel="alternate" type="text/html" href="http://cslt.org/mediawiki/index.php/2013-10-18"/>
				<updated>2013-10-18T01:26:59Z</updated>
		
		<summary type="html">&lt;p&gt;Litc：/* Noisy training */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Data sharing ==&lt;br /&gt;
&lt;br /&gt;
* LM count files still undelivered!&lt;br /&gt;
&lt;br /&gt;
== DNN progress ==&lt;br /&gt;
&lt;br /&gt;
=== Sparse DNN ===&lt;br /&gt;
&lt;br /&gt;
* Optimal Brain Damage(OBD). Code ready, bug found. &lt;br /&gt;
&lt;br /&gt;
=== Tencent exps ===&lt;br /&gt;
N/A&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Noisy training ==&lt;br /&gt;
&lt;br /&gt;
1. With 863 clean test, by adding car &amp;amp; white noise at various levels, obtained significant performance improvement.&lt;br /&gt;
&lt;br /&gt;
2. The test with both car &amp;amp; white noise benefits from the noisy training.&lt;br /&gt;
&lt;br /&gt;
==Continuous LM ==&lt;br /&gt;
&lt;br /&gt;
1. Lattice rescoring toolkit is ready.&lt;br /&gt;
2. Rescoring is slow with some dense lattices.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==QA LM==&lt;br /&gt;
&lt;br /&gt;
1. use the QA word segment system&lt;br /&gt;
2. train the Q LM &amp;amp; QA ASR system&lt;/div&gt;</summary>
		<author><name>Litc</name></author>	</entry>

	<entry>
		<id>http://cslt.org/mediawiki/index.php/2013-10-18</id>
		<title>2013-10-18</title>
		<link rel="alternate" type="text/html" href="http://cslt.org/mediawiki/index.php/2013-10-18"/>
				<updated>2013-10-18T01:26:48Z</updated>
		
		<summary type="html">&lt;p&gt;Litc：以内容“== Data sharing ==  * LM count files still undelivered!  == DNN progress ==  === Sparse DNN ===  * Optimal Brain Damage(OBD). Code ready, bug found.   === Tencent exps ...”创建新页面&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Data sharing ==&lt;br /&gt;
&lt;br /&gt;
* LM count files still undelivered!&lt;br /&gt;
&lt;br /&gt;
== DNN progress ==&lt;br /&gt;
&lt;br /&gt;
=== Sparse DNN ===&lt;br /&gt;
&lt;br /&gt;
* Optimal Brain Damage(OBD). Code ready, bug found. &lt;br /&gt;
&lt;br /&gt;
=== Tencent exps ===&lt;br /&gt;
N/A&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Noisy training ==&lt;br /&gt;
&lt;br /&gt;
1. With 863 clean test, by adding car &amp;amp; white noise at various levels, obtained significant performance improvement.&lt;br /&gt;
2. The test with both car &amp;amp; white noise benefits from the noisy training.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Continuous LM ==&lt;br /&gt;
&lt;br /&gt;
1. Lattice rescoring toolkit is ready.&lt;br /&gt;
2. Rescoring is slow with some dense lattices.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==QA LM==&lt;br /&gt;
&lt;br /&gt;
1. use the QA word segment system&lt;br /&gt;
2. train the Q LM &amp;amp; QA ASR system&lt;/div&gt;</summary>
		<author><name>Litc</name></author>	</entry>

	<entry>
		<id>http://cslt.org/mediawiki/index.php/CSLT_cluster_nodes</id>
		<title>CSLT cluster nodes</title>
		<link rel="alternate" type="text/html" href="http://cslt.org/mediawiki/index.php/CSLT_cluster_nodes"/>
				<updated>2013-10-18T01:17:05Z</updated>
		
		<summary type="html">&lt;p&gt;Litc：&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;'''警告：'''“CSLT cluster nodes”指向这里，但您没有足够的权限来访问它。&lt;/div&gt;</summary>
		<author><name>Litc</name></author>	</entry>

	<entry>
		<id>http://cslt.org/mediawiki/index.php/Use_CSLT_cluster</id>
		<title>Use CSLT cluster</title>
		<link rel="alternate" type="text/html" href="http://cslt.org/mediawiki/index.php/Use_CSLT_cluster"/>
				<updated>2013-10-18T01:15:22Z</updated>
		
		<summary type="html">&lt;p&gt;Litc：/* Login and run jobs */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;'''警告：'''“Use CSLT cluster”指向这里，但您没有足够的权限来访问它。&lt;/div&gt;</summary>
		<author><name>Litc</name></author>	</entry>

	</feed>