Chinese data gigword
来自cslt Wiki
chinese data
prepare data
- now data
- gigaword: /work2/xingchao/corpus/Chinese_corpus/gigaword
- bing parallel corpus:/nfs/disk/work/users/xingchao/bing_dict
- baidu:
- sougou:
- using data
- sample gigword about 344M
- dict:tencent11w
- train set
Parameters | hidden | class | direct | bbt | bptt_block | threads | direct-order | rand_seed | nwords | time(min) |
---|---|---|---|---|---|---|---|---|---|---|
set1 | 320 | 300 | 2000 | 2 | 20 | 1 | 4 | 1 | 10000 | 3380(56h) |