|
|
第49行: |
第49行: |
| |Zhiyuan Tang | | |Zhiyuan Tang |
| || | | || |
− | * | + | * A brief survey on oral language evaluation, both on application and algorithm |
| || | | || |
− | * | + | * A test version of Parrot |
| + | * Patent of Parrot |
| || | | || |
| * | | * |
Date |
People |
Last Week |
This Week |
Task Tracking
|
2017.12.25
|
Miao Zhang
|
|
- Read the 16k model script
- The cough recognition codes left by Xiaofei
|
- check the trivial database, make it more reasonable
- test the 16k model on the database
|
Ying Shi
|
- some function for voice-printer
- speaker vector per utterance here
- speaker vector minus base speaker vector here
- CTC for Haibo Wang (Token accuracy on train set 92.80%, on cv set 89.74%) haven't test on test set
- QRcode
- speaker vector merge phone grayscale here
- speaker vector merge phone black-and-white map here
- speaker vector merge phone black-and-white map minus base vector here
- ivector baseline for kazak-uyghur LRE performance is 81.85% (Utt level)
|
- Finish voice-checker copyright and submit the copyright in this Wednesday
|
|
Lantian Li
|
- Complete the recipe for `VV_FACTOR`.
- 16K and 8K deep speaker model comparison.[5]
|
- Patent for `VV_QuickMark`.
- Complete the demo for `VV_FACTOR`.[Assign to Shouyi Dai]
- Phonetic speaker embedding.
- Overlap training for speaker features.
|
|
Zhiyuan Tang
|
- word level pronunciation accuracy based on likelihood (tell which word is well pronounced as '0' or badly pronounced '1')
|
- model adaptation
- if possible, an alpha version Parrot for test inside lab to collect some data for better configurature
|
|