1. Prepare data (wsj), including all_test and 433 speeches on C and G. The coming work will focus on wsj.