Last week:
Debugged four-GPU training.
This week:
Research model parallelism and data parallelism.