Sheng Su 2015-10-12

From cslt Wiki
Revision as of 12:07, 12 October 2015 (Mon) by Susheng


Four-GPU training:

  • Tried changing the learning rate, the mini-batch size, and the gap; training still diverges.
  • Tried updating asynchronously; training still diverges.
  • Will keep looking for the cause of the divergence and will try other methods.