Paper之DL之BP:《Understanding the difficulty of training deep feedforward neural networks》

Understanding the difficulty of training deep feedforward neural networks Sigmoid的四层局限 sigmoid函数的test loss和training loss要经过很多轮数一直为0.5,后再有到0.1的差强人意的变化。        We hypothesize that this behavior is due t
相关文章
相关标签/搜索