Babysitting the Learning Process

在预处理完data和选择合适的network architecture后就开始train了, 1. Double check that the loss is reasonable:最开始disable regularization的话loss应该是与1/# class相关的一个数, 2。 Make sure that you can overfit very small portion of t
相关文章
相关标签/搜索