各种优化方法总结笔记(sgd/momentum/Nesterov/adagrad/adadelta)

http://blog.csdn.net/luo123n/article/details/48239963

别忘看评语

http://sebastianruder.com/optimizing-gradient-descent/index.html#gradientdescentvariants

AdaptiveGradient (ADAGRAD)

你可能感兴趣的:(机器学习)