Stochastic gradient descent with momentum
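
As a rough illustration of the technique named above, here is a minimal sketch of one common formulation, the classical ("heavy-ball") momentum update: v <- beta * v - lr * g, then w <- w + v. The source does not say which variant is meant (Nesterov momentum and dampened forms also exist), so the choice of variant is an assumption. The function names `sgd_momentum` and `squared_error_grad`, the hyperparameter values, and the toy data are all hypothetical and chosen only for illustration.

```python
import random


def sgd_momentum(grad_fn, w0, data, lr=0.01, beta=0.9, epochs=200, seed=0):
    """Fit a single scalar parameter with SGD plus classical momentum.

    Assumed update rule (heavy-ball form, not taken from the source):
        v <- beta * v - lr * g
        w <- w + v
    """
    rng = random.Random(seed)
    w = w0
    v = 0.0  # velocity: exponentially decaying sum of past gradient steps
    samples = list(data)
    for _ in range(epochs):
        rng.shuffle(samples)  # "stochastic": visit samples in random order
        for x, y in samples:
            g = grad_fn(w, x, y)   # gradient on one sample
            v = beta * v - lr * g  # blend previous velocity with new step
            w = w + v              # move along the velocity
    return w


def squared_error_grad(w, x, y):
    """Gradient of (w * x - y) ** 2 with respect to w."""
    return 2.0 * (w * x - y) * x


# Hypothetical noise-free toy data generated from y = 3 * x.
data = [(x, 3.0 * x) for x in (0.5, 1.0, 1.5, 2.0)]
w_fit = sgd_momentum(squared_error_grad, w0=0.0, data=data)
print(w_fit)
```

Setting `beta=0.0` reduces the sketch to plain SGD, which makes it easy to compare the two on the same data.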