$$\begin{eqnarray}
\mathbf{v}_t &=& \gamma\mathbf{v}_{t-1} + \alpha\nabla_{\mathbf{\theta}}J(\mathbf{\theta}) \nonumber \\
\mathbf{\theta}_t &=& \mathbf{\theta}_{t-1} - \mathbf{v}_t \nonumber
\end{eqnarray}$$
Nesterov Accelerated Gradient:
$$\begin{eqnarray}
\mathbf{v}_t &=& \gamma\mathbf{v}_{t-1} + \alpha\nabla_{\mathbf{\theta}}J(\mathbf{\theta-\gamma\mathbf{v}_{t-1}}) \nonumber \\
\mathbf{\theta}_t &=& \mathbf{\theta}_{t-1} - \mathbf{v}_t \nonumber
\end{eqnarray}$$