RL

Reinforcement Learning

SGD

Stochastic Gradient Descent