Day: April 2, 2015
-
Learning machines. Unlearning humans. Void. Arkanoid.
“We present the first deep learning model to successfully learn control policies directly from high-dimensional sensory input using reinforcement learning. The model is a convolutional neural network, trained with a variant of Q-learning, whose input is raw pixels and whose output is a value function estimating future rewards. We apply our method to seven Atari…
You must be logged in to post a comment.