Day: April 2, 2015

  • Learning machines. Unlearning humans. Void. Arkanoid.

    “We present the first deep learning model to successfully learn control policies directly from high-dimensional sensory input using reinforcement learning. The model is a convolutional neural network, trained with a variant of Q-learning, whose input is raw pixels and whose output is a value function estimating future rewards. We apply our method to seven Atari…

Verified by ExactMetrics