Harmonia Philosophica

Day: April 2, 2015

AI, Brain, Computers, Learning

2015/04/02

Learning machines. Unlearning humans. Void. Arkanoid.

“We present the first deep learning model to successfully learn control policies directly from high-dimensional sensory input using reinforcement learning. The model is a convolutional neural network, trained with a variant of Q-learning, whose input is raw pixels and whose output is a value function estimating future rewards. We apply our method to seven Atari…