This document summarizes research on building an artificial intelligence that can master the game of Go. It describes how researchers at DeepMind combined deep neural networks with Monte Carlo tree search to create the AlphaGo agent. AlphaGo selects moves with a policy network trained first by supervised learning and then by reinforcement learning, and it evaluates board positions with a value network trained by reinforcement learning. The tree search uses both networks together: the policy network narrows the search to promising moves, and the value network estimates how favorable each resulting position is. The researchers report that AlphaGo defeated human champions by a wide margin, which they take as evidence that the approach reaches a level of play beyond that of human experts.
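
To make the interplay between the two networks and the tree search more concrete, here is a minimal Python sketch of one common way a search tree can use them. It is an illustration under stated assumptions, not AlphaGo's actual implementation: the `Node` class, the `select_move` and `backup` functions, the `c_explore` constant, and the exact form of the exploration bonus are all hypothetical, and the prior and value numbers stand in for outputs of a policy network and a value network.

```python
import math
from dataclasses import dataclass, field


@dataclass
class Node:
    prior: float            # stand-in for the policy network's probability of the move leading here
    visits: int = 0         # how many simulations have passed through this node
    value_sum: float = 0.0  # sum of the value estimates backed up through this node
    children: dict = field(default_factory=dict)

    def mean_value(self) -> float:
        # Average backed-up value; unvisited nodes default to 0.0.
        return self.value_sum / self.visits if self.visits else 0.0


def select_move(node: Node, c_explore: float = 1.0):
    """Return the move whose child has the highest mean value plus exploration bonus."""
    total = sum(child.visits for child in node.children.values())

    def score(child: Node) -> float:
        # The bonus is large for high-prior, rarely visited children and
        # shrinks as a child accumulates visits (an assumed, simplified form).
        bonus = c_explore * child.prior * math.sqrt(total + 1) / (1 + child.visits)
        return child.mean_value() + bonus

    return max(node.children, key=lambda move: score(node.children[move]))


def backup(path, value: float) -> None:
    """Add one leaf evaluation to every node on the path from root to leaf.

    The value is kept from a single fixed perspective for simplicity; a real
    two-player search would flip its sign at each ply.
    """
    for node in path:
        node.visits += 1
        node.value_sum += value


# Usage: two candidate moves with assumed priors 0.7 and 0.3.
root = Node(prior=1.0)
root.children = {"a": Node(prior=0.7), "b": Node(prior=0.3)}
first = select_move(root)                    # the higher prior wins before any visits
backup([root, root.children[first]], -1.0)   # pretend the value estimate was poor
second = select_move(root)                   # the search now turns to the other move
```

In this sketch the prior steers the first simulations toward moves the policy favors, while the backed-up values let the search abandon a move that evaluates badly, which is the division of labor the summary describes.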