← All methods

Search and simulation

Compute interpretation

Inference-time or training-time structured exploration layered on top of learned policy, value, model, or scientific scoring components.

Supporting reading cards

Human-level control through deep reinforcement learning (2015, search_simulation_science_compute)
Mastering the game of Go with deep neural networks and tree search (2016, search_simulation_science_compute)
Mastering the game of Go without human knowledge (2017, search_simulation_science_compute)
A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play (2018, search_simulation_science_compute)
Grandmaster level in StarCraft II using multi-agent reinforcement learning (2019, search_simulation_science_compute)
Dota 2 with Large Scale Deep Reinforcement Learning (2019, search_simulation_science_compute)
Mastering Atari, Go, chess and shogi by planning with a learned model (2019, search_simulation_science_compute)
Improved protein structure prediction using potentials from deep learning (2020, search_simulation_science_compute)
Highly accurate protein structure prediction with AlphaFold (2021, search_simulation_science_compute)
Tree of Thoughts: Deliberate Problem Solving with Large Language Models (2023, inference_time_compute_post_training)
Accurate structure prediction of biomolecular interactions with AlphaFold 3 (2024, search_simulation_science_compute)
Gold-medalist Performance in Solving Olympiad Geometry with AlphaGeometry2 (2025, search_simulation_science_compute)
AlphaEvolve: A coding agent for scientific and algorithmic discovery (2025, search_simulation_science_compute)

Obsolete or less central under later compute

Track this only through linked reading cards; do not treat this method page as standalone evidence.