← 返回方法列表

Search and simulation

英文原文文件：search.md

计算解释

在推理或训练过程中，基于已学习的策略、价值函数、模型或科学评分组件所进行的结构化探索。

支撑阅读卡

Human-level control through deep reinforcement learning (2015, search_simulation_science_compute)
Mastering the game of Go with deep neural networks and tree search (2016, search_simulation_science_compute)
Mastering the game of Go without human knowledge (2017, search_simulation_science_compute)
A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play (2018, search_simulation_science_compute)
Grandmaster level in StarCraft II using multi-agent reinforcement learning (2019, search_simulation_science_compute)
Dota 2 with Large Scale Deep Reinforcement Learning (2019, search_simulation_science_compute)
Mastering Atari, Go, chess and shogi by planning with a learned model (2019, search_simulation_science_compute)
Improved protein structure prediction using potentials from deep learning (2020, search_simulation_science_compute)
Highly accurate protein structure prediction with AlphaFold (2021, search_simulation_science_compute)
Tree of Thoughts: Deliberate Problem Solving with Large Language Models (2023, inference_time_compute_post_training)
Accurate structure prediction of biomolecular interactions with AlphaFold 3 (2024, search_simulation_science_compute)
Gold-medalist Performance in Solving Olympiad Geometry with AlphaGeometry2 (2025, search_simulation_science_compute)
AlphaEvolve: A coding agent for scientific and algorithmic discovery (2025, search_simulation_science_compute)

后续计算范式下过时或退居次要的内容

仅通过已链接的阅读卡追踪，不将本方法页视为独立证据来源。