Tool use and agents
英文原文文件:tool_use.md
计算解释
将推理阶段计算分配给工具、环境、检索或动作循环的方法。
支撑阅读卡
- ReAct: Synergizing Reasoning and Acting in Language Models (2022,
inference_time_compute_post_training) - Program of Thoughts Prompting: Disentangling Computation from Reasoning for Numerical Reasoning Tasks (2022,
inference_time_compute_post_training) - Toolformer: Language Models Can Teach Themselves to Use Tools (2023,
inference_time_compute_post_training) - Voyager: An Open-Ended Embodied Agent with Large Language Models (2023,
inference_time_compute_post_training) - AlphaEvolve: A coding agent for scientific and algorithmic discovery (2025,
search_simulation_science_compute) - Kimi K2: Open Agentic Intelligence (2025,
sparse_memory_efficient_scaling) - DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models (2025,
sparse_memory_efficient_scaling) - Kimi K2.5: Visual Agentic Intelligence (2026,
inference_time_compute_post_training) - Qwen3.5-Omni Technical Report (2026,
generative_media_compute)
后续计算范式下过时或退居次要的内容
仅通过已链接的阅读卡追踪,不将本方法页视为独立证据来源。