Retrieval-augmented generation
英文原文文件:rag.md
计算解释
推理时外部记忆模式,以检索延迟和索引开销换取 grounding 与信息时效性。
支撑阅读卡
- Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks (2020,
inference_time_compute_post_training) - REALM: Retrieval-Augmented Language Model Pre-Training (2020,
inference_time_compute_post_training) - WebGPT: Browser-assisted question-answering with human feedback (2021,
inference_time_compute_post_training)
后续计算范式下过时或退居次要的内容
仅通过已链接的阅读卡追踪,不将本方法页视为独立证据来源。