Vincent
(2026-03-31 15:00):
#paper https://doi.org/10.1038/s41586-026-10265-5
Nature 2026. Towards end-to-end automation of AI research. 这篇文章首次构建了一个能够端到端自动完成科研全流程的 AI 系统,覆盖从想法生成、实验执行到论文写作与同行评审的完整闭环。系统基于多智能体架构,并通过分阶段实验流程与 agentic tree search在研究空间中进行系统性探索。实验表明,AI 生成论文已具备真实科研质量,其中一篇通过 ICLR workshop 盲审,达到人类接受阈值 。同时,自动审稿系统与人类评审一致性相当,用于大规模评估生成结果。研究进一步发现,论文质量随模型能力与 test-time compute 显著提升,揭示科研能力的可扩展性。尽管当前仍存在实现错误与幻觉问题,该工作将科研过程形式化为可搜索的计算问题,标志着从“AI 辅助科研”向“AI 自动科研”的范式转变。
Nature,
2026-3-26.
DOI: 10.1038/s41586-026-10265-5
Towards end-to-end automation of AI research
翻译
Abstract:
Abstract The automation of science is a long-standing ambition in artificial intelligence (AI) research 1,2 . Although the community has made substantial progress in automating individual components of the scientific process, a system that autonomously navigates the entire research life cycle—from conception to publication—has remained out of reach. Here we present a pipeline for automating the entire scientific process end to end. We present The AI Scientist, which creates research ideas, writes code, runs experiments, plots and analyses data, writes the entire scientific manuscript, and performs its own peer review. Its ideas, execution and presentation are of sufficient quality that the manuscript generated by this AI system passed the first round of peer review for a workshop of a top-tier machine learning conference. The workshop had an acceptance rate of 70%. Our system leverages modern foundation models 3–5 within a complex agentic system. We evaluate The AI Scientist in two settings: a focused mode using human-provided code templates as an initial scaffold for conducting research on a specific topic and a template-free, open-ended mode that leverages agentic search for wider scientific exploration 6,7 . Both settings produce diverse ideas and automatically test, report on and evaluate them. This achievement demonstrates the growing capacity of AI for making scientific contributions and signifies a potential paradigm shift in how research is conducted. As with any impactful new technology, there could be important risks, including taxing overwhelmed review systems and adding noise to the scientific literature. However, if developed responsibly, such autonomous systems could greatly accelerate scientific discovery.
翻译
Related Links: