文献收藏与分享平台

1.

刘昊辰 (2026-03-02 09:15):

#paper Resource-Efficient Model-Free Reinforcement Learning for Board Games. 本文介绍了一种名为 KLENT (Kullback-Leibler and Entropy Regularized Policy Optimization) 的新型无模型（Model-Free）强化学习算法，旨在解决传统基于搜索的棋类游戏AI（如AlphaZero）计算资源消耗巨大的问题。KLENT 展示了通过合理组合现有的RL技术（KL正则、熵正则、λ-returns），可以在不牺牲性能的前提下，大幅降低棋类AI的训练门槛。下载地址：https://arxiv.org/pdf/2602.10894

arXiv, 2026-02-11T14:25:38Z. DOI: 10.48550/arXiv.2602.10894

Resource-Efficient Model-Free Reinforcement Learning for Board Games

翻译

Kazuki Ota, Takayuki Osa, Motoki Omura, Tatsuya Harada

Abstract:

Board games have long served as complex decision-making benchmarks in artificial intelligence. In this field, search-based reinforcement learning methods such as AlphaZero have achieved remarkable success. However, their significant computational … >>>

翻译

2.

尹志 (2026-02-28 23:14):

#paper，DOI: arXiv:2601.10144，Bridging Superconducting and Neutral-Atom Platforms for Efficient Fault-Tolerant Quantum Architectures，本文提出了一种整合超导和中性原子方案的混合量子计算架构，且面向容错。很有启发性很有前瞻性。考虑到不同量子计算体系的特点，混合方案确实有机会在未来带来有价值的变革。今年我们也会从问题域视角进行混合架构的探索。

arXiv, 2026-01-15T07:39:05Z. DOI: 10.48550/arXiv.2601.10144

Bridging Superconducting and Neutral-Atom Platforms for Efficient Fault-Tolerant Quantum Architectures

翻译

Xiang Fang, Jixuan Ruan, Sharanya Prabhu, Ang Li, Travis Humble, Dean Tullsen, Yufei Ding

Abstract:

The transition to the fault-tolerant era exposes the limitations of homogeneous quantum systems, where no single qubit modality simultaneously offers optimal operation speed, connectivity, and scalability. In this work, we … >>>

翻译

3.

刘昊辰 (2026-02-02 09:27):

#paper Particle Builder A Board Game for the Teaching of the Standard Model of Particle Physics at a Secondary Level.《Particle Builder》是一款于2016年由国际物理教师团队研发的桌游，后推出浏览器在线版本（支持与基础AI对战），专为高中阶段教学设计，通过7个难度递增的关卡，以互动gameplay传授粒子物理学标准模型的核心知识（如夸克、轻子、反物质等），经281名澳大利亚高中生测试，225人完成前后测，平均学习增益达0.16，媲美1.5周（约7小时）传统教学效果，且94%的学生认为其比常规科学课更有趣，88%认为更具参与感，物理版和在线版均免费向教师开放。下载地址：https://arxiv.org/pdf/2511.21116

arXiv, 2025-11-26T07:02:18Z. DOI: 10.48550/arXiv.2511.21116

Particle Builder A Board Game for the Teaching of the Standard Model of Particle Physics at a Secondary Level

翻译

Abstract:

We present Particle Builder, an online board game which teaches students about concepts from the Standard Model of Particle Physics at a high school level. This short activity resulted in … >>>

翻译

4.

林海onrush (2026-01-31 23:55):

#paper，DOI: arXiv:2406.03816，ReST-MCTS: LLM Self-Training via Process Reward Guided Tree Search,本文提出ReST-MCTS，一种将过程奖励（Process Reward）与改进的蒙特卡洛树搜索（MCTS)相结合的大语言模型自训练框架，旨在解决现有自训练方法仅依赖最终正确答案、却容易引入低质量中间推理的问题。该方法在仅已知最终正确答案的情况下，通过树搜索中的多次 rollout 自动推断每一步中间推理对通向正确解的贡献概率，从而生成高质量的过程奖励信号，用于同时训练策略模型和过程奖励模型。实验结果表明，在相同搜索预算下，ReST-MCTS*在推理准确率上优于 Best-of-N、Tree-of-Thought 等方法，并在多轮自训练中持续提升模型性能，显著超过 ReSTEM、Self-Rewarding 等已有自训练范式，验证了其在高质量推理轨迹获取和稳定自提升方面的有效性

arXiv, 2024-06-06T07:40:00Z. DOI: 10.48550/arXiv.2406.03816

ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search

翻译

Dan Zhang, Sining Zhoubian, Ziniu Hu, Yisong Yue, Yuxiao Dong, Jie Tang

Abstract:

Recent methodologies in LLM self-training mostly rely on LLM generating responses and filtering those with correct output answers as training data. This approach often yields a low-quality fine-tuning training set … >>>

翻译

5.

尹志 (2026-01-31 23:53):

#paper https://arxiv.org/abs/2601.21571. arxiv 2026. Shaping capabilities with token-level data filtering。文档级过滤过渡到Token 级过滤确实是很直接的想法，但用良好的工程实现获得洞见，确实是alec的风格。

arXiv, 2026-01-29T11:34:01Z. DOI: 10.48550/arXiv.2601.21571

Shaping capabilities with token-level data filtering

翻译

Neil Rathi, Alec Radford

Abstract:

Current approaches to reducing undesired capabilities in language models are largely post hoc, and can thus be easily bypassed by adversaries. A natural alternative is to shape capabilities during pretraining … >>>

翻译

6.

Vincent (2026-01-31 17:31):

#paper https://arxiv.org/abs/2201.11903 arxiv 2022. Chain-of-Thought Prompting Elicits Reasoning in Large Language Models. 这篇文首次提出了Chain-of-Thought（CoT）的思路，通过在少样本提示中显式提供中间自然语言推理步骤，可以显著提升大语言模型在复杂推理任务上的表现。作者在多种推理任务基准测试上展示了 CoT 的显著增益，尤其在 100B+ 参数规模模型上表现为一种随规模涌现（emergent）的能力。消融实验表明，性能提升并非仅来自“多算一步”，而是顺序化、可读的推理过程本身在发挥作用。该方法无需额外训练或微调，仅通过提示即可实现，因而得以广泛运用，为大模型的可解释推理研究开辟了新方向

arXiv, 2022-01-28T02:33:07Z. DOI: 10.48550/arXiv.2201.11903

Chain-of-Thought Prompting Elicits Reasoning in Large Language Models

翻译

Jason Wei, Xuezhi Wang, Dale Schuurmans, Maarten Bosma, Brian Ichter, Fei Xia, Ed Chi, Quoc Le, Denny Zhou

Abstract:

We explore how generating a chain of thought -- a series of intermediate reasoning steps -- significantly improves the ability of large language models to perform complex reasoning. In particular, … >>>

翻译

7.

刘昊辰 (2026-01-04 09:37):

#paper Collapsi is strongly solved. 2025年6月由Mark S. Ball发布的两人完全信息游戏Collapsi，在16张牌（含4张A、4张2、4张3、2张4、2张Joker）组成的4×4环形棋盘上进行，玩家轮流依据所在牌面数值移动棋子，移动后起始牌翻面，无合法移动者输；Michael Young通过对称破缺将初始16!（约2.1×10¹³）种牌局简化，用带α-β剪枝的极小极大搜索算法开发求解器，20毫秒内可找最优移动，在13代Intel Core i5-13500处理器上耗时7小时29分钟完成47,297,250种等效牌局分析，发现先手（红方）仅37.5%牌局可必赢，后手（蓝方）62.5%牌局可必赢，游戏最短必赢步数为7回合，6.4%牌局中败方能将游戏拖至最大14回合，最终证明该游戏被强解。下载地址：https://arxiv.org/pdf/2507.16823

arXiv, 4 Jul 2025. DOI: 10.48550/arXiv.2507.16823

Collapsi is strongly solved

翻译

Abstract: No abstract available.

8.

林海onrush (2025-12-31 21:49):

#paper, Superposition Yields Robust Neural Scaling, DOI: 10.48550/arXiv.2505.10465. NIPS2025的亚军论文奖，MIT物理团队出身的AI工作，这篇论文提出：神经网络的幂律缩放（模型越宽/维度越大，loss 越低）可能主要源自表示层的“叠加/超位置（superposition）”机制——当需要表示的特征数远大于隐藏维度时，模型会把许多特征压进同一组维度里，导致表示向量之间的重叠干扰；随着维度 (m) 增大，随机几何使这种重叠的平均强度自然按 (~ 1/m) 下降，从而产生鲁棒的 (L∝ 1/m) 幂律缩放。作者用可控的 toy model 对比了弱与强 superposition：弱 superposition 下缩放更依赖数据特征频率的幂律尾部，而强 superposition 下则更普遍地产生接近指数 1 的缩放；并进一步在多种真实 LLM 上测得 token输出权重向量的重叠随宽度近似 (1/m) 下降、宽度指数约 0.9，支持“大模型处于强 superposition、几何干扰驱动缩放”的解释。

arXiv, 2025-05-15T16:18:13Z. DOI: 10.48550/arXiv.2505.10465

Superposition Yields Robust Neural Scaling

翻译

Yizhou Liu, Ziming Liu, Jeff Gore

Abstract:

The success of today's large language models (LLMs) depends on the observation that larger models perform better. However, the origin of this neural scaling law, that loss decreases as a … >>>

翻译

9.

Vincent (2025-12-31 20:29):

#paper https://arxiv.org/abs/1706.03762 arxiv 2017. Attention Is All You Need. 这篇经典论文提出了Transformer，一种全新设计的序列转换模型，完全基于注意力机制而不再使用循环神经网络（RNN）或卷积神经网络（CNN），通过自注意力（Self-Attention）和多头注意力（Multi-Head Attention）有效建模序列中不同位置之间的依赖关系，使得训练可以大规模并行化而不受序列顺序计算的限制。Transformer 采用标准的编码器-解码器架构，其中编码器和解码器都由多个注意力层与前馈网络层堆叠构成，并通过位置编码注入序列中的位置信息，从而弥补没有序列结构时丢失的顺序信息。实验结果表明，该模型在 WMT 2014 英德翻译和英法翻译任务上分别显著优于传统的循环与卷积基线模型，同时训练速度更快，展现出强大的长距离依赖建模能力，并为后续大规模语言模型与多模态 Transformer 架构奠定了基础

arXiv, 2017-06-12T17:57:34Z. DOI: 10.48550/arXiv.1706.03762

Attention Is All You Need

翻译

Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Lukasz Kaiser, Illia Polosukhin

Abstract:

The dominant sequence transduction models are based on complex recurrent or convolutional neural networks in an encoder-decoder configuration. The best performing models also connect the encoder and decoder through an … >>>

翻译

10.

符毓 (2025-12-31 17:21):

#paper doi: 10.48550/arXiv.2512.16907, 2025, Flowing from Reasoning to Motion: Learning 3D Hand Trajectory Prediction from Egocentric Human Interaction Videos Meta推出了 EgoMAN 数据集，这是一个大规模的以第一视角的基准数据集，用于6DoF手部轨迹预测。以及对应的预测模型，这是一个模块化的推理到运动框架，它通过轨迹标记接口和渐进式训练，将高层意图与基于物理的 6DoF 轨迹对齐。实验表明，与仅基于运动和基于VLM基线模型相比，EgoMAN 模型取得了显著优势：流匹配能够生成更平滑、更稳定的轨迹；VLM 驱动的推理提高了语义对齐和对新场景及意图的泛化能力；轨迹标记接口实现了高效的推理，将基于意图的阶段感知推理与精确的底层运动生成相结合。总而言之，EgoMAN 为实现上下文动作预测提供了一个切实可行的步骤，支持机器人操作、语言感知运动合成和意图感知辅助系统等应用。之前数据集的一个主要瓶颈在于缺乏大规模、高质量的3D轨迹数据。部分数据集提供了准确的标注，但多样性有限；而大规模的以自我为中心的视频数据集包含丰富的真实世界交互，但轨迹噪声较大、目标导向性较弱，且缺乏时间结构。关键在于，它们缺乏明确的交互阶段，例如接近和操作，而这些阶段对于将有目的的运动与背景区分开来，以及将轨迹与意图联系起来至关重要。基于此类原始视频训练的模型通常泛化能力较差，因为缺乏意图、空间关系和运动动态之间的联系。

arXiv, 2025-12-18T18:59:01Z. DOI: 10.48550/arXiv.2512.16907

Flowing from Reasoning to Motion: Learning 3D Hand Trajectory Prediction from Egocentric Human Interaction Videos

翻译

Abstract:

Prior works on 3D hand trajectory prediction are constrained by datasets that decouple motion from semantic supervision and by models that weakly link reasoning and action. To address these, we … >>>

翻译

11.

刘昊辰 (2025-12-01 09:56):

#paper Superhuman AI for Stratego Using Self-Play Reinforcement Learning and Test-Time Search. 研究团队开发出名为Ataraxos的 Stratego 超级 AI，通过自博弈强化学习与测试时搜索技术突破了该游戏海量隐藏信息的挑战，仅花费约数千美元（16 块 H100 训练 1 周 + 4 块 H100 训练 4 天，成本低于 8000 美元），便在 20 场对局中以15 胜 1 负 4 平（85% 有效胜率）击败史上最杰出的 Stratego 选手 Pim Niemeijer，且在 2025 年 Stratego 世界锦标赛演示中对普通选手取得 95% 有效胜率；其核心创新在于动态阻尼的自博弈强化学习（协调正则化强度、策略更新规模与策略强度）、分离的布局网络与移动网络（均基于 Transformer 架构），以及基于信念网络的测试时搜索，同时通过 GPU 加速模拟器（每秒约 1000 万状态更新）和数据处理优化（如 bfloat16 数据类型、零检索数据传输）实现低成本高效训练，大幅超越此前 DeepNash 等方案的性能与成本水平。下载地址：https://arxiv.org/pdf/2511.07312

arXiv, 2025-11-10T17:13:41Z. DOI: 10.48550/arXiv.2511.07312

Superhuman AI for Stratego Using Self-Play Reinforcement Learning and Test-Time Search

翻译

Samuel Sokota, Eugene Vinitsky, Hengyuan Hu, J. Zico Kolter, Gabriele Farina

Abstract:

Few classical games have been regarded as such significant benchmarks of artificial intelligence as to have justified training costs in the millions of dollars. Among these, Stratego -- a board … >>>

翻译

12.

Vincent (2025-11-30 21:07):

#paper https://arxiv.org/abs/2104.09864 Arxiv. 2021. RoFormer: Enhanced Transformer with Rotary Position Embedding 这篇论文提出 RoFormer，一种通过旋转式位置编码（Rotary Position Embedding, RoPE）增强 Transformer 推理能力的新方法。传统 Transformer 需要依赖绝对或相对位置向量“相加”到 token 表示中，而 RoPE 另辟蹊径，通过对 query 与 key 施加与位置相关的旋转变换，使自注意力在点积阶段自然地体现相对位置信息。该方法在数学上更优雅、在实现上轻量，并具备更好的长程依赖建模能力，同时与线性注意力等高效变体完全兼容。实验结果显示，RoFormer 在多个长文本任务上均显著优于传统位置编码方案，不需要额外训练成本却能带来更强表示能力，展示出其在更大规模语言模型与复杂序列任务中的广泛应用潜力。

arXiv, 2021-04-20T09:54:06Z. DOI: 10.48550/arXiv.2104.09864

RoFormer: Enhanced Transformer with Rotary Position Embedding

翻译

Jianlin Su, Yu Lu, Shengfeng Pan, Ahmed Murtadha, Bo Wen, Yunfeng Liu

Abstract:

Position encoding recently has shown effective in the transformer architecture. It enables valuable supervision for dependency modeling between elements at different positions of the sequence. In this paper, we first … >>>

翻译

13.

符毓 (2025-11-28 00:14):

#paper doi: 10.48550/arXiv.2511.21366, 2025, THybrid Control for Robotic Nut Tightening Task 本文所提出的机器人螺母紧固系统由两部分组成：1是基于运动基元的规划框架，该框架在任务空间中运行；2是混合控制器，该控制器利用感知到的交互力来更高效地执行规划轨迹中接触密集的部分。实验评估表明，与基准系统相比，该系统完成目标的速度提高了 14.5%，同时由于施加在机械臂上的接触力比基准系统小两个数量级，因此更加安全高效。所提出系统的规划和控制组件的计算成本都很低，与运行它们的仿真软件相比，消耗的 CPU 资源可以忽略不计。该系统对初始配置的变化表现出很高的鲁棒性，并指明了进一步改进的方向。目前存在的一个鲁棒性瓶颈在于规划框架中的回缩运动基元。规划和控制之间更紧密的耦合将缓解问题。

arXiv, 2025/11/26. DOI: 10.48550/arXiv.2511.21366

Hybrid Control for Robotic Nut Tightening Task

翻译

Dmitri Kovalenko

Abstract:

An autonomous robotic nut tightening system for a serial manipulator equipped with a parallel gripper is proposed. The system features a hierarchical motion-primitive-based planner and a control-switching scheme that alternates … >>>

翻译

14.

刘昊辰 (2025-11-01 14:44):

#paper Generating Creative Chess Puzzles. Google DeepMind 于 2025 年 10 月提出一种生成创意国际象棋谜题的方法，先通过基准测试多种生成式 AI 架构（如自回归 Transformer、潜在扩散模型等），再引入基于国际象棋引擎搜索统计数据的强化学习（RL）框架，设计奖励函数提升谜题的独特性、反直觉性、多样性和真实性；该 RL 方法使反直觉谜题生成率从监督学习的 0.22% 提升 10 倍至 2.5%，超过现有数据集（2.1%）和最佳 Lichess 训练模型（0.4%），生成的谜题在新颖性和多样性上达标且保留美学主题，经人类专家评估，其创意性、趣味性和反直觉性优于书籍谜题，最终形成的精选谜题手册获三位世界知名专家认可。下载地址：https://arxiv.org/pdf/2510.23881

arXiv, 2025-10-27T21:43:39Z. DOI: 10.48550/arXiv.2510.23881

Generating Creative Chess Puzzles

翻译

Abstract:

While Generative AI rapidly advances in various domains, generating truly creative, aesthetic, and counter-intuitive outputs remains a challenge. This paper presents an approach to tackle these difficulties in the domain … >>>

翻译

15.

林海onrush (2025-10-31 23:18):

#paper, PALQO: Physics-informed Model for Accelerating Large-scale Quantum Optimization，DOI:10.48550/arXiv.2509.20733。这篇论文提出了 PALQO，一种基于物理约束神经网络（PINN）的新方法用于加速大规模变分量子算法（VQAs）的训练。作者将 VQA 的参数更新过程重新表述为非线性偏微分方程（PDE）问题，并利用 PINN 在经典计算机上学习优化动力学，仅需少量量子测量数据即可预测后续参数更新，从而显著减少量子设备调用。理论分析表明，PALQO 具有良好的泛化性能，其所需训练样本数量随参数规模多项式增长。在横场 Ising 模型、Heisenberg 模型及分子体系（如 LiH、BeH₂）等任务上的实验显示，PALQO 能在保持能量精度（误差约 (10^{-3})）的同时，将量子测量开销降低约90%，实现最高30倍加速。该方法在多体系统和量子化学计算中表现出良好的可扩展性，为在受限量子资源条件下推进大规模量子优化提供了新的思路。

arXiv, 2025-09-25T04:26:02Z. DOI: 10.48550/arXiv.2509.20733

PALQO: Physics-informed Model for Accelerating Large-scale Quantum Optimization

翻译

Yiming Huang, Yajie Hao, Jing Zhou, Xiao Yuan, Xiaoting Wang, Yuxuan Du

Abstract:

Variational quantum algorithms (VQAs) are leading strategies to reachpractical utilities of near-term quantum devices. However, the no-cloningtheorem in quantum mechanics precludes standard backpropagation, leading toprohibitive quantum resource costs when applying … >>>

翻译

16.

符毓 (2025-10-31 22:50):

#paper doi: 10.48550/arXiv.2510.10903, 2025, Towards a Unified Understanding of Robot Manipulation: A Comprehensive Survey 一篇全面涵盖机器人操作领域的全景视角综述。超 1000 篇参考系统地梳理了机器人操作领域的全景图谱，涵盖硬件与控制基础、任务与数据体系、高低层控制框架，以及跨本体与跨模态的泛化研究，并提出了一个统一的理解框架，揭示机器人如何从“执行任务”走向“理解与学习任务”。

arXiv, 2025-10-13T01:59:27Z. DOI: 10.48550/arXiv.2510.10903

Towards a Unified Understanding of Robot Manipulation: A Comprehensive Survey

翻译

Abstract:

Embodied intelligence has witnessed remarkable progress in recent years,driven by advances in computer vision, natural language processing, and therise of large-scale multimodal models. Among its core challenges, robotmanipulation stands out … >>>

翻译

17.

Vincent (2025-10-31 16:28):

#paper https://doi.org/10.48550/arXiv.2510.14901 Arxiv. 2025. Reasoning with Sampling: Your Base Model is Smarter Than You Think. 大语言模型(LLM)+ 强化学习(RL)在众多领域展现出了强大的推理能力，以往研究多集中于探讨强化学习如何赋予基础模型其原本不具备的能力。这篇文章另辟蹊径，提出一个发人深省的问题：是否仅通过采样，而非额外训练，就能让基础模型展现出与强化学习策略相当的推理能力？这篇文章基于模型自身的似然值，提出了一种简单的基于马尔可夫蒙特卡罗（MCMC）的迭代采样方法。实验结果显示，该方法在多种基础模型上均取得了与强化学习算法相当甚至更优的表现。更为重要的是，这一方法避免了强化学习中常见的多样性缺失问题，且无需额外数据或者训练，展现出其在不同领域中的广泛应用潜力

arXiv, 2025-10-16T17:18:11Z. DOI: 10.48550/arXiv.2510.14901

Reasoning with Sampling: Your Base Model is Smarter Than You Think

翻译

Aayush Karan, Yilun Du

Abstract:

Frontier reasoning models have exhibited incredible capabilities across awide array of disciplines, driven by posttraining large language models (LLMs)with reinforcement learning (RL). However, despite the widespread success ofthis paradigm, much … >>>

翻译

18.

刘昊辰 (2025-10-27 14:21):

#paper Strongly Solving 2048 4×3. 本文由日本东京大学研究者提出，成功强解了 2048 游戏的 4×3 变体（2048₄ₓ₃），核心关键技术是基于 ”年龄（age）”（定义为棋盘上所有方块数字之和）对状态空间进行划分 —— 状态与后续动作后的过渡态（afterstate）年龄保持不变，过渡态到新状态时年龄因新增方块（2 或 4）增加 2 或 4，据此可分阶段枚举状态并控制内存占用；同时采用Elias-Fano 编码实现状态的紧凑存储，将约 4.4TiB 的原始存储需求压缩至 1.4TiB（最优玩法专用存储仅需 300GiB）。研究结果显示，最常见初始状态（两个 2 方块，年龄 4）的最优策略期望得分为50724.26，可到达状态数与过渡态数分别为1.15×10¹²和7.40×10¹¹，且验证了 “生成大数字方块（如 2048）时难度显著提升” 等玩家直觉。下载地址：https://arxiv.org/pdf/2510.04580

arXiv, 2025-10-06T08:31:59Z. DOI: 10.48550/arXiv.2510.04580

Strongly Solving 2048 4x3

翻译

Tomoyuki Kaneko, Shuhei Yamashita

Abstract:

2048 is a stochastic single-player game involving 16 cells on a 4 by 4 grid,where a player chooses a direction among up, down, left, and right to obtain ascore by … >>>

翻译

19.

符毓 (2025-09-30 23:42):

#paper doi: 10.48550/arXiv.2509.13311, 2025, Towards General Agentic Intelligence via Environment Scaling. 以往训练这类“代理智能”的主要瓶颈在于缺乏高质量、大规模、多样化的交互数据。人工标注成本极高，而单纯用模型生成的数据又往往不够真实或难以验证。这篇由阿里巴巴通义实验室团队发表的论文（通过环境扩展迈向通用代理智能）提出了一条全新的路径：通过程序化、自动化地构建海量、异构、可验证的模拟环境，让语言模型能在其中自主交互、收集经验、学习成长。基于该方法训练的AgentScaler模型系列，仅用数十亿参数就在多项权威测试中达到了与万亿级模型或闭源商业系统媲美的性能，为高效、轻量级代理智能的发展打开了新的可能性。

arXiv, 2025-09-16T17:57:20Z. DOI: 10.48550/arXiv.2509.13311

Towards General Agentic Intelligence via Environment Scaling

翻译

Abstract:

Advanced agentic intelligence is a prerequisite for deploying Large LanguageModels in practical, real-world applications. Diverse real-world APIs demandprecise, robust function-calling intelligence, which needs agents to developthese capabilities through interaction in … >>>

翻译

20.

尹志 (2025-09-30 22:39):

#paper Quantum computing and artificial intelligence: status and perspectives. doi: 10.48550/arXiv.2505.23860 比较新的一篇QAI的综述。比较细致的介绍了Quantum for AI及AI for Quantum，还有基础问题。最后介绍了一些目前这个领域所遇到的挑战。有两个特点值得一提，一个就是确实很新，目前基本的QAI的问题都有涉及；第二个就是这是一个全欧洲阵容的研究人员写的QAI综述，文章的开头就明确了自己的位置，这点还是很耐人寻味的。

arXiv, 2025-05-29T08:15:23Z. DOI: 10.48550/arXiv.2505.23860

Quantum computing and artificial intelligence: status and perspectives

翻译

Giovanni Acampora, Andris Ambainis, Natalia Ares, Leonardo Banchi, Pallavi Bhardwaj, Daniele Binosi, G. Andrew D. Briggs, Tommaso Calarco, Vedran Dunjko, Jens Eisert, ... >>>

Abstract:

This white paper discusses and explores the various points of intersectionbetween quantum computing and artificial intelligence (AI). It describes howquantum computing could support the development of innovative AI solutions. Italso … >>>

翻译