当前共找到 1 篇文献分享。
1.
刘昊辰 (2026-03-02 09:15):
#paper Resource-Efficient Model-Free Reinforcement Learning for Board Games. 本文介绍了一种名为 KLENT (Kullback-Leibler and Entropy Regularized Policy Optimization) 的新型无模型(Model-Free)强化学习算法,旨在解决传统基于搜索的棋类游戏AI(如AlphaZero)计算资源消耗巨大的问题。KLENT 展示了通过合理组合现有的RL技术(KL正则、熵正则、λ-returns),可以在不牺牲性能的前提下,大幅降低棋类AI的训练门槛。下载地址:https://arxiv.org/pdf/2602.10894
arXiv, 2026-02-11T14:25:38Z. DOI: 10.48550/arXiv.2602.10894
Abstract:
Board games have long served as complex decision-making benchmarks in artificial intelligence. In this field, search-based reinforcement learning methods such as AlphaZero have achieved remarkable success. However, their significant computational … >>>
Board games have long served as complex decision-making benchmarks in artificial intelligence. In this field, search-based reinforcement learning methods such as AlphaZero have achieved remarkable success. However, their significant computational demands have been pointed out as barriers to their reproducibility. In this study, we propose a model-free reinforcement learning algorithm designed for board games to achieve more efficient learning. To validate the efficiency of the proposed method, we conducted comprehensive experiments on five board games: Animal Shogi, Gardner Chess, Go, Hex, and Othello. The results demonstrate that the proposed method achieves more efficient learning than existing methods across these environments. In addition, our extensive ablation study shows the importance of core techniques used in the proposed method. We believe that our efficient algorithm shows the potential of model-free reinforcement learning in domains traditionally dominated by search-based methods. <<<
翻译
回到顶部