符毓 (2025-10-31 22:50):
#paper doi: 10.48550/arXiv.2510.10903, 2025, Towards a Unified Understanding of Robot Manipulation: A Comprehensive Survey 一篇全面涵盖机器人操作领域的全景视角综述。超 1000 篇参考系统地梳理了机器人操作领域的全景图谱,涵盖硬件与控制基础、任务与数据体系、高低层控制框架,以及跨本体与跨模态的泛化研究,并提出了一个统一的理解框架,揭示机器人如何从“执行任务”走向“理解与学习任务”。
arXiv, 2025-10-13T01:59:27Z. DOI: 10.48550/arXiv.2510.10903
Towards a Unified Understanding of Robot Manipulation: A Comprehensive Survey
翻译
Abstract:
Embodied intelligence has witnessed remarkable progress in recent years,driven by advances in computer vision, natural language processing, and therise of large-scale multimodal models. Among its core challenges, robotmanipulation stands out as a fundamental yet intricate problem, requiring theseamless integration of perception, planning, and control to enable interactionwithin diverse and unstructured environments. This survey presents acomprehensive overview of robotic manipulation, encompassing foundationalbackground, task-organized benchmarks and datasets, and a unified taxonomy ofexisting methods. We extend the classical division between high-level planningand low-level control by broadening high-level planning to include language,code, motion, affordance, and 3D representations, while introducing a newtaxonomy of low-level learning-based control grounded in training paradigmssuch as input modeling, latent learning, and policy learning. Furthermore, weprovide the first dedicated taxonomy of key bottlenecks, focusing on datacollection, utilization, and generalization, and conclude with an extensivereview of real-world applications. Compared with prior surveys, our work offersboth a broader scope and deeper insight, serving as an accessible roadmap fornewcomers and a structured reference for experienced researchers. All relatedresources, including research papers, open-source datasets, and projects, arecurated for the community athttps://github.com/BaiShuanghao/Awesome-Robotics-Manipulation.
翻译
回到顶部