TR2025-169
Sim-to-Real Contact-Rich Pivoting via Optimization-Guided RL with Vision and Touch
-
- , "Sim-to-Real Contact-Rich Pivoting via Optimization-Guided RL with Vision and Touch", Embodied World Models for Decision Making, NeurIPS Workshop, December 2025.BibTeX TR2025-169 PDF
- @inproceedings{Shirai2025dec,
- author = {Shirai, Yuki and Ota, Kei and Jha, Devesh K. and Romeres, Diego},
- title = {{Sim-to-Real Contact-Rich Pivoting via Optimization-Guided RL with Vision and Touch}},
- booktitle = {NeurIPS 2025 Workshop on Embodied World Models for Decision Making},
- year = 2025,
- month = dec,
- url = {https://www.merl.com/publications/TR2025-169}
- }
- , "Sim-to-Real Contact-Rich Pivoting via Optimization-Guided RL with Vision and Touch", Embodied World Models for Decision Making, NeurIPS Workshop, December 2025.
-
MERL Contacts:
-
Research Area:
Abstract:
Non-prehensile manipulation is challenging due to complex contact interactions between objects, the environment, and robots. Model-based approaches can effi- ciently generate complex trajectories of robots and objects under contact constraints. However, they tend to be sensitive to model inaccuracies and require access to privileged information (e.g., object mass, size, pose), making them less suitable for novel objects. In contrast, learning-based approaches are typically more robust to modeling errors but require large amounts of data. In this paper, we bridge these two approaches to propose a framework for learning closed-loop pivoting manipulation. By leveraging computationally efficient Contact-Implicit Trajec- tory Optimization (CITO), we design demonstration-guided deep Reinforcement Learning (RL), leading to sample-efficient learning. We also present a sim-to-real transfer approach using a privileged training strategy, enabling the robot to perform pivoting manipulation using only proprioception, vision, and force sensing without access to privileged information. Our method is evaluated on several pivoting tasks, demonstrating that it can successfully perform sim-to-real transfer.

