TR2025-053

FDPP: Fine-tune Diffusion Policy with Human Preference

- Chen, Y., Jha, D.K., Tomizuka, M., Romeres, D., "FDPP: Fine-tune Diffusion Policy with Human Preference", IEEE International Conference on Robotics and Automation (ICRA), May 2025.
  BibTeX TR2025-053 PDF Video
  - @inproceedings{Chen2025may,
  - author = {Chen, Yuxin and Jha, Devesh K. and Tomizuka, Masayoshi and Romeres, Diego},
  - title = {{FDPP: Fine-tune Diffusion Policy with Human Preference}},
  - booktitle = {IEEE International Conference on Robotics and Automation (ICRA)},
  - year = 2025,
  - month = may,
  - url = {https://www.merl.com/publications/TR2025-053}
  - }
MERL Contacts:
- Devesh K.
  Jha
- Diego
  Romeres
Research Areas:

Machine Learning, Optimization

Abstract:

Imitation learning from human demonstrations enables robots to perform complex manipulation tasks and has recently witnessed huge success. However, these techniques often struggle to adapt behavior to new preferences or changes in the environment. To address these limitations, we propose Fine-tuning Diffusion Policy with Human Preference (FDPP). FDPP learns a reward function through preference-based learning. This reward is then used to fine-tune the pre-trained policy with reinforcement learning (RL), resulting in alignment of pre-trained policy with new human preferences while still solving the original task. Our experiments across various robotic tasks and preferences demonstrate that FDPP effectively customizes policy behavior without compromising performance. Additionally, we show that incorporating Kullback–Leibler (KL) regularization during fine-tuning prevents over-fitting and helps maintain the competencies of the initial policy.

Related News & Events

NEWS Diego Romeres Delivers Invited Talks at Fraunhofer Italia and the University of Padua
Date: July 16, 2025 - July 18, 2025
MERL Contact: Diego Romeres
Research Areas: Artificial Intelligence, Control, Machine Learning, Optimization, Robotics, Human-Computer Interaction
Brief
- MERL researcher Diego Romeres was invited to present MERL's latest research at two institutions in Italy this July, focusing on human-robot collaboration and LLM-driven assembly systems.
  
  On July 16th, Dr. Romeres delivered a talk titled “Human-Robot Collaborative Assembly” at Fraunhofer Italia – Innovation Engineering Center (EIC) in Bolzano. His presentation showcased research on human-robot collaboration for efficient and flexible assembly processes. Fraunhofer Italia EIC is a non-profit research institute focused on enabling digital and sustainable transformation through applied innovation in close collaboration with both public and private sectors.
  
  Two days later, on July 18th, Dr. Romeres was hosted by the University of Padua, one of Europe’s oldest and most renowned universities. His invited lecture, “Robot Assembly through Human Collaboration & Large Language Models”, explored how artificial intelligence can enhance human-robot synergy in complex assembly tasks.
NEWS MERL contributes to ICRA 2025
Date: May 19, 2025 - May 23, 2025
Where: IEEE ICRA
MERL Contacts: Stefano Di Cairano; Jianlin Guo; Chiori Hori; Siddarth Jain; Devesh K. Jha; Toshiaki Koike-Akino; Philip V. Orlik; Arvind Raghunathan; Diego Romeres; Yuki Shirai; Abraham P. Vinod; Yebin Wang
Research Areas: Artificial Intelligence, Computer Vision, Control, Dynamical Systems, Machine Learning, Optimization, Robotics, Human-Computer Interaction
Brief
- MERL made significant contributions to both the organization and the technical program of the International Conference on Robotics and Automation (ICRA) 2025, which was held in Atlanta, Georgia, USA, from May 19th to May 23rd.
  
  MERL was a Bronze sponsor of the conference, and MERL researchers chaired four sessions in the areas of Manipulation Planning, Human-Robot Collaboration, Diffusion Policy, and Learning for Robot Control.
  
  MERL researchers presented four papers in the main conference on the topics of contact-implicit trajectory optimization, proactive robotic assistance in human-robot collaboration, diffusion policy with human preferences, and dynamic and model learning of robotic manipulators. In addition, five more papers were presented in the workshops: “Structured Learning for Efficient, Reliable, and Transparent Robots,” “Safely Leveraging Vision-Language Foundation Models in Robotics: Challenges and Opportunities,” “Long-term Human Motion Prediction,” and “The Future of Intelligent Manufacturing: From Innovation to Implementation.”
  
  MERL researcher Diego Romeres delivered an invited talk titled “Dexterous Robotics: From Multimodal Sensing to Real-World Physical Interactions.”
  
  MERL also collaborated with the University of Padua on one of the conference’s challenges: the “3rd AI Olympics with RealAIGym” (https://ai-olympics.dfki-bremen.de).
  
  During the conference, MERL researchers received the IEEE Transactions on Automation Science and Engineering Best New Application Paper Award for their paper titled “Smart Actuation for End-Edge Industrial Control Systems.”
  
  About ICRA
  
  The IEEE International Conference on Robotics and Automation (ICRA) is the flagship conference of the IEEE Robotics and Automation Society and the world’s largest and most comprehensive technical conference focused on research advances and the latest technological developments in robotics. The event attracts over 7,000 participants, 143 partners and exhibitors, and receives more than 4,000 paper submissions.

Related Publication

Chen, Y., Jha, D.K., Tomizuka, M., Romeres, D., "FDPP: Fine-tune Diffusion Policy with Human Preference", arXiv, January 2025.

BibTeX arXiv

@article{Chen2025jan,
author = {Chen, Yuxin and Jha, Devesh K. and Tomizuka, Masayoshi and Romeres, Diego},
title = {{FDPP: Fine-tune Diffusion Policy with Human Preference}},
journal = {arXiv},
year = 2025,
month = jan,
url = {https://arxiv.org/abs/2501.08259}
}

MERL Contacts:

Devesh K.Jha

DiegoRomeres

Research Areas:

Abstract:

Devesh K.
Jha

Diego
Romeres