This paper presents a comprehensive sim-to-real pipeline for autonomous strawberry picking from dense clusters using a Franka Panda robot. Our approach leverages a custom Mujoco simulation environment that integrates domain randomization techniques. In this environment, a deep reinforcement learning agent is trained using the dormant ratio minimization algorithm. The proposed pipeline bridges low-level control with high-level perception and decision making, demonstrating promising performance in both simulation and in a real laboratory environment, laying the groundwork for successful transfer to real-world autonomous fruit harvesting.
Funding
EPSRC Centre for Doctoral Training in Agri-Food Robotics: AgriFoRwArdS
Engineering and Physical Sciences Research Council
2025 IEEE 21st International Conference on Automation Science and Engineering
Event Dates
17 – 21 August 2025
Date Document First Uploaded
2025-06-04
Publisher statement
"Where required by their funder, authors may retain the right to distribute their accepted manuscript (AM) via an institutional and/or subject repository (e.g., Europe PubMed Central) under a Creative Commons Attribution 4.0 International (CC BY 4.0) license, to be made available for release no later than the date of first online publication."
Will your conference paper be published in proceedings?