Efficient Learning and Optimization for Robotic Manipulator Motion Generation

Pavlichenko, Dmytro

Volltext

View/Open (51.9MB)

Author

Pavlichenko, Dmytro

ORCID

https://orcid.org/0000-0003-0904-9524

Type of Scholarly Publication

Dissertation

Date of Exam

07.07.2025

Date of Publication

29.09.2025

Advisor

Behnke, Sven

Co-Referee

Neumann, Gerhard

Degree Granting Institutions

Rheinische Friedrich-Wilhelms-Universität Bonn

Metadata

Show full item record

Citable Links

Handle: https://hdl.handle.net/20.500.11811/13464
URN: https://nbn-resolving.org/urn:nbn:de:hbz:5-84353
DOI: https://doi.org/10.48565/bonndoc-656

Abstract

A central goal of robotics research is to develop autonomous systems capable of achieving and ultimately surpassing human-level task performance in unstructured environments. Robotic manipulation plays a critical role in this aspiration. In this thesis, we address the problem of robotic manipulator motion generation with data-driven and optimization-based approaches. The high-dimensional state spaces with complex underlying dynamics and real-time operational constraints pose major challenges. Our methods address these problems and provide solutions to a sequence of interconnected tasks. These include planning and tracking manipulator trajectories followed by dexterous object manipulation.
First, a feed-forward open-loop reference correction policy improves the joint trajectory tracking accuracy. The policy is learned offline in a supervised manner on a small real-world dataset. We propose to incorporate a hardwired one-step future prediction into the model to facilitate planning behavior. Next, we introduce a methodology for learning a closed-loop policy with deep reinforcement learning directly on the real robot. Our policy leverages the advantages of online feedback to significantly improve trajectory tracking accuracy.
Second, we address dual-arm trajectory optimization with multiple constraints. We propose an obstacle cost function based on the estimation of the worst-case overlap volume. Additionally, we handle the closed kinematic chain constraint by subdividing the system into active and passive sub-chains, with an implicit redundancy resolution for the passive sub-chain. These components significantly decrease the method's runtime when optimizing high-dimensional dual-arm trajectories.
Third, we propose a method for learning dexterous pre-grasp manipulation for functional grasping using a human-like hand. The policy is trained with deep reinforcement learning. Our dense multi-component reward function and curriculum avoid the need for expert demonstrations and other costly data collection processes. We propose two target grasp representations and analyze their effects on the behavior of the policy. The policy quickly learns to dexterously manipulate novel object instances of known categories and achieve provided functional grasps that enable object use, such as operating a drill.
We showcase the effectiveness of our methods in simulation and real-world experiments. Our approaches significantly improve trajectory tracking accuracy, quickly generate high-dimensional trajectories that satisfy multiple constraints, and dexterously manipulate complex objects using a human-like hand.

Subjects

Robotic Manipulators, Trajectory Tracking, Trajectory Optimization, Dexterous Manipulation, Deep Reinforcement Learning, Real-World Robot Learning, Functional Grasping

Classification (DDC)

004 Informatik

Related Publications

https://doi.org/10.1109/HUMANOIDS.2018.8624922
https://doi.org/10.1109/IRC52146.2021.00008
https://doi.org/10.1142/S1793351X22430036
https://doi.org/10.1109/ICRA46639.2022.9812023
https://doi.org/10.1109/CASE56687.2023.10260385
https://doi.org/10.1109/TASE.2025.3541768
https://doi.org/10.1109/HUMANOIDS43949.2019.9035030

Zitiervorschlag
BibTeX

Pavlichenko, Dmytro: Efficient Learning and Optimization for Robotic Manipulator Motion Generation. - Bonn, 2025. - Dissertation, Rheinische Friedrich-Wilhelms-Universität Bonn.
Online-Ausgabe in bonndoc: https://nbn-resolving.org/urn:nbn:de:hbz:5-84353

@phdthesis{handle:20.500.11811/13464,
urn: https://nbn-resolving.org/urn:nbn:de:hbz:5-84353,
doi: https://doi.org/10.48565/bonndoc-656,
author = {{Dmytro Pavlichenko}},
title = {Efficient Learning and Optimization for Robotic Manipulator Motion Generation},
school = {Rheinische Friedrich-Wilhelms-Universität Bonn},
year = 2025,
month = sep,
note = {A central goal of robotics research is to develop autonomous systems capable of achieving and ultimately surpassing human-level task performance in unstructured environments. Robotic manipulation plays a critical role in this aspiration. In this thesis, we address the problem of robotic manipulator motion generation with data-driven and optimization-based approaches. The high-dimensional state spaces with complex underlying dynamics and real-time operational constraints pose major challenges. Our methods address these problems and provide solutions to a sequence of interconnected tasks. These include planning and tracking manipulator trajectories followed by dexterous object manipulation.
First, a feed-forward open-loop reference correction policy improves the joint trajectory tracking accuracy. The policy is learned offline in a supervised manner on a small real-world dataset. We propose to incorporate a hardwired one-step future prediction into the model to facilitate planning behavior. Next, we introduce a methodology for learning a closed-loop policy with deep reinforcement learning directly on the real robot. Our policy leverages the advantages of online feedback to significantly improve trajectory tracking accuracy.
Second, we address dual-arm trajectory optimization with multiple constraints. We propose an obstacle cost function based on the estimation of the worst-case overlap volume. Additionally, we handle the closed kinematic chain constraint by subdividing the system into active and passive sub-chains, with an implicit redundancy resolution for the passive sub-chain. These components significantly decrease the method's runtime when optimizing high-dimensional dual-arm trajectories.
Third, we propose a method for learning dexterous pre-grasp manipulation for functional grasping using a human-like hand. The policy is trained with deep reinforcement learning. Our dense multi-component reward function and curriculum avoid the need for expert demonstrations and other costly data collection processes. We propose two target grasp representations and analyze their effects on the behavior of the policy. The policy quickly learns to dexterously manipulate novel object instances of known categories and achieve provided functional grasps that enable object use, such as operating a drill.
We showcase the effectiveness of our methods in simulation and real-world experiments. Our approaches significantly improve trajectory tracking accuracy, quickly generate high-dimensional trajectories that satisfy multiple constraints, and dexterously manipulate complex objects using a human-like hand.},
url = {https://hdl.handle.net/20.500.11811/13464}
}

The following license files are associated with this item: