Efficient Learning and Optimization for Robotic Manipulator Motion Generation

Pavlichenko, Dmytro

dc.contributor.advisor	Behnke, Sven
dc.contributor.author	Pavlichenko, Dmytro
dc.date.accessioned	2025-09-29T13:13:59Z
dc.date.available	2025-09-29T13:13:59Z
dc.date.issued	29.09.2025
dc.identifier.uri	https://hdl.handle.net/20.500.11811/13464
dc.description.abstract	A central goal of robotics research is to develop autonomous systems capable of achieving and ultimately surpassing human-level task performance in unstructured environments. Robotic manipulation plays a critical role in this aspiration. In this thesis, we address the problem of robotic manipulator motion generation with data-driven and optimization-based approaches. The high-dimensional state spaces with complex underlying dynamics and real-time operational constraints pose major challenges. Our methods address these problems and provide solutions to a sequence of interconnected tasks. These include planning and tracking manipulator trajectories followed by dexterous object manipulation. First, a feed-forward open-loop reference correction policy improves the joint trajectory tracking accuracy. The policy is learned offline in a supervised manner on a small real-world dataset. We propose to incorporate a hardwired one-step future prediction into the model to facilitate planning behavior. Next, we introduce a methodology for learning a closed-loop policy with deep reinforcement learning directly on the real robot. Our policy leverages the advantages of online feedback to significantly improve trajectory tracking accuracy. Second, we address dual-arm trajectory optimization with multiple constraints. We propose an obstacle cost function based on the estimation of the worst-case overlap volume. Additionally, we handle the closed kinematic chain constraint by subdividing the system into active and passive sub-chains, with an implicit redundancy resolution for the passive sub-chain. These components significantly decrease the method's runtime when optimizing high-dimensional dual-arm trajectories. Third, we propose a method for learning dexterous pre-grasp manipulation for functional grasping using a human-like hand. The policy is trained with deep reinforcement learning. Our dense multi-component reward function and curriculum avoid the need for expert demonstrations and other costly data collection processes. We propose two target grasp representations and analyze their effects on the behavior of the policy. The policy quickly learns to dexterously manipulate novel object instances of known categories and achieve provided functional grasps that enable object use, such as operating a drill. We showcase the effectiveness of our methods in simulation and real-world experiments. Our approaches significantly improve trajectory tracking accuracy, quickly generate high-dimensional trajectories that satisfy multiple constraints, and dexterously manipulate complex objects using a human-like hand.	en
dc.language.iso	eng
dc.rights	In Copyright
dc.rights.uri	http://rightsstatements.org/vocab/InC/1.0/
dc.subject	Robotic Manipulators
dc.subject	Trajectory Tracking
dc.subject	Trajectory Optimization
dc.subject	Dexterous Manipulation
dc.subject	Deep Reinforcement Learning
dc.subject	Real-World Robot Learning
dc.subject	Functional Grasping
dc.subject.ddc	004 Informatik
dc.title	Efficient Learning and Optimization for Robotic Manipulator Motion Generation
dc.type	Dissertation oder Habilitation
dc.identifier.doi	https://doi.org/10.48565/bonndoc-656
dc.publisher.name	Universitäts- und Landesbibliothek Bonn
dc.publisher.location	Bonn
dc.rights.accessRights	openAccess
dc.identifier.urn	https://nbn-resolving.org/urn:nbn:de:hbz:5-84353
dc.relation.doi	https://doi.org/10.1109/HUMANOIDS.2018.8624922
dc.relation.doi	https://doi.org/10.1109/IRC52146.2021.00008
dc.relation.doi	https://doi.org/10.1142/S1793351X22430036
dc.relation.doi	https://doi.org/10.1109/ICRA46639.2022.9812023
dc.relation.doi	https://doi.org/10.1109/CASE56687.2023.10260385
dc.relation.doi	https://doi.org/10.1109/TASE.2025.3541768
dc.relation.doi	https://doi.org/10.1109/HUMANOIDS43949.2019.9035030
ulbbn.pubtype	Erstveröffentlichung
ulbbnediss.affiliation.name	Rheinische Friedrich-Wilhelms-Universität Bonn
ulbbnediss.affiliation.location	Bonn
ulbbnediss.thesis.level	Dissertation
ulbbnediss.dissID	8435
ulbbnediss.date.accepted	07.07.2025
ulbbnediss.institute	Mathematisch-Naturwissenschaftliche Fakultät : Fachgruppe Informatik / Institut für Informatik
ulbbnediss.fakultaet	Mathematisch-Naturwissenschaftliche Fakultät
dc.contributor.coReferee	Neumann, Gerhard
ulbbnediss.contributor.orcid	https://orcid.org/0000-0003-0904-9524

Files in this item

Name:: 8435.pdf
Size:: 51.9MB
Format:: PDF

View/Open

This item appears in the following Collection(s)

E-Dissertationen (4466)

Show simple item record

The following license files are associated with this item: