This thesis is primarily focused on movement primitives-based imitation learn- ing, within the context of robot programming by demonstration. Specifically, the imitation problem is tackled from a supervised-learning perspective. Therefore, it allows us to resort to theoretical tools from structured prediction, which can handle data-sets with complex structures. The first part of the thesis provides an overall background, in which we overview state-of-the-art imitation learning algorithms as well as discuss relevant technical tools. We formally introduce our contribution in part II. Our algorithm is not only capable of learning usual Euclidean trajectories (Chapter 7), but also trajectories lying on some manifold (Chapter 8). The capability of adapting manifold trajectories distinguishes our approach from other imitation learning algorithms. Subsequently, we provide a few extensions to augment our approach, including trajectory refinement by policy search (Chapter 10), imitation learning with constraints (Chapter 11), and probabilistic trajectory transfer (Chapter 12). We then conclude the thesis in the epilogue.

A Structured Prediction Approach to Robot Imitation Learning

DUAN, ANQING
2021-07-15

Abstract

This thesis is primarily focused on movement primitives-based imitation learn- ing, within the context of robot programming by demonstration. Specifically, the imitation problem is tackled from a supervised-learning perspective. Therefore, it allows us to resort to theoretical tools from structured prediction, which can handle data-sets with complex structures. The first part of the thesis provides an overall background, in which we overview state-of-the-art imitation learning algorithms as well as discuss relevant technical tools. We formally introduce our contribution in part II. Our algorithm is not only capable of learning usual Euclidean trajectories (Chapter 7), but also trajectories lying on some manifold (Chapter 8). The capability of adapting manifold trajectories distinguishes our approach from other imitation learning algorithms. Subsequently, we provide a few extensions to augment our approach, including trajectory refinement by policy search (Chapter 10), imitation learning with constraints (Chapter 11), and probabilistic trajectory transfer (Chapter 12). We then conclude the thesis in the epilogue.
15-lug-2021
Robotics, learning by demonstration, structured prediction, trajectory generation, humanoid
File in questo prodotto:
File Dimensione Formato  
phdunige_4461403.pdf

Open Access dal 16/07/2022

Tipologia: Tesi di dottorato
Dimensione 8.07 MB
Formato Adobe PDF
8.07 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11567/1050082
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact