Pauly, Leo (2021) Seeing to learn: Observational learning of robotic manipulation tasks. PhD thesis, University of Leeds.
Abstract
Learning new tasks has always been a challenging problem in robotics. Even though several approaches have been proposed, from manual programming to learning from demonstrations, the field has directions which require further research and development. This thesis focuses on one of these relatively unexplored areas: observational learning.
We present O2A, a novel method for learning to perform robotic manipulation tasks from a single (one-shot) third-person demonstration video. The key novelty lies in pre-training a feature extractor for creating an abstract feature representation for actions that we call ‘action vectors’. The action vectors are extracted using a 3D-CNN network pre-trained for action recognition on a generic action dataset. The distance between the action vectors from the observed third-person demonstration and trial robot executions are used as a reward/cost for learning of the demonstrated task.
We report on experiments in simulation and on a real robot, with changes in viewpoint of observation, properties of the objects involved, scene background and morphology of the manipulator between the demonstration and the learning domains. O2A outperforms baseline approaches under different domain shifts and has comparable performance with an oracle (that uses an ideal reward function). We also plot visualisation of trajectories and show that our method has high reward for desired trajectories.
Finally, we present a framework for extending observational learning with multi modal observations. We report our initial experiments and results in the future works.
Metadata
Supervisors: | Fuentes, Raul and Hogg, David |
---|---|
Related URLs: | |
Awarding institution: | University of Leeds |
Academic Units: | The University of Leeds > Faculty of Engineering (Leeds) > School of Civil Engineering (Leeds) The University of Leeds > Faculty of Engineering (Leeds) > School of Computing (Leeds) |
Identification Number/EthosID: | uk.bl.ethos.834050 |
Depositing User: | Mr. LEO PAULY |
Date Deposited: | 16 Jul 2021 12:55 |
Last Modified: | 11 Aug 2021 09:53 |
Open Archives Initiative ID (OAI ID): | oai:etheses.whiterose.ac.uk:29169 |
Download
Final eThesis - complete (pdf)
Filename: Leo_Pauly_Engineering_PhD_2021.pdf
Licence:
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 2.5 License
Export
Statistics
You do not need to contact us to get a copy of this thesis. Please use the 'Download' link(s) above to get a copy.
You can contact us about this thesis. If you need to make a general enquiry, please see the Contact us page.