Publication:
Apprenticeship Bootstrapping: Multi-Skill Reinforcement Learning for Autonomous Unmanned Aerial Vehicles

dc.contributor.advisor Abbass, Hussein en_US
dc.contributor.advisor Garratt, Mathew en_US
dc.contributor.advisor Bui, Lam en_US
dc.contributor.author Nguyen, Hung en_US
dc.date.accessioned 2022-03-22T18:19:16Z
dc.date.available 2022-03-22T18:19:16Z
dc.date.issued 2018 en_US
dc.description.abstract Apprenticeship Learning (AL) uses data collected from humans on tasks to design machine-learning algorithms to imitate the skills used by humans. Such a powerful approach to developing autonomous machines comes with challenges stemming from its reliance on the existence of expert humans who can perform the task and their willingness to be available. In this thesis, Apprenticeship Bootstrapping (ABS) is proposed as a new learning algorithm that relies on humans who are experts on less-complex tasks to aggregate the skills required for more complex ones. ABS has been validated using a ground and aerial coordination task (GACT), where an unmanned aerial vehicle (UAV) needs to autonomously follow a group of unmanned ground vehicles (UGVs) while maintaining the group within the field of view of the UAV's camera. ABS decomposes the complex control task for the UAV operator into simpler sub-tasks which an operator is available to perform and to collect demonstrations from. Two proposed learning approaches, apprenticeship bootstrapping via deep learning and inverse reinforcement learning (ABS-DL and ABS-IRL, respectively), are used to address this challenge. In ABS-DL, deep learning, which has shown dramatic advances in terms of effectively work on raw data, is attractive for learning directly from demonstrations of sub-tasks. However, it may not be effective for acquiring a desirable policy, as it relies heavily on the quality of demonstrations. The quality of demonstrations may not be good because of errors in collecting expert demonstrations caused by the limitations of the techniques, or poor demonstrations caused by the expert. ABS-IRL addresses these issues by recovering the expert reward function from demonstrations on the sub-tasks. In ABS-IRL, during every learning episode, the learner acts learned policy and re-optimizes its learned model while performing the task. This learning approach allows the leaner to avoid learning incorrect or poor demonstrations. Both ABS-DL and ABS-IRL were tested on GACT and results confirmed that ABS-IRL could bootstrap the more complex skill from lower less-complex ones. en_US
dc.identifier.uri http://hdl.handle.net/1959.4/60412
dc.language English
dc.language.iso EN en_US
dc.publisher UNSW, Sydney en_US
dc.rights CC BY-NC-ND 3.0 en_US
dc.rights.uri https://creativecommons.org/licenses/by-nc-nd/3.0/au/ en_US
dc.subject.other Apprenticeship Learning en_US
dc.subject.other Reinforcement learning en_US
dc.subject.other Inverse Reinforcement Learning en_US
dc.subject.other Apprenticeship Boostrapping en_US
dc.subject.other UAV and UGVs en_US
dc.title Apprenticeship Bootstrapping: Multi-Skill Reinforcement Learning for Autonomous Unmanned Aerial Vehicles en_US
dc.type Thesis en_US
dcterms.accessRights open access
dcterms.rightsHolder Nguyen, Hung
dspace.entity.type Publication en_US
unsw.accessRights.uri https://purl.org/coar/access_right/c_abf2
unsw.identifier.doi https://doi.org/10.26190/unsworks/20718
unsw.relation.faculty UNSW Canberra
unsw.relation.originalPublicationAffiliation Nguyen, Hung, Engineering & Information Technology, UNSW Canberra, UNSW en_US
unsw.relation.originalPublicationAffiliation Abbass, Hussein, Engineering & Information Technology, UNSW Canberra, UNSW en_US
unsw.relation.originalPublicationAffiliation Garratt, Mathew, Engineering & Information Technology, UNSW Canberra, UNSW en_US
unsw.relation.originalPublicationAffiliation Bui, Lam, Le Quy Don - Viet nam en_US
unsw.relation.school School of Engineering and Information Technology *
unsw.thesis.degreetype Masters Thesis en_US
Files
Original bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
public version.pdf
Size:
2.95 MB
Format:
application/pdf
Description:
Resource type