Publication:
Generic 3D object recognition using multi-view range data

dc.contributor.advisor Sammut, Claude en_US
dc.contributor.author Farid, Reza en_US
dc.date.accessioned 2022-03-21T14:40:53Z
dc.date.available 2022-03-21T14:40:53Z
dc.date.issued 2014 en_US
dc.description.abstract This thesis addresses the problem of learning object classification using multi-view range data. Class membership is determined by shared characteristics, which can be visual, structural or functional. The major steps in object recognition and object classification are: segmentation, feature extraction, object representation and learning. This research introduces segmentation methods to decompose a scene into shape primitives. The first segmentation method is a new approach for producing high-quality planar segments, while the second method employs a commonly used, standard library for creating planar, cylindrical and spherical regions. A set of higher-level, relational features is extracted from the segmented regions. Thus, features are presented in three different levels: single region features, pair-region relationships and features of all regions forming an object instance. The extracted features are represented as predicates in Horn clause logic. Positive and negative examples are produced for learning by the labelling and training facilities developed in this thesis. Inductive Logic Programming (ILP) is used to learn relational concepts from instances taken by a depth camera. As a result, a human-readable representation for each object class is created. The methods developed in this research have been evaluated in experiments on data captured from a real robot designed for urban search and rescue, as well as on standard datasets. RoboCup Rescue competition arenas and other natural indoor scenes were the source of much of the data. There are also several published standard sets of range data that allow comparison with other 3D object classification methods. The results show that ILP is successful in recognising objects encountered by a robot and are competitive with the other state-of-the-art methods. The main contribution of this thesis is in developing an object classification system that integrates data gathering, segmentation, relational feature representation, and relational learning that is capable of performing well in complex unstructured scenes. en_US
dc.identifier.uri http://hdl.handle.net/1959.4/53848
dc.language English
dc.language.iso EN en_US
dc.publisher UNSW, Sydney en_US
dc.rights CC BY-NC-ND 3.0 en_US
dc.rights.uri https://creativecommons.org/licenses/by-nc-nd/3.0/au/ en_US
dc.subject.other Machine learning en_US
dc.subject.other Object classification en_US
dc.subject.other Inductive logic programming en_US
dc.subject.other Range image en_US
dc.subject.other 3D point cloud en_US
dc.subject.other Urban search and rescue en_US
dc.subject.other ALEPH en_US
dc.title Generic 3D object recognition using multi-view range data en_US
dc.type Thesis en_US
dcterms.accessRights open access
dcterms.rightsHolder Farid, Reza
dspace.entity.type Publication en_US
unsw.accessRights.uri https://purl.org/coar/access_right/c_abf2
unsw.identifier.doi https://doi.org/10.26190/unsworks/17041
unsw.relation.faculty Engineering
unsw.relation.originalPublicationAffiliation Farid, Reza, Computer Science & Engineering, Faculty of Engineering, UNSW en_US
unsw.relation.originalPublicationAffiliation Sammut, Claude, Computer Science & Engineering, Faculty of Engineering, UNSW en_US
unsw.relation.school School of Computer Science and Engineering *
unsw.thesis.degreetype PhD Doctorate en_US
Files
Original bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
public version.pdf
Size:
3.67 MB
Format:
application/pdf
Description:
Resource type