Generic 3D object recognition using multi-view range data

Farid, Reza

doi:10.26190/unsworks/17041

Publication:

Generic 3D object recognition using multi-view range data

dc.contributor.advisor	Sammut, Claude	en_US
dc.contributor.author	Farid, Reza	en_US
dc.date.accessioned	2022-03-21T14:40:53Z
dc.date.available	2022-03-21T14:40:53Z
dc.date.issued	2014	en_US
dc.description.abstract	This thesis addresses the problem of learning object classification using multi-view range data. Class membership is determined by shared characteristics, which can be visual, structural or functional. The major steps in object recognition and object classification are: segmentation, feature extraction, object representation and learning. This research introduces segmentation methods to decompose a scene into shape primitives. The first segmentation method is a new approach for producing high-quality planar segments, while the second method employs a commonly used, standard library for creating planar, cylindrical and spherical regions. A set of higher-level, relational features is extracted from the segmented regions. Thus, features are presented in three different levels: single region features, pair-region relationships and features of all regions forming an object instance. The extracted features are represented as predicates in Horn clause logic. Positive and negative examples are produced for learning by the labelling and training facilities developed in this thesis. Inductive Logic Programming (ILP) is used to learn relational concepts from instances taken by a depth camera. As a result, a human-readable representation for each object class is created. The methods developed in this research have been evaluated in experiments on data captured from a real robot designed for urban search and rescue, as well as on standard datasets. RoboCup Rescue competition arenas and other natural indoor scenes were the source of much of the data. There are also several published standard sets of range data that allow comparison with other 3D object classification methods. The results show that ILP is successful in recognising objects encountered by a robot and are competitive with the other state-of-the-art methods. The main contribution of this thesis is in developing an object classification system that integrates data gathering, segmentation, relational feature representation, and relational learning that is capable of performing well in complex unstructured scenes.	en_US
dc.identifier.uri	http://hdl.handle.net/1959.4/53848
dc.language	English
dc.language.iso	EN	en_US
dc.publisher	UNSW, Sydney	en_US
dc.rights	CC BY-NC-ND 3.0	en_US
dc.rights.uri	https://creativecommons.org/licenses/by-nc-nd/3.0/au/	en_US
dc.subject.other	Machine learning	en_US
dc.subject.other	Object classification	en_US
dc.subject.other	Inductive logic programming	en_US
dc.subject.other	Range image	en_US
dc.subject.other	3D point cloud	en_US
dc.subject.other	Urban search and rescue	en_US
dc.subject.other	ALEPH	en_US
dc.title	Generic 3D object recognition using multi-view range data	en_US
dc.type	Thesis	en_US
dcterms.accessRights	open access
dcterms.rightsHolder	Farid, Reza
dspace.entity.type	Publication	en_US
unsw.accessRights.uri	https://purl.org/coar/access_right/c_abf2
unsw.identifier.doi	https://doi.org/10.26190/unsworks/17041
unsw.relation.faculty	Engineering
unsw.relation.originalPublicationAffiliation	Farid, Reza, Computer Science & Engineering, Faculty of Engineering, UNSW	en_US
unsw.relation.originalPublicationAffiliation	Sammut, Claude, Computer Science & Engineering, Faculty of Engineering, UNSW	en_US
unsw.relation.school	School of Computer Science and Engineering	*
unsw.thesis.degreetype	PhD Doctorate	en_US

Files

Original bundle

Now showing 1 - 1 of 1

Name:: public version.pdf
Size:: 3.67 MB
Format:: application/pdf
Description:

Download

Resource type

Thesis

Publication: Generic 3D object recognition using multi-view range data

Files

Original bundle

Resource type

Publication:

Generic 3D object recognition using multi-view range data