Lexical Semantic Knowledge and Information Extraction

Ali, Muhammad Asif

doi:10.26190/unsworks/2283

Lexical Semantic Knowledge and Information Extraction

Download files

Access & Terms of Use

open access
Embargoed until 2023-06-07
Copyright: Ali, Muhammad Asif

CC BY-NC-ND 3.0

Abstract

In the recent decade there has been a sharp increase in the utilization of machine/deep learning models for the development of Natural Language Processing (NLP) applications, especially focused on language understanding, with end-goals targeted at, but not limited to: information retrieval, machine translation, sentiment analysis, question answering, etc. These applications call for the need of better models for in-depth understanding of the language structures which in turn help development of automated routines that may acquire vast variety of unstructured data from web resources, process the data and convert it to the desired information content. In the recent past many different models have been developed for language-specific information extraction and a better understanding of semantic aspects of the language, however, yet there are some challenges associated that need to be addressed for the improved utility of these models in the down-streaming tasks. In this thesis, we propose new models with the aims to improve upon the existing methods for the lexico-semantic relation and information extraction tasks. For lexico-semantic relation extraction, we work around distinguishing among different lexico-semantic relations captured from unstructured data, i.e., distinguishing antonyms from synonyms and hypernymy detection. For information extraction, we work with improving the Fine-Grained Named Entity Typing (FG-NET), which is a key component for different down-streaming information and relation extraction tasks. Given the fact that the fine-grained type hierarchy follows a hierarchical structure, in Chapter 5, we extend the concepts of FGET-RR to Fine-Grained Named Entity Typing with Refinement in Hyperbolic space (FGNET-RH) that combine the benefits of the non-euclidean geometry (hyperbolic space) along with the graph structures to perform FG-NET in performance-enhanced fashion. Finally, in order to evaluate the reliability of the machine/deep learning systems for information extraction in real-life scenarios, we evaluate the performance of these systems under uncustomized settings.

Persistent link to this record

http://hdl.handle.net/1959.4/70870

DOI

https://doi.org/10.26190/unsworks/2283

Author(s)

Ali, Muhammad Asif

Supervisor(s)

wang, wei

Publication Year

2021

Resource Type

Thesis

Degree Type

PhD Doctorate

UNSW Faculty

Files

public version.pdf

2.2 MB

Adobe Portable Document Format

View full record Show statistics

Library

Lexical Semantic Knowledge and Information Extraction

Access & Terms of Use

Altmetric

Abstract

Persistent link to this record

DOI

Link to Publisher Version

Link to Open Access Version

Additional Link

Author(s)

Supervisor(s)

Creator(s)

Editor(s)

Translator(s)

Curator(s)

Designer(s)

Arranger(s)

Composer(s)

Recordist(s)

Conference Proceedings Editor(s)

Other Contributor(s)

Corporate/Industry Contributor(s)

Publication Year

Resource Type

Degree Type

UNSW Faculty

Files

Related dataset(s)