Publication:
Development of synthetic power distribution networks and datasets with industrial validation

dc.contributor.advisor Pota, Hemanshu
dc.contributor.advisor Hu, Jiankun
dc.contributor.author Ali, Muhammad
dc.date.accessioned 2022-08-16T04:26:56Z
dc.date.available 2022-08-16T04:26:56Z
dc.date.issued 2022
dc.date.submitted 2022-08-15T23:16:27Z
dc.description.abstract This thesis addresses a key challenge for creating synthetic distribution networks and open-source datasets by combining the public databases and data synthesis algorithms. Novel techniques for the creation of synthetic networks and open-source datasets that enable model validation and demonstration without the need for private data are developed. The developed algorithms are thoroughly benchmarked against existing approaches and validated on industry servers to highlight their usefulness in solving real-world problems. A review using novel techniques that provides unique insights into the literature is conducted to identify research gaps. Based on this review, three contributions have been made in this thesis. The first contribution is the development of a data protection framework for anonymizing sensitive network data. A novel approach is proposed based on the maximum likelihood estimate for estimating the parameters that represent the actual data. A data anonymization algorithm that uses the estimated parameters to generate realistic anonymized datasets is developed. A Kolmogorov-Smirnov test criteria is used to create realistic anonymized datasets. Validation is carried out by collecting actual network data from an energy company and comparing it to anonymized datasets created using the methods developed in this thesis. The application of this method is shown by performing simulation studies on the IEEE 123-node test feeder. The second contribution is developing a practical approach for creating synthetic networks and datasets by integrating the open-source data platforms and synthesis methods. New data synthesis algorithms are proposed to obtain the network datasets for electricity systems in a chosen geographical area. The proposed algorithms include a topology for designing power lines from road infrastructure, a method for computing the lengths of power lines, a hub-line algorithm for determining the number of consumers connected to a single transformer, a virtual layer approach based on FromNode and ToNode for establishing electrical connectivity, and a technique for ingesting raw data from the developed network to industrial data platforms. The practical feasibility of the proposed solutions is shown by creating a synthetic test network and datasets for a distribution feeder in the Colac region in Australia. The datasets are then validated by deploying them on industry servers. The results are compared with actual datasets using geo-based visualizations and by including feedback from industry experts familiar with the analysis. The third contribution of this thesis is to address the problem of electric load profile classification in the context of buildings. This classification is essential to effectively manage energy resources across power distribution networks. Two new methods based on sparse autoencoders (SAEs), and multi-stage transfer learning (MSTL) are proposed for load profile classification. Different from conventional hand-crafted feature representations, SAEs can learn useful features from vast amounts of building data in an unsupervised automatic way. The problems of missing data and class imbalance for building datasets are addressed by proposing a minority over-sampling algorithm that effectively balances missing or unbalanced data by equalizing minority and majority samples for fair comparisons. The practical feasibility of the methodology is shown using two case studies that include both public benchmark and real-world datasets of buildings. An empirical comparison is conducted between the proposed and the state-of-the-art methods in the literature. The results indicate that the proposed method is superior to traditional methods, with a performance improvement from 1 to 10 percent.
dc.identifier.uri http://hdl.handle.net/1959.4/100574
dc.language English
dc.language.iso en
dc.publisher UNSW, Sydney
dc.rights CC BY 4.0
dc.rights.uri https://creativecommons.org/licenses/by/4.0/
dc.subject.other Power distribution networks
dc.subject.other synthetic networks and datasets
dc.subject.other data analytics
dc.subject.other data anonymization
dc.subject.other data architecture and workflows
dc.subject.other geospatial visualisation
dc.subject.other renewable energy
dc.subject.other artificial intelligence
dc.subject.other machine learning
dc.subject.other buildings data
dc.subject.other industrial validation
dc.subject.other building load profiles
dc.title Development of synthetic power distribution networks and datasets with industrial validation
dc.type Thesis
dcterms.accessRights open access
dcterms.rightsHolder Ali, Muhammad
dspace.entity.type Publication
unsw.accessRights.uri https://purl.org/coar/access_right/c_abf2
unsw.date.embargo 2023-12-16
unsw.date.workflow 2022-08-15
unsw.description.embargoNote Embargoed until 2023-12-16
unsw.description.notePublic Added an embargo to the public version of the thesis as some of the content is under-review in journal papers. Many thanks
unsw.identifier.doi https://doi.org/10.26190/unsworks/24281
unsw.relation.faculty Other UNSW
unsw.relation.faculty UNSW Canberra
unsw.relation.school School of Engineering and Information Technology
unsw.relation.school School of Engineering and Information Technology
unsw.subject.fieldofresearchcode 400899 Electrical engineering not elsewhere classified
unsw.thesis.degreetype PhD Doctorate
Files
Original bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
public version.pdf
Size:
12.54 MB
Format:
application/pdf
Description:
Embargo
Resource type