Fast methods for fitting log-Gaussian Cox process models in ecology.

dc.contributor.advisor Warton, David en_US
dc.contributor.advisor Popovic, Gordana en_US Dovers, Elliot en_US 2022-03-23T15:58:51Z 2022-03-23T15:58:51Z 2021 en_US
dc.description.abstract Log-Gaussian Cox processes (LGCPs) offer a framework for regression-style modelling of point patterns that can accommodate latent effects. These latent effects can be used to account for missing predictors or other sources of clustering that could not be explained by a Poisson process. Such models are important in ecology where point patterns arise in the form of presence-only data - records of species' locations - and used to construct Species Distribution Models (SDMs) as a function of environmental variables. Fitting LGCP models can be difficult and time consuming and, as a result, limits the ability of researchers to flexibly analyse presence-only data. In this thesis, we develop novel methodology and software for fitting LGCP models, as well as demonstrating how to incorporate presence-only and other data sources jointly into SDMs. Fitting LGCPs quickly is challenging due to their intractable marginal likelihood which involves a high dimensional integral to account for the latent Gaussian field - leading to large spatial variance-covariance matrices. In this thesis we address these using a novel combination of variational approximation and reduced rank interpolation. Additionally, we implement automatic differentiation that enables us to obtain exact gradient information rapidly for computationally efficient optimisation and inference. We demonstrate the method’s performance through both simulations and a real data application, with promising results in terms of computational speed and accuracy compared to that of existing approaches. We then extend our novel method to combine presence-only data with that obtained through scientific surveys to improve SDM in what is called data integration. We demonstrate scenarios in which sharing both the latent influence and the species' response to environment across each data set can improve upon results achieved by modelling each individually - both via simulation and using real data involving several species of flora in NSW, Australia. Within this thesis we also illustrate the use of software developed to implement these advances via the freely available R package - scampr. The package allows users to fit likelihood-based LGCP to presence-only data swiftly and with a formula interface familiar to those with experience in other regression-style modelling frameworks implemented in R. en_US
dc.language English
dc.language.iso EN en_US
dc.publisher UNSW, Sydney en_US
dc.rights CC BY-NC-ND 3.0 en_US
dc.rights.uri en_US
dc.subject.other Ecology en_US
dc.subject.other Spatial point process en_US
dc.subject.other Log-Gaussian Cox Process en_US
dc.subject.other Presence-only data en_US
dc.subject.other Species distribution modelling en_US
dc.subject.other Data integration en_US
dc.title Fast methods for fitting log-Gaussian Cox process models in ecology. en_US
dc.type Thesis en_US
dcterms.accessRights open access
dcterms.rightsHolder Dovers, Elliot
dspace.entity.type Publication en_US
unsw.relation.faculty Science
unsw.relation.originalPublicationAffiliation Dovers, Elliot, School of Mathematics & Statistics, Science, UNSW en_US
unsw.relation.originalPublicationAffiliation Warton, David, Mathematics & Statistics, Faculty of Science, UNSW en_US
unsw.relation.originalPublicationAffiliation Popovic, Gordana, Mathematics & Statistics, Faculty of Science, UNSW en_US School of Mathematics & Statistics *
unsw.thesis.degreetype PhD Doctorate en_US
Original bundle
Now showing 1 - 1 of 1
No Thumbnail Available
public version.pdf
4.38 MB
Resource type