Addressing issues in sparseness, ecological bias and formulation of the adjacency matrix in Bayesian spatiotemporal analysis of disease counts
Earnest, Arul (2010) Addressing issues in sparseness, ecological bias and formulation of the adjacency matrix in Bayesian spatiotemporal analysis of disease counts. PhD thesis, Queensland University of Technology.

Arul Earnest Thesis (PDF 1MB)  

Arul Earnest Citation (PDF 9kB) 
Abstract
The main objective of this PhD was to further develop Bayesian spatiotemporal models (specifically the Conditional Autoregressive (CAR) class of models), for the analysis of sparse disease outcomes such as birth defects. The motivation for the thesis arose from problems encountered when analyzing a large birth defect registry in New South Wales. The specific components and related research objectives of the thesis were developed from gaps in the literature on current formulations of the CAR model, and health service planning requirements. Data from a large probabilisticallylinked database from 1990 to 2004, consisting of fields from two separate registries: the Birth Defect Registry (BDR) and Midwives Data Collection (MDC) were used in the analyses in this thesis.
The main objective was split into smaller goals. The first goal was to determine how the specification of the neighbourhood weight matrix will affect the smoothing properties of the CAR model, and this is the focus of chapter 6. Secondly, I hoped to evaluate the usefulness of incorporating a zeroinflated Poisson (ZIP) component as well as a sharedcomponent model in terms of modeling a sparse outcome, and this is carried out in chapter 7. The third goal was to identify optimal sampling and sample size schemes designed to select individual level data for a hybrid ecological spatial model, and this is done in chapter 8. Finally, I wanted to put together the earlier improvements to the CAR model, and along with demographic projections, provide forecasts for birth defects at the SLA level. Chapter 9 describes how this is done.
For the first objective, I examined a series of neighbourhood weight matrices, and showed how smoothing the relative risk estimates according to similarity by an important covariate (i.e. maternal age) helped improve the model’s ability to recover the underlying risk, as compared to the traditional adjacency (specifically the Queen) method of applying weights.
Next, to address the sparseness and excess zeros commonly encountered in the analysis of rare outcomes such as birth defects, I compared a few models, including an extension of the usual Poisson model to encompass excess zeros in the data. This was achieved via a mixture model, which also encompassed the shared component model to improve on the estimation of sparse counts through borrowing strength across a shared component (e.g. latent risk factor/s) with the referent outcome (caesarean section was used in this example). Using the Deviance Information Criteria (DIC), I showed how the proposed model performed better than the usual models, but only when both outcomes shared a strong spatial correlation.
The next objective involved identifying the optimal sampling and sample size strategy for incorporating individuallevel data with areal covariates in a hybrid study design. I performed extensive simulation studies, evaluating thirteen different sampling schemes along with variations in sample size. This was done in the context of an ecological regression model that incorporated spatial correlation in the outcomes, as well as accommodating both individual and areal measures of covariates. Using the Average Mean Squared Error (AMSE), I showed how a simple random sample of 20% of the SLAs, followed by selecting all cases in the SLAs chosen, along with an equal number of controls, provided the lowest AMSE.
The final objective involved combining the improved spatiotemporal CAR model with population (i.e. women) forecasts, to provide 30year annual estimates of birth defects at the Statistical Local Area (SLA) level in New South Wales, Australia. The projections were illustrated using sixteen different SLAs, representing the various areal measures of socioeconomic status and remoteness. A sensitivity analysis of the assumptions used in the projection was also undertaken.
By the end of the thesis, I will show how challenges in the spatial analysis of rare diseases such as birth defects can be addressed, by specifically formulating the neighbourhood weight matrix to smooth according to a key covariate (i.e. maternal age), incorporating a ZIP component to model excess zeros in outcomes and borrowing strength from a referent outcome (i.e. caesarean counts). An efficient strategy to sample individuallevel data and sample size considerations for rare disease will also be presented. Finally, projections in birth defect categories at the SLA level will be made.
Impact and interest:
Citation counts are sourced monthly from Scopus and Web of Science® citation databases.
These databases contain citations from different subsets of available publications and different time periods and thus the citation count from each is usually different. Some works are not in either database and no count is displayed. Scopus includes citations from articles published in 1996 onwards, and Web of Science® generally from 1980 onwards.
Citations counts from the Google Scholar™ indexing service can be viewed at the linked Google Scholar™ search.
Fulltext downloads:
Fulltext downloads displays the total number of times this work’s files (e.g., a PDF) have been downloaded from QUT ePrints as well as the number of downloads in the previous 365 days. The count includes downloads for all files if a work has more than one.
ID Code:  31842 

Item Type:  QUT Thesis (PhD) 
Supervisor:  Mengersen, Kerrie, Pettitt, Anthony, & Morgan, Geoffrey 
Keywords:  spatial, autoregressive, disease mapping, CAR model, birth defects, ecological bias, neighbourhood weight matrix, forecasting, priors, Bayesian, MCMC, joint modeling 
Divisions:  Past > QUT Faculties & Divisions > Faculty of Science and Technology Past > Schools > Mathematical Sciences 
Institution:  Queensland University of Technology 
Deposited On:  19 Apr 2010 02:13 
Last Modified:  28 Oct 2011 19:56 
Export: EndNote  Dublin Core  BibTeX
Repository Staff Only: item control page