The Dennis Trewin Prize, named after the former Australian Statistician Dennis Trewin AO, is awarded annually by the Canberra Branch of the Statistical Society of Australia for outstanding research in statistics or data science by a current or recently graduated postgraduate student from a ACT or regional NSW (excluding Sydney-Newcastle-Wollongong) university. The 2022 competition took place on Tuesday 25 October, online and in person. Over 20 people attended across the two platforms.
This year I was honoured to be invited to be a judge of the three short-listed candidates: Dr Fui Swen Kuh of Monash University, Mr Zhi Yang Tho of the ANU and Mr Jiazhen Xu also of the ANU. Below is a short description of each of their talks in order of presentation, based on the abstracts they provided. And the winner was … really hard to determine! In the end the judges awarded first prize to Zhi Yang , second prize to Swen and third prize to Jiazhen. Congratulations to all three contestants!
1. Fui Swen Kuh: A Holistic Bayesian Framework for Modelling Socio-Economic Health. We propose the novel LAtent Causal Socioeconomic Health (LACSH) index to holistically evaluate a country’s performance from the social, economic, political and environmental aspects to replace the narrowly focused gross domestic product (GDP). Our framework integrates the latent health factor index (LHFI) structure (a latent factor model), spatial modelling to formally account for spatial dependency among the nations and causal modelling to evaluate the impact of a continuous policy variable. We apply our methodology to investigate the causal effect of mandatory maternity leave days and government expenditure on healthcare on a country’s health. We believe this comprehensive approach is the first in the literature to capture a country’s holistic performance while accounting for spatial effects and examining the causal effect of public policy.
2. Zhi Yang Tho: Joint Mean and Correlation Regression Modelling for Multivariate Data. In the analysis of multivariate or multi-response data, researchers are often not only interested in studying how the mean (say) of each response evolves as a function of covariates, but also and simultaneously how the correlations between responses are related to one or more similarity/distance measures. To address such questions, we propose a novel joint mean and correlation regression model that simultaneously regresses the mean of each response against a set of covariates and the correlations between responses against a set of similarity measures, which can be applied to a wide variety of correlated discrete and (semi-)continuous responses. Under a general setting where the number of responses can tend to infinity with the number of clusters, we demonstrate that our proposed joint estimators of the regression coefficients and correlation parameters are consistent and asymptotically normally distributed with differing rates of convergence. We apply the proposed model to a dataset of overdispersed counts of 38 Carabidae ground beetle species sampled throughout Scotland, with results showing in particular that beetle total length and breeding season have statistically important effects in driving the correlations between beetle species.
3. Jiazhen Xu: Generalized Score Matching for Regression. Many probabilistic models that have an intractable normalizing constant may be extended to contain covariates. Since the evaluation of the exact likelihood is difficult or even impossible for these models, we propose score matching to avoid explicit computation of the normalizing constant. In the literature, score matching has so far only been developed for models in which the observations are independent and identically distributed (IID). However, the IID assumption does not hold in the traditional fixed design setting for regression-type models. To deal with the estimation of these covariate-dependent models, we present a new score matching approach for independent but not necessarily identically distributed data under a general framework for both continuous and discrete responses, which includes a novel generalized score matching method for count response regression. We prove that our proposed score matching estimators are consistent and asymptotically normal under mild regularity conditions. The theoretical results are supported by simulation studies and a real-data example. involving doctoral publication data.