Doctoral Degrees (Statistics)
Permanent URI for this collectionhttps://hdl.handle.net/10413/7126
Browse
Browsing Doctoral Degrees (Statistics) by Author "Achia, Thomas Noel Ochieng."
Now showing 1 - 3 of 3
- Results Per Page
- Sort Options
Item Bayesian modelling of non–gaussian time series of serve acute respiratory illness.(2019) Musyoka, Raymond Nyoka.; Mwambi, Henry.; Achia, Thomas Noel Ochieng.; Gichangi, Anthony Simon Runo.Respiratory syncytial virus (RSV), Human metapneumovirus (HMPV) and Influenza are some of the major causes of acute lower respiratory tract infections (ALRTI) in children. Children younger than 1 year are the most susceptible to these infections. RSV and influenza infections occur seasonally in temperate climate regions. We developed statistical models that were assessed and compared to predict the relationship between weather and RSV incidence in chapter 2. Human metapneumovirus (HMPV) have similar symptoms to those caused by respiratory syncytial virus (RSV). Currently, only a few models satisfactorily capture the dynamics of time series data of these two viruses. In chapter 3, we used a negative binomial model to investigate the relationship between RSV and HMPV while adjusting for climatic factors. In chapter 4, we considered multiple viruses incorporating the time varying effects of these components.The occurrence of different diseases in time contributes to multivariate time series data. In this chapter, we describe an approach to analyze multivariate time series of disease counts and model the contemporaneous relationship between pathogens namely, RSV, HMPV and Flu. The use of the models described in this study, could help public health officials predict increases in each pathogen infection incidence among children and help them prepare and respond more swiftly to increasing incidence in low-resource regions or communities. We conclude that, preventing and controlling RSV infection subsequently reduces the incidence of HMPV. Respiratory syncytial virus (RSV) is one of the major causes of acute lower respiratory tract infections (ALRTI) in children. Children younger than 1 year are the most susceptible to RSV infection. RSV infections occur seasonally in temperate climate regions. Based on RSV surveillance and climatic data, we developed statistical models that were assessed and compared to predict the relationship between weather and RSV incidence among refugee children younger than 5 years in Dadaab refugee camp in Kenya. Most time-series analyses rely on the assumption of Gaussian-distributed data. However, surveillance data often do not have a Gaussian distribution. We used a generalised linear model (GLM) with a sinusoidal component over time to account for seasonal variation and extended it to a generalised additive model (GAM) with smoothing cubic splines. Climatic factors were included as covariates in the models before and after timescale decompositions, and the results were compared. Models with decomposed covariates fit RSV incidence data better than those without. The Poisson GAM with decomposed covariates of climatic factors fit the data well and had a higher explanatory and predictive power than GLM. The best model predicted the relationship between atmospheric conditions and RSV infection incidence among children younger than 5 years. Human metapneumovirus (HMPV) have similar symptoms to those caused by respiratory syncytial virus (RSV). The modes of transmission and dynamics of these epidemics still remain poorly understood. Climatic factors have long been suspected to be implicated in impacting on the number of cases for these epidemics. Currently, only a few models satisfactorily capture the dynamics of time series data of these two viruses. In this study, we used a negative binomial model to investigate the relationship between RSV and HMPV while adjusting for climatic factors. We specifically aimed at establishing the heterogeneity in the autoregressive effect to account for the influence between these viruses. Our findings showed that RSV contributed to the severity of HMPV. This was achieved through comparison of 12 models of various structures, including those with and without interaction between climatic cofactors. Most models do not consider multiple viruses nor incorporate the time varying effects of these components. Common ARIs etiologies identified in developing countries include respiratory syncytial virus (RSV), human metapneumovirus (HMPV), influenza viruses (Flu), parainfluenza viruses (PIV) and rhinoviruses with mixed co-infections in the respiratory tracts which make the etiology of Acute Respiratory Illness (ARI) complex. The occurrence of different diseases in time contributes to multivariate time series data. In this work, the surveillance data are aggregated by month and are not available at an individual level. This may lead to over-dispersion; hence the use of the negative binomial distribution. In this paper, we describe an approach to analyze multivariate time series of disease counts. A previously used model in the literature to address dependence between two different disease pathogens is extended. We model the contemporaneous relationship between pathogens, namely; RSV, HMPV and Flu from surveillance data in a refugee camp (Dadaab) for children under 5 years to investigate for serial correlation. The models evaluate for the presence of heterogeneity in the autoregressive effect for the different pathogens and whether after adjusting for seasonality, an epidemic component could be isolated within or between the pathogens. The model helps in distinguishing between an endemic and epidemic component of the time series that would allow the separation of the regular pattern from irregularities and outbreaks. The use of the models described in this study, can help public health officials predict increases in each pathogen infection incidence among children and help them prepare and respond more swiftly to increasing incidence in low-resource regions or communities. This knowledge helps public health officials to prepare for, and respond more effectively to increasing RSV incidence in low-resource regions or communities. The study has improved our understanding of the dynamics of RSV and HMPV in relation to climatic cofactors; thereby, setting a platform to devise better intervention measures to combat the epidemics. We conclude that, preventing and controlling RSV infection subsequently reduces the incidence of HMPV.Item Bayesian spatial models with application to HIV, TB and STI modeling in Kenya.(2014) Owino, Ngesa Oscar.; Mwambi, Henry Godwell.; Achia, Thomas Noel Ochieng.This dissertation is concerned with developing and extending statistical models in the area of spatial modeling with particular interest towards application to HIV, TB and HSV-2 data. Hierarchical spatial modeling is a common and useful approach for modeling complex spatially correlated data in many settings in epidemiological, public health and ecological studies. Chapter 1 of this thesis gives a chronological development of disease mapping models, from non-spatial to spatial and from single disease models to multiple disease models. In Chapter 2, a new model that relaxes the over-restrictive normal distribution assumption on the spatially unstructured random effect by using the generalised Gaussian distribution is introduced and investigated. The third chapter provides a framework for including sampling weights into the Bayesian hierarchical disease mapping model. In this model, design effect is used to re-scale the sample sizes. A new model for over dispersed spatially correlated binary data is developed in chapter 4 of this thesis; in this model, the over dispersion parameter is modeled by a beta random effect which is allowed to vary spatially also. In chapter 5, the common multiple spatial disease mapping models are reviewed and adopted for the binary data at hand since the original models were developed based on Poisson count data. The methodologies developed in this dissertation widen the toolbox for spatial analysis and disease mapping in applications in epidemiology and public health studies.Item Statistical methods for analysing complex survey data : an application to HIV/AIDS in Ethiopia.(2013) Mohammed, Mohammed Omar Musa.; Zewotir, Temesgen Tenaw.; Achia, Thomas Noel Ochieng.The HIV/AIDS pandemic is currently the most challenging public health matter that faces third world countries, especially those in Sub-Saharan Africa. Ethiopia, in East Africa, with a generalised and highly heterogeneous epidemic, is no exception, with HIV/AIDS affecting most sectors of the economy. The first case of HIV in Ethiopia was reported in 1984. Since then, HIV/AIDS has become a major public health con cern, leading the Government of Ethiopia to declare a public health emergency in 2002. In 2011, the adult HIV/AIDS prevalence in Ethiopia was estimated at 1.5%. Approximately 1.2 million Ethiopians were living with HIV/AIDS in 2010. Surveys are an important and popular tool for collecting data. Analytical use of survey data especially health survey data has become very common, with a focus on the association of particular outcome variables with explanatory variables at the population level. In this study we used the data from the 2005 Ethiopian Demographic and Health Survey, (EDHS 2005), and identified key demographic, socioeconomic, sociocultural, behavioral and proximate determinants of HIV/AIDS risk factor. Usually most survey analysts ignore the complex survey design issues like clustering, stratification and unequal probability of selection (weights). This study deals with complex survey design and takes the design aspect into account, because failure to do so leads to bias parameters estimates and standard error, wide confidence intervals and statistical tests will be incorrect. In this study, three statistical approaches were used to analyse the complex survey data. The first approach was a survey logistic regression used to model the binary outcome (HIV serostatus) and set of explanatory variables (the dependence of the HIV risk factors). The difference between survey logistic regression and the ordinary logistic regression is that survey logistic regression approach takes the study design into account during analysis. The second approach was a multilevel logistic regression model, that assumed that the data structure in the population was hierarchical, and that individual within household was selected from clusters that were randomly selected from a national sampling frame. We considered a three-level model for our analysis. This second approach considered the results from Frequentist and a Bayesian multilevel models. Bayesian methods can provide accurate estimates of the parameters and the uncertainty associated with them. The third approach used was a Spatial models approach where model parameters were estimated under the Integrated Nested Laplace Approximation (INLA) paradigm.