Repository logo
 

Flexible statistical modeling of deaths by diarrhoea in South Africa.

dc.contributor.advisorRamroop, Shaun.
dc.contributor.advisorMwambi, Henry G.
dc.contributor.authorMbona, Sizwe Vincent.
dc.date.accessioned2013-12-17T11:00:43Z
dc.date.available2013-12-17T11:00:43Z
dc.date.created2013
dc.date.issued2013
dc.descriptionThesis (M.Sc.)-University of KwaZulu-Natal, Pietermaritzburg, 2013.en
dc.description.abstractThe purpose of this study is to investigate and understand data which are grouped into categories. Various statistical methods was studied for categorical binary responses to investigate the causes of death from diarrhoea in South Africa. Data collected included death type, sex, marital status, province of birth, province of death, place of death, province of residence, education status, smoking status and pregnancy status. The objective of this thesis is to investigate which of the above explanatory variables was most affected by diarrhoea in South Africa. To achieve this objective, different sample survey data analysis techniques are investigated. This includes sketching bar graphs and using several statistical methods namely, logistic regression, surveylogistic, generalised linear model, generalised linear mixed model, and generalised additive model. In the selection of the fixed effects, a bar graph is applied to the response variable individual profile graphs. A logistic regression model is used to identify which of the explanatory variables are more affected by diarrhoea. Statistical applications are conducted in SAS (Statistical Analysis Software). Hosmer and Lemeshow (2000) propose a statistic that they show, through simulation, is distributed as chi‐square when there is no replication in any of the subpopulations. Due to the similarity of the Hosmer and Lemeshow test for logistic regression, Parzen and Lipsitz (1999) suggest using 10 risk score groups. Nevertheless, based on simulation results, May and Hosmer (2004) show that, for all samples or samples with a large percentage of censored observations, the test rejects the null hypothesis too often. They suggest that the number of groups be chosen such that G=integer of {maximum of 12 and minimum of 10}. Lemeshow et al. (2004) state that the observations are firstly sorted in increasing order of their estimated event probability.en
dc.identifier.urihttp://hdl.handle.net/10413/10239
dc.language.isoen_ZAen
dc.subjectStatistics--Mathematics.en
dc.subjectMathematical statistics.en
dc.subjectStatistics--Data processing.en
dc.subjectLinear models (Statistics)en
dc.subjectTheses--Statistics and actuarial science.en
dc.titleFlexible statistical modeling of deaths by diarrhoea in South Africa.en
dc.typeThesisen

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Mbona_Sizwe_Vincent_2013.pdf
Size:
781.29 KB
Format:
Adobe Portable Document Format
Description:
Thesis

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.64 KB
Format:
Item-specific license agreed upon to submission
Description: