|
|
||||||||
| ABSTRACT |
|
|
|---|
| INTRODUCTION |
|
|
|---|
Population surveys of health and fertility provide valuable insights into the etiology of diseases in developing countries. Moreover, they can provide nationally and regionally representative estimates on a range of epidemiologic variables. The Demographic and Health Surveys (DHS) are a well-established source of reliable population data with a substantial focus on childhood diseases and health-seeking behavior. However, linking individual survey records with disease prevalence at the disaggregated small-scale, community level has not been possible because of the methodological restraints of traditional regression modeling.
It is critical for policy makers to understand the prevalence of these diseases to plan proper health-care interventions at a community level. In a previous analysis, we examined the spatial location and childhood diseases for the Nigerian DHS 1999 and 2003 separately; in this report, we combine the 1999 and 2003 rounds of the Surveys to further investigate overall patterns of childhood epidemics in Nigeria.
Specifically, we apply the generalized geoadditive model7 as an alternative to the common linear model, in the context of analyzing childhood disease in developing countries. This will enable us to account for nonlinear, location effects on childhood morbidity at the disaggregated district level (37 states) and to assess temporal and geographical variation in while simultaneously controlling for important risk factors.
| METHODS |
|
|
|---|
For the 2 surveys, data were collected on 9,581 children under the age of 5 years (59 months). By pooling the data from the 2 survey rounds, we were able to obtain a larger sample for each state, thus increasing the statistical power of our analyses. For the combined surveys, there were 8,189 observations for diarrhea, 8,152 for cough, and 8,165 for fever, representing 85.5%, 85.1%, and 85.2% of the total survey sample.
In both surveys, health status was assessed by the answer to the question posed to the mother: "Has your child had diarrhea/cough/fever in the last two weeks?"
Most of the covariates were recategorized to be comparable to previous studies.6,8,9,11 Poverty status is measured in the DHS surveys in terms of assets or wealth, rather than income or consumption. Ownership of consumer items, such as a radio or car, as well as characteristics of the dwelling, such as floor or roof type, toilet facilities, and water source, are items that measure the concept of poverty in these settings; this concept has been used by the World Bank to allocate households and thus their members into poverty quintiles, using principal components analysis (PCA).8
Statistical method.
In the descriptive analysis, 2 pairs of frequency tables were generated to show the disease prevalence by district (states) and by factors.
2 tests and Mann–Whitney tests were used to investigate the association between factors and diseases. Multivariate analysis shows the significance of the posterior mean estimates of fixed effects, nonlinear effects, and spatial effects. The model used in this study was a geoadditive semiparametric Bayesian model that contains a spatial effect.7,10,11 In this analysis, we apply a novel approach by exploring regional patterns of childhood morbidity and possible nonlinear effects of other factors within a simultaneous, coherent regression framework using a geoadditive semiparametric mixed model. The model used for this investigation has been described elsewhere.7,11,26 Briefly, we replace the strictly linear predictor
![]() |
with a geoadditive predictor, leading to the geoadditive regression model:
![]() |
where f1, . . . , fp are nonlinear smooth effects of the metrical covariates (childs and mothers age), and fspat(si) is the effect of the spatial covariate si
{1, . . . , S} labeling the districts (states) in Nigeria. Regression models with predictors as in (2) are sometimes referred to as geoadditive models. P-spline priors are assigned to the functions f1, . . . , fp and for fspat(si) we used Markov random field prior parameters.7,11,12,16,26
| RESULTS |
|
|
|---|
|
|
The mean prevalence of childhood cough appeared to be higher than diarrhea across the country. Children from the North East and South South had the highest association with diseases (31.6% and 32.5%, respectively). The prevalence of childhood cough did not vary significantly across districts in the South East (P = 0.195). In all other regions, the prevalence of childhood cough varied significantly between districts (all P < 0.05).
Childhood fever was more prevalent than diarrhea and cough. Four regions, the North West, North East, South South, and South East, showed > 30% of the children had had fever during the 2 weeks preceding the surveys. The prevalence of childhood fever did not significantly vary across states in the South East (P = 0.613). In all other regions, the prevalence of childhood fever varied significantly between districts (all P < 0.001).
Obviously, the observed prevalence fluctuated more at the district level than at the regional level. The regional classification masked the variations of morbidity in the states. For example, the lowest rate of diarrhea in the North Central area was 6.7% in Kwara State, whereas the highest was 27.5% in the federal capital, Abuja. This fact reveals that pronounced district variations were masked by regional classifications.
The distribution of all the factors we used in this analysis and their association with the diseases are shown in Table 2
. The following factors were significantly associated with diarrhea: place of residence (P < 0.001); asset index (P = 0.009); antenatal visits (P < 0.001); place of delivery (P < 0.001); type of feeding (P < 0.001); mothers education (P < 0.001); and partners education (P < 0.001). For cough, the significant factors were marital status (P = 0.005); antenatal visit (P = 0.001); type of feeding (P = 0.005); mothers education (P < 0.001); place of residence (P = 0.018); and partners education (P = 0.002). For childhood fever, significant factors were place of residence (P < 0.001); place of delivery (P < 0.001); type of feeding (P < 0.001); mothers education (P = 0.001); and partners education (P = 0.049).
|
The left panel of Figure 2
presents the residual spatial district effects on the childhood diseases in Nigeria. Red represents a higher prevalence of childhood diseases, whereas green denotes a lower prevalence. A high prevalence of all 3 diseases was concentrated in the North and East of the country.
|
The nonlinear effects of childs age and mothers age are shown in Figure 3
(left and right panel, respectively). Initially, the effect of childs age turned quickly from negative to positive for each disease as the child grew. Steeply rising curves at the beginning can be observed in all the graphs. During the first 8–10 months, the effects on childhood cough and fever reached the highest point. The effect of age on diarrhea rose slightly after this period to the highest point. Afterward, the effect of childs age on all diseases decreased. At
35 months, the confidence intervals widen, and by the end of the childs age category the intervals expand rapidly. Note that children < 35 months were included in the 1999 survey whereas children < 59 months were included in 2003. Thus the large intervals are caused by fewer observations after 35 months involved in the data set.
|
Table 3
shows the fixed effects of categorical covariates after controlling for spatial dependence in the data. Several factors were significantly associated with childhood morbidity. The overall effect of each factor is indicated by the posterior mean. Children from educated mothers have a lower association with childhood diarrhea. The upper quintiles of the asset index, which represents children in families in the group of highest household economic status, did not have preventative effects on diarrhea. Additionally, urbanized areas have a lower association with childhood fever. Children of mothers who had prenatal visits and delivered in a hospital were less likely to get fever. However, if mothers fed children with mixed foods (breast milk, water, fruit juice, or other solid foods) or the preceding birth interval was at least 2 years (> 25 months), then the association with childhood fever significantly increased. Unexpectedly, there is no factor showing any significant association with childhood cough.
|
| DISCUSSION |
|
|
|---|
We carried out a sensitivity analysis about different prior assumptions for spatial effects. It turned out that Markov random field (MRF) prior parameters outperformed the other prior parameters.17 The result is in line with the findings of a previous analysis using other survey data.6 There are various reasons for the higher associations with disease in the northern and eastern states. Sahelian drought is probably one of the most influential reasons. In recent studies,19,20 it has been found to be an important, socioeconomic determinant in both northern Nigeria and neighboring countries, which have suffered from increasing pressures on the available resources due to a fluctuating rainfall regimen in the latter part of the last century. Therefore, agricultural activities have been severely affected, and the resulting food security crisis forced people to consume unfit food and polluted water, which in turn affected feeding practices. Meanwhile, the resulting poverty prevents people from accessing the free health centers and buying medication. On the other hand, the higher association with childhood cough and fever, observed in several southern states, are probably linked to oil spillages and environmental pollution.2,21 In the 1950s, the discovery of oil in the Niger Delta began to drive the Nigerian economy and continues to dominate it today. However, the positive economic effect of oil has in turn degraded both the environment and general health. Oil spillage incidents have been increasing and occur mainly inland or in swamp and off-shore areas in the Niger Delta, South of Nigeria.21 The resulting pollution kills plants and animals, damages the environment, and destroys food chains, all of which contribute to local malnutrition and disease epidemics. Besides the oil pollution, gas flaring is also an important contributing factor to the formation of acid rain, which is toxic to both humans and animals alike and leads to soil degradation, deforestation, and an unbalanced ecosystem.22
States with a lower association with childhood morbidity are mainly restricted to western regions. Some are found in the north and central parts of the country. As pointed out in the previous study,23 this reduced association might derive from a regional advantage prior to the discovery of oil. People in these states benefited from free education, agricultural settlements, and industrial development.23 Thus parents are still likely to have more health-care knowledge and enjoy better health conditions, which could effectively lower the associations.
Compared with our previous study on the separate analysis of each data survey, childs age shows a pronounced nonlinear effect on all 3 diseases. The rapid deterioration of the effect of a childs age and the narrow confidence intervals during the first 8–10 months suggests that the association with childhood morbidity increases quickly and tapers off as age increases. This might be because of malnutrition in early feeding practice while a childs immune system is not sufficiently developed to protect him or her from contamination by bacteria. Because of poverty, insufficient household sanitary conditions, and other related factors, mothers might not be able to feed infants with enough clean breast milk so that mixed low-quality foods or contaminated water were used instead. For example, Ahiadeke24 found that diarrhea was prevalent in infants aged 4–6 months from households with poor sanitary conditions unless mothers reverted to exclusive breastfeeding; Woldemicael25 attributed the immediate deterioration pattern to the fact that most children were weaned during 6–12 months. It is likely that as the childs immune system develops, the associations of disease decreased, which is reflected by a gradual decline of the effect of age after 10 months in Figure 2
. Another possible explanation may be that maternal immunity may be protective among young children against infectious agents that may cause each of these symptoms. The decline of maternally derived immunity since birth may account for the increase in effect in early childhood. Moreover, in this analysis the confidence intervals tended to become larger after 35 months. It should be noted, however, that this is a combined data set based on 2 surveys involving children at 2 different ages. Fewer cases were observed in the combined data set after 3 years (35 months), thus the fluctuation of the intervals after 35 months increases markedly.
The approximate negative linear relationship between mothers of age 20–40 years and the 3 diseases suggest that a childs association with diarrhea, cough, and fever declines as their mothers get older. However, few data from younger (age < 20 years) and older (age > 40 years) mothers were collected (7.3% and 4.8% of the data, respectively). Thus, this is likely to be the explanation for the expanded 80% and 95% CIs in the graphs.
In line with the results from the 2003 survey, children of educated mothers had a lower association with diarrhea. This finding suggests that these mothers were likely to have more health-care knowledge to protect their children and to deal with these conditions more effectively. The World Bank and the 2003 NDHS reports both found that mothers education can significantly reduce childhood morbidity by improving the mothers health seeking ability.
In a previous study,6 only children from families of the second-lowest economic status had a significantly lower association with diarrhea than those from other families, while other SES categories showed no significant association with childhood diarrhea. It is important to point out that in this analysis the calculation of asset index was based on questions of ownership of several household items, such as a car, bicycle, radio, and telephone. It is possible that these items might be more common in urban areas than rural ones in SSA countries.26 As such, asset index classification may not represent the real economic structure of the households in Nigeria.
We also found several factors with significant effects on childhood fever. Table 3
shows that children whose mothers had had prenatal visits and delivered in hospitals experienced a significantly reduced association with childhood fever. It is likely that hospitals provided mothers with prenatal advice and administered professional health care to the infants. In addition to the mother-related factors, urban children had a significantly lower association than rural children across the country. Recent research27 showed that urban children were twice as likely to be immunized and have access to basic health-care services as rural children, which gives a reasonable explanation for this result. Children fed mixed foods (mixed breastfeeding) had the highest association with childhood fever among the 3 types of feeding practices. Mixed breastfeeding may cause infant malnutrition and increase the risks from infectious diseases. Some studies24,28,29 found evidence that mixed feeding is related to infant mortality. The high risk of fever was also found in children who were born at least 2 years after their previous siblings (preceding birth interval > 24 months). Several researchers found contradictory results showing childhood diarrhea decreases as birth interval increases.30,31 However, different data sources and time period might be the main reasons resulting in this difference.
In this report,
50% of the factors in the preliminary analysis showed a significant association with the 3 diseases (P < 0.01), while only 7% showed such an association with only 2 conditions (diarrhea and fever) in the multivariate analysis. A possible explanation for this is that health-care utilization reflects the accessibility of health-care services and has been captured by the spatial effects in the multivariate analysis.26
We identified an east–west spatial pattern for childhood diarrhea, cough, and fever by mapping the posterior mean results of the model. In addition, the highly nonlinear effect of childs age has emerged, which is difficult to assess with conventional parametric models. Our study indicates that policy makers should specifically address the increasing health issues among children under 8–10 months. More emphasis should be given to the WHO guidelines on exclusive breastfeeding for 6 months after birth, as we found that children who are solely breastfed are at a lower association with infection. In addition, mothers education, place of residence, prenatal visits, asset index, preceding birth interval, and place of delivery were also significantly associated with child morbidity.
It is worth mentioning some advantages of our approach over existing ones using, for example, logistic models with constant-fixed effects of covariates and fixed (or random) districts (states) effects or standard two-level multilevel modeling with unstructured spatial effects (see Appendix at www.ajtmh.org). With standard models, it is assumed that the random components at the contextual level (district or state) are mutually independent, even though, in practice, this assumption is not actually implied by these approaches, so correlated random residuals could also be specified.7 The independence assumption has an inherent problem of inconsistency: if the location of the event matters, then it makes sense to assume that areas close to each other are more similar than areas that are far apart. Also, the NDHS data are based on a random sample of districts or states. That is, the structured component introduced here allows us to "borrow strength" from neighbors to cope with the sample variation of the district or state effect and obtain estimates for areas that may have inadequate sample sizes or be unsampled. In the exploratory analysis, we tried several models to highlight the differences that can be found by adopting this approach in a spatial context and the possible bias involved with the violation of the independent assumption between aggregated spatial areas. Our final model has a spatial component and a random component that reflect spatial heterogeneity globally and relative homogeneity among neighboring states [see Model (2)], while the other model did not [see Model (1)]. Failure to take into account the posterior uncertainty in the spatial location (district or state) would overestimate the precision of the association of childhood morbidity in unsampled districts.
| POLICY IMPLICATION |
|
|
|---|
| APPENDIX |
|
|
|---|
Consider regression situations, where observations (yi; xi; wi); i = 1, . . . , N, on a binary response y, a vector x = (xi; . . . ; xp)' of metrical covariates (childs age), time scales or spatial covariates (districts), and a vector w = (w1; . . . ; wr)' of further covariates, in which categorical covariates are often given. The generalized additive modeling framework assumes that, given xi and wi, the distribution of the response yi belongs to an exponential family, with mean µi = E(yi|–xi, wi) linked to an additive semiparametric predictor µI = h(
).
Traditionally, the effect of the covariates on the response is modeled by a linear predictor
![]() |
where h is a known response function. The response variable in this application is defined as Yi = 1 if child i had diarrhea, cough, fever during the reference period t and Yi = 0 otherwise. We have a probit link function Pr(yi = 1|xi*) =
(
i) for the probability of having diarrhea, cough, fever at the reference period (i.e., we model the conditional probability of a child having diarrhea, cough, fever) given the childs age in months, the district where the child lives, and X, with predictor (A1).
In our application to childhood morbidity and in many other regression situations, we are facing the following problems:
To overcome these difficulties, we replace the strictly linear predictor through a geoadditive predictor, leading to the geoadditive regression model:
![]() |
here, f1, . . . , fp are nonlinear smooth effects of the metrical covariates, and fspat(si) is the effect of the spatial covariate si
{1, . . . , S} labeling the districts in Nigeria. Regression models with predictors as in (A2) are sometimes referred to as geoadditive models.
In a Bayesian approach, unknown functions fj and parameters
as well as the variance parameter
2 are considered as random variables and have to be supplemented with appropriate prior assumptions. In the absence of any prior knowledge, we assume independent diffuse priors
j
const, j = 1, . . . , r for the parameters of fixed effects. Another common choice is highly dispersed Gaussian priors.
Several alternatives are available as smoothness priors for the unknown functions fj(xj), see References 6, 7, and 10. We use Bayesian penalized splines (P-splines),14 introduced by Eilers and Marx13 in a frequentist setting. For the spatially correlated effect fstr(s), s = 1, S, we choose Markov random field prior parameters common in spatial statistics.6,7,10,11,14,18 These priors reflect spatial neighborhood relationships. For geographical data, one usually assumes that 2 sites or regions s and r are neighbors if they share a common boundary. Then a spatial extension of random walk models leads to the conditional, spatially autoregressive specification
![]() |
where Ns are the number of adjacent regions, and r
s denotes that region r is a neighbor of region s. Thus the (conditional) mean of fstr(s) is an average of function evaluations fstr(s) of neighboring regions. Again, the variance
2str controls the degree of smoothness.
For a spatially uncorrelated (unstructured) effect funstr, a common assumption is that the parameters funstr(s) are independently and identically distributed Gaussian
![]() |
The standard measure of effect is still the OR for a logistic model and mean for a probit model. However, because of the use of a fully Bayesian approach that relies on prior assumption to make posterior inference, instead of "OR" or mean, we have "posterior OR" or posterior mean. For model choice, we routinely use the deviance information criterion (DIC)17 developed as a measure of fit and model complexity instead of tests for significance, linear trends, and interactions. To account for possible departures from the assumed distribution, 95% confidence intervals (CIs) for the posterior ORs and probability maps (the equivalent of CIs for the spatial effects) are calculated using robust standard errors estimated via MCMC simulation techniques.
Received February 25, 2007. Accepted for publication June 13, 2007.
Acknowledgments: The authors thank participants at seminars in Warwick and at the BIOMETRIC Conference in Montreal and the British Population Conference in Southampton for helpful comments and discussions. We also thank Macro Int. for providing free data sets for Nigeria.
Financial support: This work is supported by the UK Economic & Social Research Council (ESRC, grant no. RES-000-22-1221).
* Address correspondence to Ngianga-Bakwin Kandala, Clifford Road Bridge, Clinical Sciences Research Institute, Warwick Medical School, Coventry CV2 2DX, United Kingdom. E-mail: N-B.kandala{at}warwick.ac.ukdiseases ![]()
Authors addresses: Ngianga-Bakwin Kandala, Chen Ji, Saverio Stranges, and Francesco P. Cappuccio, Clinical Sciences Research Institute, Warwick Medical School, Coventry CV2 2DX, U.K., Telephone: +44 (0)24 76968669, Fax: +44 (0)24 76968660, E-mail: N-B.Kandala{at}warwick.ac.uk. Nigel Stallard, Health Sciences Research Institute, Warwick Medical School, Coventry CV4 7AL, U.K.
Contributions: N.-B. Kandala—conception and design; literature review; data analysis and interpretation; drafting the article; approval of final article for submission. C. Ji—literature review; data analysis and interpretation; drafting the article. N. Stallard—interpretation of results; critical revisions for important intellectual content; approval of final article for submission. S. Stranges—interpretation of results; critical revisions for important intellectual content. F.P. Cappuccio—interpretation of results; critical revisions for important intellectual content; approval of final article for submission.
Reprint requests: Ngianga-Bakwin Kandala, Clinical Sciences Research Institute, Warwick Medical School, University of Warwick, Clifford Bridge Road, Coventry CV2 2DX, U.K., Telephone: +44 (0)24 76968669, Fax: +44 (0)24 76968660, E-mail: N-B.Kandala{at}warwick.ac.uk.
| REFERENCES |
|
|
|---|
This article has been cited by other articles:
![]() |
N.-B. Kandala, N. Nwakeze, and S. N. I. I. Kandala Spatial Distribution of Female Genital Mutilation in Nigeria Am J Trop Med Hyg, November 1, 2009; 81(5): 784 - 792. [Abstract] [Full Text] [PDF] |
||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |