|
|
||||||||
| ABSTRACT |
|
|
|---|
| INTRODUCTION |
|
|
|---|
Prevalence of HIV infection in South Africa, as elsewhere, has previously been reported either as a national or a provincial average.1,2 Such aggregate estimates may eclipse the geographic heterogeneity in the HIV epidemic in a country. The HIV seroprevalence in a local area is often only conjecture, based on hearsay or on non-representative samples of the population. For more effective monitoring of local HIV prevalence, it is necessary to report the results of national HIV prevalence surveys in a geographically disaggregate form while avoiding the reporting or mapping of prevalence levels that may be subject to large random errors caused by the small samples on which they are based.
We have used data from the 2003 South African national household youth survey of HIV infection and sexual behavior5 to derive spatial statistical models of HIV prevalence in 15–24 year-old men and women separately and, using these models, performed spatial prediction of HIV prevalence for the entire area of South Africa. We also derived the prediction uncertainty at each map location. We therefore present the first continuous HIV prevalence maps for South Africa. The simultaneous mapping of estimated prevalence and the confidence interval of the prediction map ensures that HIV prevalence maps do not assume a false claim of precision for any particular locality. Nevertheless, detailed mapping of HIV prevalence in South Africa provides us with a picture of the geographic variation of the epidemic that overall national (or provincial) HIV prevalence estimates are unable to do. Such maps can be used to further guide local prevention and care efforts to ensure their relevance to the local HIV situation.
| METHODS |
|
|
|---|
For the purpose of spatial modeling, the total number of male and female respondents was aggregated for each of the 697 sampled EAs that could be geo-located, and the proportion infected with HIV, by sex, was calculated. Using geo-graphic information systems,11 the latitude and longitude of the centroid of each sampled EA was determined (Figure 1
). The map surface of South Africa was divided into 1,945 pixels of approximately 25 km x 25 km, for which model predictions of HIV prevalence were made that excluded areas for which no census data apply, such as nature reserves and game parks. Socioeconomic and demographic data were extracted from the 2001 census for each of the sampled EAs and for each of the pixels. The value for each pixel was calculated by summing the variable values over the census areas making up the area of the pixel, and allocating values in proportion to the area of the census entity contained within the pixel.
|
Generalized geostatistical spatial models using Markov Chain Monte Carlo (MCMC) simulation were fitted to HIV prevalence at sampled locations (EAs) to identify significant covariates from the census. Bayesian kriging was used to perform spatial prediction of HIV prevalence at the grid of all map locations (pixels). Details of the method are given in the Appendix. Spatial modeling and spatial prediction was carried out using the package geoRglm in the statistical software system R.13
| RESULTS |
|
|
|---|
In spatial logistic regression models, the following area variables were significantly associated with HIV infection in 15–24 year-old women: proportion of residents who were black Africans, proportion of 20–64 year-old women who were unemployed in an area, proportion of households that were informal, and proportion of 15–19 year-old persons who had not finished school. Factors significantly associated with HIV infection in young men were proportion of residents who were African Blacks and proportion of an area urban, but the proportion of 20–64 year-old unemployed men in an area was marginally significant (Table 1
).
|
|
|
| DISCUSSION |
|
|
|---|
To our knowledge, our maps are the first HIV prevalence maps showing variation in infection levels without restriction to administrative boundaries such as provinces. On the basis of a large sample of persons, our maps depict HIV prevalence with a known estimate of uncertainty. As one might expect, prevalence of infection does not abruptly change at provincial boundaries, but instead follows a continuum across these boundaries. This should form the basis for a more integrated response to HIV prevention and treatment, rather than one based essentially on provincial confines. For example, the high prevalence cluster in the area along the common provincial boundaries between Mpumalanga and KwaZulu Natal may benefit from between province coordination of anti-retroviral treatment provisions and HIV prevention campaigns.
South Africa can be divided into the high prevalence eastern half and the low prevalence western half. For women, the highest rates were evident in western parts of KwaZulu Natal near Lesotho and Swaziland, parts of southern Mpumalanga and in outlying areas of Durban. For men, the highest prevalence was in urban and in coastal areas of KwaZulu Natal, but at significantly lower levels than for women. The reasons why areas of peak prevalence in men and women of this age group do not coincide are not clear. It may be because the male partners of women in the survey were an average of four years older and were therefore often outside the age range of this survey, or it may be that many of their partners are migrant workers who work in areas with better employment prospects.5
Overall, young women are at approximately three times the risk for HIV infection than young men, as has been previously reported.2,5 This difference is vividly illustrated by the two prevalence maps (Figure 2
). The contrast is particularly striking in KwaZulu Natal where our model estimates a prevalence of 3% to < 5% for men compared with 25% to < 35% for women in areas of peak prevalence for women. There are many factors that place young women at higher risk of HIV infection than men; these have been identified in the main findings of this survey,5 including the finding that young women were more likely to have older partners than young men. This would expose them more directly to the already higher levels of HIV prevalence of older cohorts in the communities from which their partners are drawn, while young men are exposed to the levels of HIV prevalence in their own cohort, which is largely HIV negative.
Our maps concur with antenatal surveillance showing that prevalence is highest in KwaZulu Natal and lowest in Western Cape and Limpopo provinces.1 However, our maps show that there is considerable variation within provinces: e.g., the inland rural areas of the Western Cape showed a low prevalence of < 3% in men and < 5% in women, but the risk of HIV infection in and around urban areas of Cape Town was higher at > 3–5% for men and > 8–15% for women.
The prevalence of HIV in central parts of Limpopo Province was low compared with other northeastern parts of the country. We were unable to identify any factors that would explain this relatively low prevalence, other than a very low proportion of urbanization for this province.23
An area being urban was independently associated with risk of HIV infection in young men after adjusting for proportion who were black Africans. An association between urban residence status and HIV infection is likely to be the result of increased mobility and possibly of a different youth culture that is less affected by parental control. Nevertheless, the survey found no difference in mean age for first time having sex between rural and urban men. Furthermore, we did not find an association between informal housing and HIV infection in young men, although we did find such an association in young women (Table 1
). Such an association has been reported in other studies.2 This negative finding may be the result of informal settlements not being homogenous in their socioeconomic and cultural milieu, or it may be that the classification available from the census is at insufficient spatial resolution to separate informal areas from those containing both formal and informal residences.
Of the potential area-based explanatory variables that can be obtained from the census, the proportion of the residents who are black Africans in an area was strongly related to HIV infection in both young women and young men. When HIV status was analyzed in relation to questionnaire derived variables from the survey,5 a similar result was obtained even after adjusting for socioeconomic and sexual behavior variables that were available from the questionnaire. The strong association between HIV infection and the proportion who were black Africans in an area is likely to be the result of the historical differences in the HIV epidemic between Africans and other race groups in South Africa. It has been shown that the system of migrant labor during the apartheid era, which resulted in men spending long periods separated from their families, directly fueled the HIV epidemic in South Africas black population.24 Migration in South Africa is characterized by a high prevalence of temporary or circular migration23 that has endured from entrenched patterns of labor migration in the past.25 As a result, there is extensive mixing between many city dwellers and their rural counterparts that would have had a considerable impact on the pattern of the HIV epidemic in South Africa,26 with a high prevalence of infection not only in urban centers, but also in many rural areas and small towns.
The proportion of the economically active population being unemployed was clearly related to HIV infection for both women and men (albeit marginally in the case of men), which confirmed that HIV in South Africa is linked to poor socioeconomic conditions and poverty. However, as with most of the other variables, this association did not persist once the proportion of area having black Africans was included in the model because of the correlation between unemployment and race.
It is noteworthy that the odds ratio for urban status of an area relative to a rural area and the odds ratio per percent increase in the racial composition of an area having black Africans both increased for men when adjusting for each other (Tables 1
and 2
). This implies that in the single variable models, urban residence was negatively confounded by the proportion who were black Africans and vice versa. This is plausible because rural areas generally have more black Africans in their population than urban areas. Conversely, for young women, the predominant risk factor for HIV infection was the racial composition of their area; living in an urban or rural area had no further effect.
A constraint to our model-based mapping approach is the limitation imposed by the restricted number of variables that are available from the national population census. To model and predict prevalence based on relevant social, demographic, and economic indicators, we require these to be available at all map locations, which generally means that only census-derived variables can be used. For instance, we would have liked to use a variable that quantifies the proportion of adults who are migrant workers, had this been available.
Other limitations of our study are those that apply to the survey from which the data are drawn, in particular, the low response rates among white youths, which may have affected our estimates in some areas. A further limitation is the computationally intense nature of the Bayesian MCMC–based modeling process, which limited the spatial resolution of the grid for which we predicted prevalence, resulting in the fairly large grid-blocks (pixels) that characterize the prediction maps.
Maps such as the ones we have produced here are based on random samples of the population and are therefore a far more reliable guide to the variation of HIV prevalence in the population than reports of much higher prevalence, which are based on volunteers who are screened for cohort studies.
The confidence intervals that accompany our map predictions will provide some assurance against over-interpretation of the data. The confidence interval maps indicate that the high prevalence among women in northwestern KwaZulu Natal may be no higher than 15%, but could be as high as 65%. Similarly, the low prevalence of HIV among women in the Western Cape, may be as high as 15%. To avoid misinterpretation of HIV prevalence maps, we would urge that these should always be accompanied by corresponding uncertainty or confidence interval maps.
The value of our maps is that they can be used as a health management tool that shows which parts of provinces are in need of particular attention. In these areas of high prevalence, further focused research should be done to inform intensified prevention and treatment programs. Because young people enter the age group largely uninfected, prevalence in this age group is a reasonable proxy for age-specific incidence of infection. Maps showing change in prevalence based on two or more surveys would provide valuable clues as to where prevention efforts are bearing fruit, and where they must be strengthened.
| APPENDIX |
|
|
|---|
Let Yji represent the binary response corresponding to the infection status of individual j at site i (the sampled EAs) taking value 1 if the youth tested HIV positive and 0 otherwise. The Yji are conditionally independent Bernoulli variables with infection probability at location i. The pi are defined via a generalized linear mixed model to take account of spatial dependence logit(pi) = Xiβ + S(
i) where β represents the regression coefficients for a set of known covariates X at the centroids
i of all sampled EAs in the study area. S = (S(
1), . . . . , S(
n))T denotes the values of the (unobserved) Gaussian spatial process S(·) at sample locations
i;
2 = Var{S(
)}, and
is a parameter of the correlation function
(dij,
), in our case exp(–dij/
), where dij is the distance between locations
i and
j.
Prior distributions were specified for each model parameter. For β and
2, flat and uniform priors were specified, respectively. For
, a discrete exponential prior with mean of 0.8 and 100 discretization points in the interval 0.2–1.2 was specified.
Let S* represent the values of S(·) at other locations of interest, i.e., the grid of pixels covering the entire map. To produce a smooth map of HIV prevalence, we estimated YO = (Y(
O1), Y(
O2), . . . . . Y(
ON)) at these unsampled locations
O = (
O1,
O2, . . . . .
ON) by obtaining MCMC estimates from the following predictive distribution:
![]() |
where
![]() |
with
![]() |
is the distribution of the spatial effects at new sites (kriging); and P(β, S,
2,
| Y) is the posterior distribution of the model parameters.
Convergence was assessed by inspecting plots of traces of simulations for individual parameters. The first 10,000 iterations were discarded; thereafter every 50th sample was retained until 5,000 iterations had been attained, i.e. simulations were run for 250,000 iterations.
At each prediction location, the median and 2.5th percentiles and 97.5th percentiles were calculated from the MCMC simulations, and mapped, using ArcView.30 Approximate P values were derived from z-scores that were calculated as the mean divided by a quarter of the 2.5th–97.5th percentile range.
Received September 4, 2006. Accepted for publication November 1, 2006.
* Address correspondence to Immo Kleinschmidt, London School of Hygiene and Tropical Medicine, Keppel Street, London WC1E 7HT, UK. E-mail: Immo.kleinschmidt{at}lshtm.ac.uk ![]()
Authors addresses: Immo Kleinschmidt, Tropical Epidemiology Group, London School of Hygiene and Tropical Medicine, Keppel Street, London WC1E 7HT, United Kingdom, Telephone: 44 20 7927 2103, Fax: 44 20 7636 8739, E-mail: Immo.kleinschmidt{at}lshtm.ac.uk. Natasha Morris, Medical Research Council of South Africa, 491 Ridge Road, Durban, South Africa, Telephone: 27-31-203- 4806, Fax: 27-31-203-4704, E-mail: nmorris{at}mrc.ac.za. Audrey Pettifor, Department of Epidemiology, University of North Carolina, Chapel Hill, NC 27514, E-mail: a.pettifor{at}rhrujhb.co.za. Catherine MacPhail and Helen Rees, Reproductive Health and HIV Research Unit, University of the Witwatersrand, Johannesburg, South Africa, E-mails: c.macphail{at}rhrujhb.co.za and h.rees{at}rhrujhb.co.za.
| REFERENCES |
|
|
|---|
This article has been cited by other articles:
![]() |
F. Tanser, T. Barnighausen, G. S Cooke, and M.-L. Newell Localized spatial clustering of HIV infections in a widely disseminated rural South African epidemic Int. J. Epidemiol., March 4, 2009; (2009) dyp148v1. [Abstract] [Full Text] [PDF] |
||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |