A simple imputation algorithm reduced missing data in SF-12 health surveys

doi:10.1016/j.jclinepi.2004.06.005

Journal of Clinical Epidemiology

Volume 58, Issue 2, February 2005, Pages 142-149

https://doi.org/10.1016/j.jclinepi.2004.06.005

Abstract

Objective

The SF-12 Health Survey is a 12-item questionnaire that yields two summary scores (physical and mental health). Neither score can be computed when an item is missing. We explored imputation methods for missing scores for this instrument.

Study design and setting

Using data from a population-based survey, we tested several ways of imputing simulated missing data.

Results

Among 1250 participants, 118 (9.4%) had at least one missing SF-12 item. Missing data were more common among women, older respondents, non-Swiss nationals, and health service users. Among the 1132 respondents with complete data, replacement of any item with the mean population item weight yielded good results: the mean correlation between imputed and true score was 0.979 for both the physical and mental score. Results remained satisfactory when up to three of the six key items for each score (items that contribute predominantly to a given score), and any number of non-key items, were replaced by the mean. Application of this imputation algorithm to the original survey reduced the proportion of missing scores to <1%. Respondents with incomplete surveys, and hence imputed scores, had lower scores than respondents with complete data (physical score: 44.9 vs. 49.8, p < 0.001; mental score: 44.4 vs. 46.3, p = 0.064).

Conclusions

A simple imputation algorithm can substantially reduce the proportion of missing scores for the SF-12 health survey, and consequently reduce non-response bias.

Introduction

Valid research requires complete data. Missing values reduce statistical power, and more importantly, may cause selection bias. Many studies have documented differences between respondents and non-respondents [1], [2], [3], [4], [5], or between early and late respondents [5], [6], [7], in health research. However, even among study participants, substantial proportions of some variables may have missing values. Possible bias due to partial survey response has attracted only limited attention [7], [8], [9].

Incomplete data cause particular concern when multiple data elements are combined to form a single variable, such as a composite score, a clinical prediction rule, or a multi-item psychometric scale. Without imputation, a single bit of missing information may cause the composite score to be missing. Obviously, the more items are combined, the greater the problem: if the probability of a missing value was 2% per item in a composite score, and if missing item probabilities were mutually independent, the probability of a missing score value would be 10% for a 5-item score, 18% for a 10-item score, and 33% for a 20-item score.
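The percentages quoted above follow from the complement rule: if each item is missing independently with probability p, the probability that at least one of k items is missing is 1 − (1 − p)^k. A minimal Python sketch of that arithmetic (the function name is ours, chosen for illustration):

```python
def prob_score_missing(p_item: float, n_items: int) -> float:
    """Probability that at least one of n_items is missing, assuming each
    item is missing independently with probability p_item."""
    return 1.0 - (1.0 - p_item) ** n_items


# Reproduce the 5-, 10-, and 20-item cases at 2% missingness per item.
for k in (5, 10, 20):
    print(f"{k:2d} items: {prob_score_missing(0.02, k):.1%}")
```

With p = 0.02 this gives roughly 9.6%, 18.3%, and 33.2%, which round to the 10%, 18%, and 33% cited in the text. The independence assumption is the one stated in the paragraph; in real surveys, item omissions tend to cluster within respondents, so the true score-level rate would usually be lower.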

Imputation rules alleviate this problem by substituting an acceptable value for the missing element, hence salvaging useful information available in non-missing items. When the composite score is essentially a sum of similar parts, a common imputation rule is “if up to half of the items are missing, replace missing values with mean of available items; if more than half are missing, declare score as missing.” In theory, replacement with the respondent's mean is appropriate only for strictly parallel tests, that is, when all items have the same mean, variance, and correlation with the underlying latent variable [10]. In practice, this rule is often used even when these conditions do not apply, and with good results as long as missing values are fairly rare. For instance, this imputation method is recommended for the SF-36 health survey, which consists of scales based on items that have the same number of response options, but not necessarily the same distribution [11].
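The "up to half" rule quoted above can be sketched in a few lines of Python. This is an illustration of the generic rule for a sum score, not the scoring code of any particular instrument; the function name and the use of `None` for an unanswered item are our assumptions.

```python
from typing import Optional, Sequence


def scale_score_with_person_mean(items: Sequence[Optional[float]]) -> Optional[float]:
    """Sum score in which missing items (None) are replaced by the
    respondent's own mean over the items that were answered.

    Returns None when more than half of the items are missing."""
    answered = [v for v in items if v is not None]
    n_missing = len(items) - len(answered)
    if not items or n_missing * 2 > len(items):
        return None  # too little information: declare the score missing
    person_mean = sum(answered) / len(answered)
    # Each missing item contributes the mean of the available items.
    return sum(answered) + n_missing * person_mean
```

For example, `scale_score_with_person_mean([1, 2, None, 3])` imputes the missing item as 2 and returns 8.0, whereas `[None, None, None, 4]` has more than half of its items missing and yields `None`.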

When the composite score is derived through a more complex formula, or when items are not identically distributed, replacement with the mean of available items will not work. Often, no imputation rule is available. This is the case for the mental and physical component summary scores (MCS and PCS) based on the SF-36 [12] and the SF-12 [13], [14] health surveys. Each summary score is a sum of 36 or 12 item weights. As no imputation rules exist, a single missing item value will cause missing values for both summary scores. As a result, the SF-12 summary scores have rates of missing data among respondents that are frequently around 10%, and sometimes exceed 25% [15], [16], [17], [18], [19], [20], [21]. Corresponding figures are necessarily higher for the SF-36 summary scores. This issue is of concern to anyone who considers using these popular instruments.

In this article, we examine selection bias due to incomplete responses to an SF-12 survey, explore several ways of imputing summary scores when one or more items have a missing value, and propose a simple and effective imputation algorithm.

Section snippets

Survey

A mail survey of residents of the French-speaking Swiss canton of Vaud was conducted in 1996 to produce local population norms for health status questionnaires [22]. Participants were selected at random from the official resident file, in strata defined by age (20–29 years to 70–79 years) and sex, of 200 persons each (total 2400). Two follow-up mailings were sent to non-respondents.

Questionnaire content

The questionnaire included the COOP Charts [22], the SF-36 health survey [11], including the 12 items needed for

Response rate

Of the initial sample, 2327 persons were eligible, and 1329 (57.1%) returned the questionnaire. A further 79 questionnaires were eliminated because of a mismatch on age or sex between questionnaire data and the original database, leaving 1250 (53.7%) questionnaires for the analysis.

Missing items

All SF-12 items were filled by 1132 (90.6%) respondents; 118 (9.4%) respondents failed to answer at least one item. Most (N = 66) omitted only one item, 16 respondents omitted two items, 13 omitted three items, 8 four

Discussion

This study confirmed that partial completion of the SF-12 health status questionnaire may cause bias, as women, foreigners, the elderly, and users of health services were less likely to answer all 12 questions. To correct this problem, we propose a simple imputation algorithm (replace each missing value with the mean population weight, for up to three key items and any number of non-key items), which works well in most situations. Imputation revealed that respondents with incomplete data had markedly lower
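As a rough illustration of the rule described in this section, the following Python sketch treats a summary score as a plain sum of item weights and substitutes the population mean weight for each unanswered item. The function name, the dictionary layout, and the example item labels and values are hypothetical; the actual SF-12 item weights and the list of key items for each score are not reproduced here.

```python
from typing import Dict, Optional, Set

MAX_IMPUTED_KEY_ITEMS = 3  # limit stated in the text


def impute_summary_score(
    weights: Dict[str, Optional[float]],
    mean_weights: Dict[str, float],
    key_items: Set[str],
) -> Optional[float]:
    """Sum of item weights for one summary score.

    weights:      the respondent's weight per item, None if unanswered
    mean_weights: population mean weight per item (e.g. from complete cases)
    key_items:    items that contribute predominantly to this score
    """
    missing = [item for item, w in weights.items() if w is None]
    if sum(item in key_items for item in missing) > MAX_IMPUTED_KEY_ITEMS:
        return None  # too many key items missing: the score stays missing
    # Non-key items may be imputed in any number.
    return sum(mean_weights[item] if w is None else w for item, w in weights.items())


# Hypothetical four-item example with made-up weights.
mean_w = {"a": 1.0, "b": 2.0, "c": 3.0, "d": 4.0}
resp = {"a": 0.5, "b": None, "c": 2.5, "d": None}
score = impute_summary_score(resp, mean_w, key_items={"a", "b"})  # 0.5 + 2.0 + 2.5 + 4.0
```

Because the physical and mental scores have different key items, the function would be called once per score with the corresponding weights and key-item set.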

References (27)

C.A. Macera et al. Patterns of non-response to a mail survey. J Clin Epidemiol (1990)

L. Richiardi et al. Analysis of nonresponse bias in a population-based case–control study on lung cancer. J Clin Epidemiol (2002)

J.F. Etter et al. Analysis of non-response bias in a mailed health survey. J Clin Epidemiol (1997)

S.K. Gandhi et al. Psychometric evaluation of the 12-item short form health survey (SF-12) in osteoarthritis and rheumatoid arthritis clinical trials. Clin Ther (2001)

G. Bravo et al. Estimating the reliability of continuous measures with Cronbach's alpha or the intraclass correlation coefficient: Toward the integration of two traditions. J Clin Epidemiol (1991)

M.C. Norton et al. Characteristics of nonresponders in a community survey of the elderly. J Am Geriatr Soc (1994)

K.M. Grotzinger et al. Assessment and control of nonresponse bias in a survey of medicine use by the elderly. Med Care (1994)

A. Paganini-Hill et al. Comparison of early and late respondents to a postal health survey questionnaire. Epidemiology (1993)

V. Helasoja et al. Late response and item nonresponse in the Finbalt Health Monitor survey. Eur J Public Health (2002)

D.J. Slymen et al. Item non-response to lifestyle assessment in an elderly cohort. Int J Epidemiol (1994)

E.W. Wolfe. Using logistic regression to detect item-level non-response bias in surveys. J Appl Meas (2003)

R.F. DeVellis. Scale development: theory and applications. Sage Publications (1991)

J.E. Ware. SF-36 Health Survey. Manual & interpretation guide (1993)

