Does socioeconomic status modify how individuals perceive or describe their own health? An assessment of reporting heterogeneity in the Health Survey for England
•,,.
...
Abstract
Background Self-reported health (SRH) is widely used as a proxy for health status; it is a simple, holistic measure of health and has been associated with other health outcomes. However, variation in these associations has been found by subgroup, leaving open the possibility of systematic reporting differences, including by socioeconomic status.
Methods Using data from the 2017 and 2018 waves of the Health Survey for England, we assessed the relationship between deprivation quintile and SRH using multiple linear regression models for EQ-5D score, a health-related quality of life measure. Reporting heterogeneity between deprivation quintiles was assessed by the significance of model interaction terms. Analyses were stratified by sex and accounted for age group, ethnicity, marital status and religion (females only).
Results Significant interactions were found between deprivation quintile and SRH in the models for EQ-5D score for participants reporting poor health in the two most deprived quintiles, with coefficients ranging from –0.041 to –0.098. The largest differences were in the second most deprived quintile for men (–0.075, 95% CI: –0.110 to –0.040, p<0.001) and the most deprived quintile for women (–0.098, 95% CI: –0.128 to –0.067, p<0.001). Secondary analyses using body mass index as the response variable, for models structured in a similar way to those for EQ-5D, found no significant interaction terms between deprivation quintile and SRH.
Conclusions This analysis suggests that people from more deprived areas who report themselves to be in poor health may have worse health, as measured by EQ-5D, than those from less deprived areas. This could lead to an underestimation of health inequalities, including in measures, such as healthy life expectancy in England, that incorporate SRH data.
What is already known on this subject
Self-reported health (SRH) is a widely used measure of health, but studies have shown mixed results on whether it may be subject to reporting differences by socioeconomic status.
As SRH data are used in England to calculate healthy life expectancy, a population health metric often used for quantifying health inequalities, further country-specific analysis will support public health practice and policy.
What this study adds
This is the first UK study to use a health utility measure as a benchmark for ‘objective’ health in assessing reporting differences by socioeconomic status, and the first to use the Index of Multiple Deprivation, facilitating its practical application, as the latter is widely used for measuring health inequalities in England.
It finds that people who live in the most deprived two quintiles of England who report poor health have lower health utility scores than would be expected based on their deprivation level and SRH status.
How this study might affect research, practice or policy
This study implies that using SRH as a proxy for health status could potentially underestimate health inequalities by socioeconomic status, and hence further research may wish to quantify what impact this may have on healthy life expectancy estimates and, potentially, public health resourcing.
Introduction
A single-question measure of self-reported health (SRH) has been widely used in health research globally. Variations exist, but the question typically asks: ‘How is your health in general? Would you say it is: very good, good, fair, bad, or very bad?’ Despite its simplicity, a range of associations with health outcomes have been found, including with mortality.1 2
SRH is also used to inform policy; for example, in England, it is used as the basis for the health component of healthy life expectancy, a measure summarising the health of populations relative to their life expectancy.3 Nonetheless, variation has been found in the magnitude (or existence) of associations by subgroup, suggesting that SRH may not mean the same for all groups.1 4–6
As Adams and White highlight, SRH may incorporate expectations about what ‘good’ or ‘bad’ health is and an assessment of how one’s own health relates to this.7 Given the known relationship between health and socioeconomic status—with gaps in life expectancy of 9.7 years for men and 8.0 years for women between the 10% most and least deprived areas in England in 2018–208—and qualitative research that also highlights how marginalised groups have lower health expectations,9 it is plausible that those from deprived areas may have a higher threshold for reporting poor health. This could lead to underestimates of health inequalities when SRH is used as a proxy for health.
However, disentangling differences in health perception and underlying health state are not straightforward. One response to this is to use SRH solely as a measure of people’s perception and reporting of health.10 However, this does not stop it being widely used in practice as a proxy for health status,11 with implications for resource allocation.5 12 In order to assess reporting heterogeneity, studies have generally looked for variation or interactions between SRH and one or more alternative health measures by socioeconomic status. Alternative health measures have included biomarkers,7 11 13 health utility values5 14 and self-reported health conditions/functional limitations, either separately6 or as part of a synthetic index.12
Findings have been mixed, with several studies suggesting the health of those with lower socioeconomic status is worse for the same level of SRH, but within papers this has depended on the health measure, sex, socioeconomic group and the level of health reported.6 7 11–13 Layes et al reported that those in the top two income deciles had better health utility values than their SRH level would predict, while the three lowest income deciles had lower health utility values than their SRH level would predict.5 Lindeboom and van Doorslaer did not find, overall, that income or education modified SRH.14
To our knowledge, only one study has previously used the Health Survey for England to explore reporting heterogeneity in SRH7 and none has used a health utility measure to explore this within a UK setting. This study aims to update and extend previous research by looking at the relationship between SRH and a health utility measure, EQ-5D, by socioeconomic status and sex in England.
Methods
We used a health-related quality of life measure, EQ-5D, as a proxy for ‘objective’ health to explore whether differences in SRH by deprivation reflect only ‘true’ differences in health status or whether reporting heterogeneity might modify this. We tested the hypothesis that if there were no reporting heterogeneity, no interaction between deprivation quintile and SRH would be seen in a linear regression model for EQ-5D utility score. Secondary analyses used body mass index (BMI) as the response variable. We also performed descriptive analyses to characterise the relationship between SRH, deprivation and other sociodemographic variables.
Data
This study uses data from the 2017 and 2018 waves of the Health Survey for England, a nationally representative, annual cross-sectional survey.15 16 It is commissioned nationally and each wave consists of a core set of questions/measures in addition to some which are wave-specific. The design and methods used in the 2017 and 2018 waves are summarised below, but full details are available in the relevant Methods reports.17 18
Each wave used a random stratified sampling design, first selecting primary sampling units based on postcode sectors, and then selecting addresses within these.17 18 Only one household per address was selected; this was chosen randomly for addresses with multiple dwelling units and/or households. Within each household, all adults (up to 10) were eligible for interview and up to four children, based on their ages. Interviews were face-to-face and used computer-assisted personal interviewing, consisting of a household questionnaire and an individual questionnaire, and a self-completion booklet. Participants were also weighed and measured at this interview. A subsequent nurse visit was also offered to all participants in 2017 and a subset of households in 2018, but no variables from this visit were included in our analysis. Fieldwork took place across the year in question and up to March of the following year (ie, January 2017 to March 2018 for the 2017 wave). Only private households are covered by the Health Survey for England and not institutional settings.
The 2017 and 2018 waves were chosen as they reflect two consecutive years when EQ-5D data were collected, as these are not collected in every wave, and the same version of the Index of Multiple Deprivation (IMD) was available. The use of two successive waves created a larger sample for analysis.
Variables
EQ-5D-5L (EQ-5D) is a health-related quality of life measure developed by EuroQol19 consisting of five domains (mobility, self-care, usual activities, pain/discomfort and anxiety/depression) that are assessed using five levels of response (none, slight, moderate, severe and extreme, scored as 1–5). EQ-5D is the preferred health utility tool of the National Institute for Health and Care Excellence (NICE), a body that provides recommendations on which medicines/treatments should be publicly funded in England and Wales.
We mapped the profile of responses (eg, 11111) using van Hout et al’s ‘crosswalk’ approach to a three-level value set derived from a UK population, as recommended by NICE at the time analysis on this study began.20–22 The values provide a population-based judgement of where each profile sits on a scale of –0.594 to 1, where 1 equates to optimal health and 0 notionally to death. Negative values are possible where a state is felt to be worse than death.
As EQ-5D is self-reported, one potential criticism is that it may itself be subject to reporting heterogeneity.11 However, Layes et al also used a health utility measure (the Health Utility Index Mark 3) to assess reporting heterogeneity in SRH on the basis that it covers a holistic view of health, the questions relate to functional measures of health, which may reduce subjectivity, and that the value set that responses are mapped to has been derived from the population rather than the individual in question.5 Nonetheless, to explore this potential limitation, we conducted secondary analyses using BMI (weight in kilograms divided by the square of height in metres) as the response variable instead of EQ-5D, on the basis that BMI can be objectively measured, and provides an informative complementary analysis to that provided by EQ-5D.
SRH was assessed via the question: ‘How is your health in general? Would you say it was…very good, good, fair, bad or very bad?’ We dichotomised responses into very good and good (referred to as ‘good’), and fair, bad, and very bad (referred to as ‘poor’). While dichotomisation results in a loss of information, this is how SRH is used in practice for the healthy life expectancy metric in England3 and hence this cut-point is relevant for understanding policy implications.
Socioeconomic status was measured using 2015 IMD quintiles.23 IMD is a composite, place-based measure of deprivation that is widely used for the measurement of health inequalities in England. The most deprived quintile indicates that the participant lived in the upper 20% most deprived neighbourhoods in England.
Age group, ethnicity, marital status, and religion were included as covariates in models.
Table 1 provides further details on how data were collected for each variable.
To test whether deprivation quintile interacted with SRH, multiple linear regression models for EQ-5D score were fitted, assuming approximate normality for EQ-5D scores which is consistent with modelling approaches used elsewhere, with (preplanned) separate models for males and females. Terms were added using a forward selection methodology; variables were added to a base model initially including only age group and models of increasing complexity were compared using likelihood ratio tests (LRTs) to the previous (less complex) model to assess whether adding the term(s) had improved the overall model fit. The final models included age, ethnicity, marital status, religion (females only), IMD quintile, SRH, and the interaction between IMD quintile and SRH. The same approach to model fitting was used for the secondary analysis with BMI as the response variable.
The regression analyses were stratified by sex; there is reason to believe that reporting heterogeneity in SRH may differ by sex14 and this has been the approach taken in other studies.6 11
The descriptive analyses were directly age standardised using an average of the 2017 and 2018 Office for National Statistics mid-year population estimates for England and a final age group of 85 years and above. These descriptive analyses were weighted using the weights recommended by the Health Survey for England, to allow generalisability to the English population. Weights were not used in the linear regression models as the analysis did not attempt to create nationally representative prevalence estimates.
All analyses were implemented in R (V.4.1.2 and V.4.2.3)24 and assessed significance at the 5% level.
Missing data
As EQ-5D was only collected from those aged 16 years and above, we excluded children under the age of 16 from our analysis, leaving a potential sample of 16 175 participants. Participants with missing data on variables of interest were excluded, with the analysis using an assumption that complete cases were, all other things being equal, the same as cases with missing data: that is, data were missing at random. These exclusions were made prior to models being fitted. All participants had data on age group, sex and IMD quintile, while data for SRH, marital status and ethnicity were highly complete (each greater than 99.5%, with SRH and marital status missing data from fewer than five participants each). EQ-5D and religion, which were both collected through a self-completion booklet, were less complete: 11% of participants were missing EQ-5D and 11% were missing religion. After exclusions were made for those with incomplete SRH, EQ-5D, marital status, ethnicity or religion, the final sample for analysis was 14 116 (87%). Due to the overlap between those missing EQ-5D and religion, the added impact of excluding religion after all other exclusions was only 1.7% of the sample aged 16 and above. Participant characteristics are summarised in online supplemental appendix table A.1. Due to missing BMI data, the sample used for the secondary analysis was 12 182 (75%).
Patient and public involvement
No patients or members of the public were involved in the design or completion of this study.
Results
Descriptive analysis
Weighted, age-standardised percentages of participants reporting good health by deprivation quintile and various characteristics are shown in table 2. Some categories were grouped for this analysis and BMI was categorised to allow for age-standardisation.
Table 2
|
Weighted and age-standardised proportion of participants reporting good health
As expected, there was a downward trend in self-reported good health with increasing deprivation, from 82.9% in the least deprived quintile to 63.9% in the most deprived quintile. This increased in each successive age group such that in those aged 65 years and above, the proportion reporting good health in quintile 1 was 1.6 times that of quintile 5 (70.2% vs 43.2%). In participants aged 50–64 years, below the age at which you can draw a state pension in the UK, 81.9% of those in quintile 1 reported good health compared with 55.3% in quintile 5. The former is similar to or slightly higher than the 80.0% of 16–34-year olds reporting good health in quintile 5.
A similar but slightly smaller proportion of females reported good SRH relative to males (74.5% vs 77.0%), while a smaller proportion of those who were single, separated, divorced or widowed reported good health relative to those who were married or cohabiting (68.2% vs 77.8%). Although the BMI group with the highest proportion reporting good health was the normal weight group (83.1%), this was similar to the overweight group (80.3%). In contrast, the proportion reporting good health in the obese group was lower (66.7%).
Linear regression
Linear regression models for EQ-5D utility scores were fitted, with separate models for males and females, including age, ethnicity, marital status, IMD quintile, SRH and the interaction between IMD quintile and SRH (table 3). Religion was included in the final female but not male model as including this in the latter did not improve fit.
Table 3
|
Linear regression models predicting EQ-5D score
Adding an interaction term between IMD quintile and SRH increased the explanatory power of the model for both males and females relative to models without this (LRT, p<0.001). For both males and females, the two most deprived quintiles showed significant interactions with SRH. The coefficients indicate that the individuals in those two quintiles who reported themselves to be in poor health have a lower EQ-5D score than would be predicted from their deprivation quintile and SRH status alone.
The magnitude of these differences in EQ-5D score ranged from –0.041 to –0.098. The largest difference for men was in the second most deprived quintile (–0.075, 95% CI: –0.110 to –0.040, p<0.001), which is similar to the impact of age for 55–59-year olds relative to 16–19-year olds (–0.076). The largest difference for women was in the most deprived quintile (–0.098, 95% CI: –0.128 to –0.067, p<0.001), which is higher in magnitude than the impact of age in all age groups up to and including those aged 80–84 years relative to those aged 16–19 years (–0.062).
As a secondary analysis, we fitted models with BMI as the response variable (table 4). Unlike the EQ-5D models, the male model with the deprivation and SRH interaction term was not a better fit than one without it (LRT, p=0.335), although the female one was (LRT, p=0.006). Although poor SRH and deprivation were predictors of higher BMI, none of the categories in the interaction term showed significant results for males or females.
Table 4
|
Linear regression models predicting BMI
Further EQ-5D models that included an interaction term between age and SRH did show an improvement relative to models without it for both males and females (online supplemental appendix table A.2). However, the interaction between the two most deprived quintiles and SRH remained significant for both males and females and this did not have a large effect on the magnitude of the coefficients (range: –0.004 to 0.003).
Discussion
This is the first study to assess reporting heterogeneity in SRH in the Health Survey for England using IMD as a measure of socioeconomic status and EQ-5D as a reference standard for health status. It also provides further descriptive analysis on the association between SRH and deprivation and how this varies by population characteristics.
We found that there was a statistically significant interaction between SRH and socioeconomic status for the relationships between those in the most deprived two quintiles who reported poor health, relative to those in the least deprived quintile reporting good health. The coefficients indicated that the EQ-5D score for these two quintiles (for both males and females) was lower when this additional information was included in the model (–0.041 to –0.098). The minimum clinically important difference for EQ-5D for the three-level value set used by this study has been estimated at 0.074,25 and two of the four findings meet this threshold: males in the second most deprived quintile (–0.075) and females in the most deprived quintile (–0.098). For the former, the additional effect on EQ-5D is similar to being aged 55–59 years (relative to being aged 16–19 years), whereas for the latter, this effect was greater than the impact of age for age groups up to and including 80–84 (relative to the 16–19 age group).
Furthermore, a summary of the ‘disutility’ or change in EQ-5D value associated with various conditions suggests that the interaction effects we found are in the same range as the disutility associated with asthma (–0.046), acute myocardial infarction (–0.063), diabetes (–0.071) or neurotic disorders (–0.094).26 A further study using Health Survey for England data estimated the impact of smoking 20 or more cigarettes a day (relative to never smoking) on EQ-5D as –0.062 and the impact of being morbidly obese (BMI≥40, relative to a BMI of 18.5 to <25) as –0.105.27 Given their similarity in size to those associated with known health issues and risk factors, this suggests that the magnitude of our findings may be meaningful.
The models predicting BMI did not show a significant interaction between SRH and deprivation. One explanation may be that although BMI is an objective measure, it is not necessarily a proxy for overall health; people may be unwell and have a normal BMI, and those with a raised BMI may be at greater risk of ill health but not yet be experiencing this. These models were also not a good fit, with adjusted R-squared values of 0.12 for males and 0.10 for females. SRH and deprivation (for the two/three most deprived deciles, depending on sex) were nonetheless independently associated with increased BMI.
This study’s strengths were that it used a large sample from a nationally representative survey and used the IMD as its measure of socioeconomic status. Although this is a place-based measure rather than a person-based one, it is widely used in the study of health inequalities in England, facilitating the interpretation of the results into practice. It was also available for all participants, reducing the risk of response bias or the need to limit the age range to account for those who might not have finished their studies or for whom occupational status may be less accurate, which has been a feature of other studies on reporting heterogeneity in SRH.5–7 12 13 Full data on all variables of interest were available for 87% of survey participants aged 16 and above, supporting its generalisability to English settings.
One potential limitation was the use of EQ-5D; as this is self-reported, it could be vulnerable to the same bias under investigation. This would mean our analyses underestimate the true difference in objective health for a given subjective health. However, as discussed above, there is reason to believe that if there is such a bias, it might not be so acute due to the focus of the questions on functional limitations and the use of a population-based value set.5 Nonetheless, future research may wish to explore alternative, more objectively defined, measures of health. A second limitation is that the models explained only around 30%–35% of the variation in EQ-5D. This could suggest that there is not a good match between the health concepts described by EQ-5D and SRH, or that SRH, as a categorical variable, and particularly in its dichotomised form, does not contain enough information to be able to explain the variation in EQ-5D values. One explanation for our findings could be that the distribution of responses within ‘poor’ health (fair/bad/very bad) may be more concentrated towards the very bad health end of the spectrum in participants from more deprived areas and hence that use of the full range of responses may not show reporting heterogeneity. Nonetheless, as the dichotomised version is what is used in practice for healthy life expectancy in England, our findings remain relevant. Further analysis may wish to explore reporting heterogeneity by SRH state.
This study implies that the use of healthy life expectancy measures based on SRH could potentially underestimate inequalities based on socioeconomic deprivation. Further study to quantify any impact on healthy life expectancy at area level would be of value to understand the policy implications. However, even if there is an element of underestimation, SRH remains a powerful tool for evidencing health inequalities in England, such as in our descriptive analyses—where 50–64-year olds in the least deprived quintile reported similar or slightly better health than 16–34-year olds in the most deprived quintile—and the simplicity of the SRH question has enabled it to be incorporated into routine, large-scale surveys, allowing monitoring of geographical inequalities and change over time.