Discussion
The National Disease Registration Service (NDRS) is committed to fulfilling the NHS’s Public Sector Equality Duty by publishing National and Official statistics broken down by health inequality dimensions, as standard, wherever this is feasible and compliant with data privacy rules. Currently, most regular reports of cancer epidemiology in England are provided broken down by age group, gender, ethnic background, IMD quintile and geography. Despite its inclusion in NHS England’s ‘Core20PLUS5’ framework, publishing by coastal status has not been implemented in these reports yet, because there is no agreed definition of ‘coastal communities’. This study has sought to rectify this deficit by nominating a simple, parsimonious definition that can be replicated throughout cancer data reporting. Our coastal definition is at LSOA-level because this is a widely used census geography in the UK, and because geographies of lower granularity mask important intercommunity variation in health outcomes. We have chosen a candidate variable that best explained differences in cancer outcomes in the presence of socioeconomic adjustment, but as the definition of ‘coastal’ chosen excluded these factors, our candidate variable still enables adjustment or stratification by accepted measures of deprivation and rurality, thus allowing multidimensional exploration of geographical variation in cancer outcomes.
Despite a range of evidence for health inequalities for coastal communities,3 5 6 8 25 data and insight into this area have been hampered by the lack of a coherent definition of ‘coastal’ that allows for the inclusion of both rural and urban coastal communities and is sufficiently granular to avoid the ‘masking’ of pockets of deprivation by more wealthy nearby areas. A report by Atterton et al (2006)5 employed the Vickers classification of Coastal Britain, which uses a local authority-level classification derived from a principal component analysis and k-means clustering algorithm, using initially 129 different geographical and demographical variables.26 A report by Public Health England in 2020 stated that those older people living in coastal areas may be at higher risk of social isolation and loneliness.25 This report comprised a literature review of studies that referenced coastal health/inequalities. Their working definition of ‘coastal’ was ‘any coastal settlement within a local authority area whose boundaries include the UK foreshore, including local authorities whose boundaries only include estuarine foreshore. Coastal settlements include seaside towns, ports and other areas which have a clear connection to the coastal economy’. Unfortunately, ‘clear’ in this context, was not defined. This report also noted that there was a paucity of data on coastal health in England, in comparison to studies of rural health inequalities.25
Asthana and Gibson (2022) mapped several health outcomes, including cancer, by Lower and Middle Super Output areas in England. Cardiovascular disease showed a distinct core/periphery distinction, as did several other health conditions. The working definition of ‘coastal’ in this study was LSOAs which include or overlap a built-up area of any size which lies within 500 m of the ‘Mean High Water Mark’ coastline. Again, it was noted that most health data reporting is made available at local authority or higher geographies, thus masking this core/periphery distinction. At the time of writing, several research groups are working to identify other definitions of ‘coastal’ for varied uses. This includes an ESRC funded project at the University of Plymouth and the ONS’s Built-Up Areas classification. These have been produced or are aimed at seeking definitions to support policy research, whereas our focus was purely associated with cancer data reporting. We anticipate that in future, a range of novel disease-specific empirically derived variables may be produced, which may share similarities with those reported here. At that point, it may be that a more harmonised, cross-disciplinary definition can be ascertained collaboratively, but this work must start with the development of more subject-specific solutions, as we provide here. It would also be interesting to compare how our ‘simpler’ methodology compares to these more complex approaches at capturing coastal inequalities.
Using an holistic definition of cancer, we sought to identify a coastal variable that would be broadly applicable to all cancers, but we recognise that, just as IMD quintile may not be the most appropriate measure of deprivation as it relates to all diseases, having a single definition of a concept enables acceleration in research and understanding of that concept. We anticipate that our selected candidate variable will not be the statistically superior version to capture coastal health variation across all cancer types and metrics, but statistical precision must be balanced against pragmatic utility. A single, easily computed variable can be widely implemented quickly across all analyses, providing comparability, consistency, and clarity, which is preferable to a suite of variables whose use varies per disease and which cannot then be directly compared.
Although our focus with this study was to produce a method to report cancer data by coastal status, it is unclear whether this description would be applicable in other disease settings. We report a simple sensitivity analysis showing that our coastal variable captures a significant amount of residual variation in CHD diagnoses above our adjustment set as an illustration, but further exploration of other diseases was beyond the scope of this work. Understanding coastal variation in other diseases represents an interesting future direction for research. Additionally, future research should apply this work to understand the causal processes through which coastal inequalities materialise to ensure that any measure of coastal status is capturing the correct information.
Strengths and limitations
We employed LSOA-level cancer statistics derived from the NDRS databases, covering all of England over multiple years. This constitutes gold-standard cancer data at a very granular level, which is a major strength of this study. We also back-up our cancer-specific findings with corroboration of the usefulness of our candidate variable in a non-cancer setting, namely in the modelling of CHD, to illustrate the potential cross-disciplinary utility of our definition.
The cancer data used in this study comprised the combined years 2016–2020 to obtain enough case numbers across all cancer metrics for statistical analysis, and to avoid most of the COVID-19 era data which may have differed meaningfully in its characteristics.27 Single-year analyses were beyond the scope of this study but given the relative stability of the presumed causal patterns at play, we feel that using this combined period was a valid approach.
Not all cancer types are associated with deprivation and demography in the same ways, for example, breast cancer incidence shows a seemingly paradoxical relationship with IMD.28 When considering the relationship of coastal geography with varied cancer metrics as we have done here, we cannot guarantee that the chosen variable would be the ‘best performing’ candidate for all cancer types. This cancer-specific work was beyond the scope of this study but given that the principal aim of this work was to derive a single, maximally ‘useful’ coastal variable, we posit that differences in results for specific cancers would be of academic interest only at this stage and would not alter our recommendations.
Our cancer and confounder datasets were not available for the 2021 boundary definitions of LSOAs at the time of writing, meaning that we could not revalidate our model using more recent data to match the 2021 boundary changes. This may limit uptake of their usage. To minimise this issue, we have included the 2021 LSOA codes and their ‘G_25_5’ coastal designation in our online supplemental file 3.2. These analyses will be updated following the release of the relevant datasets.