2023-03-09Zeitschriftenartikel
Detection of missed deaths in cancer registry data to reduce bias in long-term survival estimation
| dc.contributor.author | Dahm, Stefan | |
| dc.contributor.author | Barnes, Benjamin | |
| dc.contributor.author | Kraywinkel, Klaus | |
| dc.date.accessioned | 2026-01-15T08:17:23Z | |
| dc.date.available | 2026-01-15T08:17:23Z | |
| dc.date.issued | 2023-03-09 | none |
| dc.identifier.other | 10.3389/fonc.2023.1088657 | |
| dc.identifier.uri | http://edoc.rki.de/176904/13138 | |
| dc.description.abstract | Background: Population-based cancer survival estimates can provide insight into the real-world impacts of healthcare interventions and preventive services. However, estimation of survival rates obtained from population-based cancer registries can be biased due to missed incidence or incomplete vital status data. Long-term survival estimates in particular are prone to overestimation, since the proportion of deaths that are missed, for example through unregistered emigration, increases with follow-up time. This also applies to registry-based long-term prevalence estimates. The aim of this report is to introduce a method to detect missed deaths within cancer registry data such that long-term survival of cancer patients does not exceed survival in the general population. Methods: We analyzed data from 15 German epidemiologic cancer registries covering the years 1970-2016 and from Surveillance, Epidemiology, and End Results (SEER)-18 registries covering 1975-2015. The method is based on comparing survival times until exit (death or follow-up end) and ages at exit between deceased patients and surviving patients, stratified by diagnosis group, sex, age group and stage. Deceased patients with both follow-up time and age at exit in the highest percentile were regarded as outliers and used to fit a logistic regression. The regression was then used to classify each surviving patient as a survivor or a missed death. The procedure was repeated for lower percentile thresholds regarding deceased persons until long-term survival rates no longer exceeded the survival rates in the general population. Results: For the German cancer registry data, 0.9% of total deaths were classified as having been missed. Excluding these missed deaths reduced 20-year relative survival estimates for all cancers combined from 140% to 51%. For the whites in SEER data, classified missed deaths amounted to 0.02% of total deaths, resulting in 0.4 percent points lower 20-year relative survival rate for all cancers combined. Conclusion: The method described here classified a relatively small proportion of missed deaths yet reduced long-term survival estimates to more plausible levels. The effects of missed deaths should be considered when calculating long-term survival or prevalence estimates. | eng |
| dc.language.iso | eng | none |
| dc.publisher | Robert Koch-Institut | |
| dc.rights | (CC BY 3.0 DE) Namensnennung 3.0 Deutschland | ger |
| dc.rights.uri | http://creativecommons.org/licenses/by/3.0/de/ | |
| dc.subject | cancer registry data | eng |
| dc.subject | missed deaths | eng |
| dc.subject | long-term survival | eng |
| dc.subject | classification algorithm | eng |
| dc.subject | relative survival | eng |
| dc.subject.ddc | 610 Medizin und Gesundheit | none |
| dc.title | Detection of missed deaths in cancer registry data to reduce bias in long-term survival estimation | none |
| dc.type | article | |
| dc.identifier.urn | urn:nbn:de:0257-176904/13138-4 | |
| dc.type.version | publishedVersion | none |
| local.edoc.container-title | Frontiers in Oncology | none |
| local.edoc.type-name | Zeitschriftenartikel | |
| local.edoc.container-type | periodical | |
| local.edoc.container-type-name | Zeitschrift | |
| local.edoc.container-publisher-name | Frontiers Media SA | none |
| local.edoc.container-reportyear | 2023 | none |
| local.edoc.container-firstpage | 01 | none |
| local.edoc.container-lastpage | 12 | none |
| dc.description.version | Peer Reviewed | none |
