Zur Kurzanzeige

2021-04-02Zeitschriftenartikel
Facilitating harmonized data quality assessments. A data quality framework for observational health research data collections with software implementations in R
dc.contributor.authorSchmidt, Carsten Oliver
dc.contributor.authorStruckmann, Stephan
dc.contributor.authorEnzenbach, Cornelia
dc.contributor.authorReineke, Achim
dc.contributor.authorStausberg, Jürgen
dc.contributor.authorDamerow, Stefan
dc.contributor.authorHuebner, Marianne
dc.contributor.authorSchmidt, Börge
dc.contributor.authorSauerbrei, Willi
dc.contributor.authorRichter, Adrian
dc.date.accessioned2024-08-26T10:52:57Z
dc.date.available2024-08-26T10:52:57Z
dc.date.issued2021-04-02none
dc.identifier.other10.1186/s12874-021-01252-7
dc.identifier.urihttp://edoc.rki.de/176904/11961
dc.description.abstractBackground No standards exist for the handling and reporting of data quality in health research. This work introduces a data quality framework for observational health research data collections with supporting software implementations to facilitate harmonized data quality assessments. Methods Developments were guided by the evaluation of an existing data quality framework and literature reviews. Functions for the computation of data quality indicators were written in R. The concept and implementations are illustrated based on data from the population-based Study of Health in Pomerania (SHIP). Results The data quality framework comprises 34 data quality indicators. These target four aspects of data quality: compliance with pre-specified structural and technical requirements (integrity); presence of data values (completeness); inadmissible or uncertain data values and contradictions (consistency); unexpected distributions and associations (accuracy). R functions calculate data quality metrics based on the provided study data and metadata and R Markdown reports are generated. Guidance on the concept and tools is available through a dedicated website. Conclusions The presented data quality framework is the first of its kind for observational health research data collections that links a formal concept to implementations in R. The framework and tools facilitate harmonized data quality assessments in pursue of transparent and reproducible research. Application scenarios comprise data quality monitoring while a study is carried out as well as performing an initial data analysis before starting substantive scientific analyses but the developments are also of relevance beyond researcheng
dc.language.isoengnone
dc.publisherRobert Koch-Institut
dc.rights(CC BY 3.0 DE) Namensnennung 3.0 Deutschlandger
dc.rights.urihttp://creativecommons.org/licenses/by/3.0/de/
dc.subjectdata qualityeng
dc.subjectobservational health studieseng
dc.subjectdata qality indicatorseng
dc.subjectdata quality monitoringeng
dc.subjectinitial data analysiseng
dc.subjectReng
dc.subject.ddc610 Medizin und Gesundheitnone
dc.titleFacilitating harmonized data quality assessments. A data quality framework for observational health research data collections with software implementations in Rnone
dc.typearticle
dc.identifier.urnurn:nbn:de:0257-176904/11961-2
dc.type.versionpublishedVersionnone
local.edoc.container-titleBMC Medical Research Methodologynone
local.edoc.container-issn1471-2288none
local.edoc.pages15none
local.edoc.type-nameZeitschriftenartikel
local.edoc.container-typeperiodical
local.edoc.container-type-nameZeitschrift
local.edoc.container-urlhttps://bmcmedresmethodol.biomedcentral.com/none
local.edoc.container-publisher-nameSpringer Naturenone
local.edoc.container-volume21none
local.edoc.container-reportyear2021none
dc.description.versionPeer Reviewednone

Zur Kurzanzeige