TY - JOUR
T1 - Data quality evaluation for observational multiple sclerosis registries
AU - Kalincik, Tomas
AU - Kuhle, Jens
AU - Pucci, Eugenio
AU - Rojas, Juan Ignacio
AU - Tsolaki, Magda
AU - Sirbu, Carmen Adella
AU - Slee, Mark
AU - Butzkueven, Helmut
AU - MSBase Scientific Leadership Group and MSBase Study Group
PY - 2017/4/1
Y1 - 2017/4/1
N2 - Objective: Objective and reproducible evaluation of data quality is of paramount importance for studies of 'real-world' observational data. Here, we summarise a standardised data quality, density and generalisability process implemented by MSBase, a global multiple sclerosis (MS) cohort study. Methods: Error rate, data density score and generalisability score were developed using all 35,869 patients enrolled in MSBase as of November 2015. The data density score was calculated across six domains (follow-up, demography, visits, MS relapses, paraclinical data and therapy) and emphasised data completeness. The error rate evaluated syntactic accuracy and consistency of data. The generalisability score evaluated believability of the demographic and treatment information. Correlations among the three scores and the number of patients per centre were evaluated. Results: Errors were identified at the median rate of 3 per 100 patient-years. The generalisability score indicated the samples' representativeness of the known MS epidemiology. Moderate correlation between the density and generalisability scores (0.58) and a weak correlation between the error rate and the other two scores (-0.32 to -0.33) were observed. The generalisability score was strongly correlated with centre size (0.79). Conclusion: The implemented scores enable objective evaluation of the quality of observational MS data, with an impact on the design of future analyses.
AB - Objective: Objective and reproducible evaluation of data quality is of paramount importance for studies of 'real-world' observational data. Here, we summarise a standardised data quality, density and generalisability process implemented by MSBase, a global multiple sclerosis (MS) cohort study. Methods: Error rate, data density score and generalisability score were developed using all 35,869 patients enrolled in MSBase as of November 2015. The data density score was calculated across six domains (follow-up, demography, visits, MS relapses, paraclinical data and therapy) and emphasised data completeness. The error rate evaluated syntactic accuracy and consistency of data. The generalisability score evaluated believability of the demographic and treatment information. Correlations among the three scores and the number of patients per centre were evaluated. Results: Errors were identified at the median rate of 3 per 100 patient-years. The generalisability score indicated the samples' representativeness of the known MS epidemiology. Moderate correlation between the density and generalisability scores (0.58) and a weak correlation between the error rate and the other two scores (-0.32 to -0.33) were observed. The generalisability score was strongly correlated with centre size (0.79). Conclusion: The implemented scores enable objective evaluation of the quality of observational MS data, with an impact on the design of future analyses.
KW - cohort study
KW - data reporting
KW - generalisability
KW - Multiple sclerosis
KW - quality
KW - representativeness
UR - http://www.scopus.com/inward/record.url?scp=85046082463&partnerID=8YFLogxK
UR - http://purl.org/au-research/grants/NHMRC/1080518
UR - http://purl.org/au-research/grants/NHMRC/1083539
UR - http://purl.org/au-research/grants/NHMRC/1032484
UR - http://purl.org/au-research/grants/NHMRC/1001216
U2 - 10.1177/1352458516662728
DO - 10.1177/1352458516662728
M3 - Article
C2 - 27481209
AN - SCOPUS:85046082463
SN - 1352-4585
VL - 23
SP - 647
EP - 655
JO - Multiple Sclerosis
JF - Multiple Sclerosis
IS - 5
ER -