Systematic Review of Approaches to Preserve Machine Learning Performance in the Presence of Temporal Dataset Shift in Clinical Medicine

Lin Lawrence Guo¹, Stephen R Pfohl², Jason Fries², Jose Posada², Scott Lanyon Fleming², Catherine Aftandilian³, Nigam Shah², Lillian Sung^{1

4}

Affiliations

¹ Program in Child Health Evaluative Sciences, The Hospital for Sick Children, Toronto, Canada.
² Biomedical Informatics Research, Stanford University, Palo Alto, California, United States.
³ Division of Pediatric Hematology/Oncology, Stanford University, Palo Alto, United States.
⁴ Division of Haematology/Oncology, The Hospital for Sick Children, Toronto, Canada.

PMID: 34470057
PMCID: PMC8410238
DOI: 10.1055/s-0041-1735184

Systematic Review of Approaches to Preserve Machine Learning Performance in the Presence of Temporal Dataset Shift in Clinical Medicine

Lin Lawrence Guo et al. Appl Clin Inform. 2021 Aug.

. 2021 Aug;12(4):808-815.

doi: 10.1055/s-0041-1735184. Epub 2021 Sep 1.

Authors

Lin Lawrence Guo¹, Stephen R Pfohl², Jason Fries², Jose Posada², Scott Lanyon Fleming², Catherine Aftandilian³, Nigam Shah², Lillian Sung^{1

4}

Affiliations

¹ Program in Child Health Evaluative Sciences, The Hospital for Sick Children, Toronto, Canada.
² Biomedical Informatics Research, Stanford University, Palo Alto, California, United States.
³ Division of Pediatric Hematology/Oncology, Stanford University, Palo Alto, United States.
⁴ Division of Haematology/Oncology, The Hospital for Sick Children, Toronto, Canada.

PMID: 34470057
PMCID: PMC8410238
DOI: 10.1055/s-0041-1735184

Abstract

Objective: The change in performance of machine learning models over time as a result of temporal dataset shift is a barrier to machine learning-derived models facilitating decision-making in clinical practice. Our aim was to describe technical procedures used to preserve the performance of machine learning models in the presence of temporal dataset shifts.

Methods: Studies were included if they were fully published articles that used machine learning and implemented a procedure to mitigate the effects of temporal dataset shift in a clinical setting. We described how dataset shift was measured, the procedures used to preserve model performance, and their effects.

Results: Of 4,457 potentially relevant publications identified, 15 were included. The impact of temporal dataset shift was primarily quantified using changes, usually deterioration, in calibration or discrimination. Calibration deterioration was more common (n = 11) than discrimination deterioration (n = 3). Mitigation strategies were categorized as model level or feature level. Model-level approaches (n = 15) were more common than feature-level approaches (n = 2), with the most common approaches being model refitting (n = 12), probability calibration (n = 7), model updating (n = 6), and model selection (n = 6). In general, all mitigation strategies were successful at preserving calibration but not uniformly successful in preserving discrimination.

Conclusion: There was limited research in preserving the performance of machine learning models in the presence of temporal dataset shift in clinical medicine. Future research could focus on the impact of dataset shift on clinical decision making, benchmark the mitigation strategies on a wider range of datasets and tasks, and identify optimal strategies for specific settings.

PubMed Disclaimer

Conflict of interest statement

None declared.

References

1. Challener D W, Prokop L J, Abu-Saleh O. The proliferation of reports on clinical scoring systems: issues about uptake and clinical utility. JAMA. 2019;321(24):2405–2406. - PubMed
1. Rajkomar A, Oren E, Chen K. Scalable and accurate deep learning with electronic health records. NPJ Digit Med. 2018;1:18. - PMC - PubMed
1. Harutyunyan H, Khachatrian H, Kale D C, Ver Steeg G, Galstyan A. Multitask learning and benchmarking with clinical time series data. Sci Data. 2019;6(01):96. - PMC - PubMed
1. Sendak M P, Balu S, Schulman K A. Barriers to Achieving Economies of Scale in Analysis of EHR Data. A Cautionary Tale. Appl Clin Inform. 2017;8(03):826–831. - PMC - PubMed
1. MI in Healthcare Workshop Working Group . Cutillo C M, Sharma K R, Foschini L, Kundu S, Mackintosh M, Mandl K D. Machine intelligence in healthcare-perspectives on trustworthiness, explainability, usability, and transparency. NPJ Digit Med. 2020;3:47. - PMC - PubMed

Publication types

Actions

MeSH terms

Actions
Actions
Actions
Actions

LinkOut - more resources

Full Text Sources

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Systematic Review of Approaches to Preserve Machine Learning Performance in the Presence of Temporal Dataset Shift in Clinical Medicine

Affiliations

Systematic Review of Approaches to Preserve Machine Learning Performance in the Presence of Temporal Dataset Shift in Clinical Medicine

Authors

Affiliations

Abstract

Conflict of interest statement

References

Publication types

MeSH terms

LinkOut - more resources

Full Text Sources