Personalized mortality prediction driven by electronic medical data and a patient similarity metric

dc.contributor.authorLee, Joon
dc.contributor.authorMaslove, David M.
dc.contributor.authorDubin, Joel A.
dc.date.accessioned2026-06-02T19:17:32Z
dc.date.available2026-06-02T19:17:32Z
dc.date.issued2015-05-15
dc.description© 2015 Lee et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited
dc.description.abstractBackground Clinical outcome prediction normally employs statis, one-size-fits-all models that perform well for the average patient but are sub-optimal for individual patients with unique characteristics. In the era of digital healthcare, it is feasible to dynamically personalize decision support by identifying and analyzing similar past patients, in a way that is analogous to personalized product recommendation in e-commerce. Our objectives were: 1) to prove that analyzing only similar patients leads to better outcome prediction performance than analyzing all available patients, and 2) to characterize the trade-off between training data size and the degree of similarity between the training data and the index patient for whom prediction is to be made. Methods and Findings We deployed a cosine-similarity-based patient similarity metric (PSM) to an intensive care unit (ICU) database to identify patients that are most similar to each patient and subsequently to custom-build 30-day mortality prediction models. Rich clinical and administrative data from the first day in the ICU from 17,152 adult ICU admissions were analyzed. The results confirmed that using data from only a small subset of most similar patients for training improves predictive performance in comparison with using data from all available patients. The results also showed that when too few similar patients are used for training, predictive performance degrades due to the effects of small sample sizes. Our PSM-based approach outperformed well-known ICU severity of illness scores. Although the improved prediction performance is achieved at the cost of increased computational burden, Big Data technologies can help realize personalized data-driven decision support at the point of care. Conclusions The present study provides crucial empirical evidence for the promising potential of personalized data-driven decision support systems. With the increasing adoption of electronic medical record (EMR) systems, our novel medical data analytics contributes to meaningful use of EMR data.
dc.description.sponsorshipNatural Sciences and Engineering Research Council of Canada (NSERC), Discovery Grant RGPIN-2014-04743 || NSERC, Discovery Grant RGPIN-2014-05911.
dc.identifier.urihttps://doi.org/10.1371/journal.pone.0127428
dc.identifier.urihttps://hdl.handle.net/10012/23518
dc.language.isoen
dc.publisherPublic Library of Science
dc.relation.ispartofseriesPLoS ONE; 10(5); e0127428
dc.rightsAttribution 4.0 Internationalen
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/
dc.subjectintensive care units
dc.subjectforecasting
dc.subjectelectronic medical records
dc.subjectdeath rates
dc.subjectclinical medicine
dc.subjectdecision trees
dc.subjecthospitals
dc.subjectmedical risk factors
dc.titlePersonalized mortality prediction driven by electronic medical data and a patient similarity metric
dc.typeArticle
dcterms.bibliographicCitationLee J, Maslove DM, Dubin JA (2015) Personalized Mortality Prediction Driven by Electronic Medical Data and a Patient Similarity Metric. PLoS ONE 10(5): e0127428. https://doi.org/10.1371/journal.pone.0127428
uws.contributor.affiliation1Faculty of Health
uws.contributor.affiliation1Faculty of Mathematics
uws.contributor.affiliation2School of Public Health Sciences
uws.contributor.affiliation2Statistics and Actuarial Science
uws.peerReviewStatusReviewed
uws.scholarLevelFaculty
uws.typeOfResourceTexten

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
file - 2026-06-02T150559.903.pdf
Size:
706.47 KB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
4.47 KB
Format:
Item-specific license agreed upon to submission
Description: