Purdue University Libraries Open Access Publishing Fund

Interpretable Machine Learning Models for Hospital Readmission Prediction: A Two-step Extracted Regression Tree Approach

Xiaoquan Gao, Purdue University
Sabriya Alam, University of California, Berkeley
Pengyi Shi, Purdue University
Franklin Dexter, University of Iowa
Nan Kong, Purdue University

Recommended Citation

Gao, X., Alam, S., Shi, P. et al. Interpretable machine learning models for hospital readmission prediction: a two-step extracted regression tree approach. BMC Med Inform Decis Mak 23, 104 (2023). https://doi.org/10.1186/s12911-023-02193-5

DOI

10.1186/s12911-023-02193-5

Date of this Version

6-5-2023

Keywords

Hospital readmission, interpretable machine learning, risk prediction, administrative data, risk factors

Abstract

Background

Advanced machine learning models have received wide attention in assisting medical decision making due to the greater accuracy they can achieve. However, their limited interpretability imposes barriers for practitioners to adopt them. Recent advancements in interpretable machine learning tools allow us to look inside the black box of advanced prediction methods to extract interpretable models while maintaining similar prediction accuracy, but few studies have investigated the specific hospital readmission prediction problem with this spirit.

Methods

Our goal is to develop a machine-learning (ML) algorithm that can predict 30- and 90- day hospital readmissions as accurately as black box algorithms while providing medically interpretable insights into readmission risk factors. Leveraging a state-of-art interpretable ML model, we use a two-step Extracted Regression Tree approach to achieve this goal. In the first step, we train a black box prediction algorithm. In the second step, we extract a regression tree from the output of the black box algorithm that allows direct interpretation of medically relevant risk factors. We use data from a large teaching hospital in Asia to learn the ML model and verify our two-step approach.

Results

The two-step method can obtain similar prediction performance as the best black box model, such as Neural Networks, measured by three metrics: accuracy, the Area Under the Curve (AUC) and the Area Under the Precision-Recall Curve (AUPRC), while maintaining interpretability. Further, to examine whether the prediction results match the known medical insights (i.e., the model is truly interpretable and produces reasonable results), we show that key readmission risk factors extracted by the two-step approach are consistent with those found in the medical literature.

Conclusions

The proposed two-step approach yields meaningful prediction results that are both accurate and interpretable. This study suggests a viable means to improve the trust of machine learning based models in clinical practice for predicting readmissions through the two-step approach.

Comments

This is the published version of the Gao, X., Alam, S., Shi, P. et al. Interpretable machine learning models for hospital readmission prediction: a two-step extracted regression tree approach. BMC Med Inform Decis Mak 23, 104 (2023). https://doi.org/10.1186/s12911-023-02193-5

Download

Find in your library

COinS

Purdue University Libraries Open Access Publishing Fund

Interpretable Machine Learning Models for Hospital Readmission Prediction: A Two-step Extracted Regression Tree Approach

Recommended Citation

DOI

Date of this Version

Keywords

Abstract

Background

Methods

Results

Conclusions

Comments

Search

Links

Links for Authors

Browse

Links

Purdue University Libraries Open Access Publishing Fund

Interpretable Machine Learning Models for Hospital Readmission Prediction: A Two-step Extracted Regression Tree Approach

Authors

Recommended Citation

DOI

Date of this Version

Keywords

Abstract

Background

Methods

Results

Conclusions

Comments

Share

Search

Links

Links for Authors

Browse

Links