TY - GEN
T1 - An Interpretable Model for Predicting Acute Myocardial Infarction in Distinct Patient Profiles
AU - Onoja, Anthony
AU - Zahid, Abdullah
AU - Elomaa, Kris
AU - Geifman, Nophar
N1 - Publisher Copyright:
© 2025 The Authors.
PY - 2025/5/15
Y1 - 2025/5/15
N2 - Introduction: Acute myocardial infarction (AMI) is highly prevalent (3.8% in developed countries), affecting heterogenous populations, and can be influenced by varied factors, including demographics, clinical risk factors, and comorbidities. Identifying distinct AMI patient profiles can aid in understanding the disease and developing personalised treatment strategies. Methods: This study analysed data from UK Biobank participants with an AMI diagnosis. Using unsupervised clustering techniques - UMAP, latent profile analysis, and K-means clustering - distinct and robust patient profiles were identified and associated with co-morbidity prevalence. Next, we trained three supervised machine learning classifiers (Logistic Regression, Random Forest, and XGBoost) to predict profile membership from 28 biochemistry markers. SHAP values were used for post-hoc interpretation of the best-performing model. Finding: Four distinct patient profiles were identified: “CMR-GIRespRenal”, “AG-CMS”, “CM-MultiCardio”, and “PostMeno-CMSurgGI”. Each profile showed unique characteristics in socio-demographics, clinical risk factors (e.g., BMI, age, smoking, alcohol intake, waist and hip circumference) and disease prevalence. The Random Forest classifier outperformed all others, achieving an average weighted AUROC score of 78%. SHAP analysis highlighted key biochemical markers, such as Testosterone, Creatinine, Vitamin D, Urate, and lipid profile markers, as significant predictors of AMI profiles. Conclusion: This study underscores the heterogeneity of AMI patients and the importance of integrating patient profiles with biochemical markers for improved stratification in diagnosis and treatment. These identified profiles can guide personalised treatment strategies, tailoring interventions to the specific needs of each group. Understanding these profiles may also lead to novel therapeutic targets.
AB - Introduction: Acute myocardial infarction (AMI) is highly prevalent (3.8% in developed countries), affecting heterogenous populations, and can be influenced by varied factors, including demographics, clinical risk factors, and comorbidities. Identifying distinct AMI patient profiles can aid in understanding the disease and developing personalised treatment strategies. Methods: This study analysed data from UK Biobank participants with an AMI diagnosis. Using unsupervised clustering techniques - UMAP, latent profile analysis, and K-means clustering - distinct and robust patient profiles were identified and associated with co-morbidity prevalence. Next, we trained three supervised machine learning classifiers (Logistic Regression, Random Forest, and XGBoost) to predict profile membership from 28 biochemistry markers. SHAP values were used for post-hoc interpretation of the best-performing model. Finding: Four distinct patient profiles were identified: “CMR-GIRespRenal”, “AG-CMS”, “CM-MultiCardio”, and “PostMeno-CMSurgGI”. Each profile showed unique characteristics in socio-demographics, clinical risk factors (e.g., BMI, age, smoking, alcohol intake, waist and hip circumference) and disease prevalence. The Random Forest classifier outperformed all others, achieving an average weighted AUROC score of 78%. SHAP analysis highlighted key biochemical markers, such as Testosterone, Creatinine, Vitamin D, Urate, and lipid profile markers, as significant predictors of AMI profiles. Conclusion: This study underscores the heterogeneity of AMI patients and the importance of integrating patient profiles with biochemical markers for improved stratification in diagnosis and treatment. These identified profiles can guide personalised treatment strategies, tailoring interventions to the specific needs of each group. Understanding these profiles may also lead to novel therapeutic targets.
KW - Acute Myocardial Infarction
KW - Blood markers
KW - Clustering
KW - Disease Prevalence
KW - Machine Learning
UR - https://www.scopus.com/pages/publications/105005816779
U2 - 10.3233/SHTI250378
DO - 10.3233/SHTI250378
M3 - Conference contribution
C2 - 40380488
AN - SCOPUS:105005816779
T3 - Studies in Health Technology and Informatics
SP - 452
EP - 456
BT - Intelligent Health Systems - From Technology to Data and Knowledge, Proceedings of MIE 2025
A2 - Andrikopoulou, Elisavet
A2 - Gallos, Parisis
A2 - Arvanitis, Theodoros N.
A2 - Austin, Rosalynn
A2 - Benis, Arriel
A2 - Cornet, Ronald
A2 - Chatzistergos, Panagiotis
A2 - Dejaco, Alexander
A2 - Dusseljee-Peute, Linda
A2 - Mohasseb, Alaa
A2 - Natsiavas, Pantelis
A2 - Nakkas, Haythem
A2 - Scott, Philip
PB - IOS Press BV
T2 - 35th Medical Informatics Europe Conference, MIE 2025
Y2 - 19 May 2025 through 21 May 2025
ER -