TY - JOUR
T1 - Bringing big data analytics closer to practice
T2 - A methodological explanation and demonstration of classification algorithms
AU - Ben-Assuli, Ofir
AU - Heart, Tsipi
AU - Shlomo, Nir
AU - Klempfner, Robert
N1 - Funding Information:
This research was supported by a Grant from the GIF (# I-2499-201.2/2018 ), the German-Israeli Foundation for Scientific Research and Development.
Funding Information:
This research was supported by a Grant from the GIF (#I-2499-201.2/2018), the German-Israeli Foundation for Scientific Research and Development. This work has also benefited from discussions at WITS 2017 (27th Annual Workshop on Information Technologies and Systems, Seoul, Korea (December 14?15, 2017)). This research was supported by a Grant from the GIF (#I-2499-201.2/2018), the German-Israeli Foundation for Scientific Research and Development. None. 4904-18-SMC, IRB Committee, Sheba Medical Center Israel.
Funding Information:
This research was supported by a Grant from the GIF ( #I-2499-201.2/2018 ), the German-Israeli Foundation for Scientific Research and Development . This work has also benefited from discussions at WITS 2017 (27th Annual Workshop on Information Technologies and Systems, Seoul, Korea (December 14–15, 2017)).
Publisher Copyright:
© 2019 Fellowship of Postgraduate Medicine
PY - 2019/3/1
Y1 - 2019/3/1
N2 - Background: Big data analytics are becoming more prevalent due to the recent availability of health data. Yet in spite of evidence supporting the potential contribution of big data analytics to health policy makers and care providers, these tools are still too complex to be routinely used. Further, access to comprehensive datasets required for more accurate results is complex and costly. Consequently, big data analytics are mostly used by researchers and experts who are far removed from actual clinical practice. Hence, policy makers should allocate resources to encourage studies that clarify and simplify big data analytics so it can be used by non-experts (e.g., clinicians, practitioners and decision-makers who may not have advanced computer skills). It is also important to fund data collection and integration from various health IT, a pre-condition for any big data analytics project. Objectives: To methodologically clarify the rationale and logic behind several analytics algorithms to help non-expert users employ big data analytics by understanding how to implement relatively easy to use platforms as Azure ML. Methods: We demonstrate the predictive power of four known algorithms and compare their accuracy in predicting early mortality of Congestive Heart Failure (CHF) patients. Results: The results of our models outperform those reported in the literature, attesting to the strength of some of the models, and the utility of comprehensive data. Conclusions: The results support our call to policy makers to allocate resources to establishing comprehensive, integrated health IT systems, and to projects aimed at simplifying ML analytics.
AB - Background: Big data analytics are becoming more prevalent due to the recent availability of health data. Yet in spite of evidence supporting the potential contribution of big data analytics to health policy makers and care providers, these tools are still too complex to be routinely used. Further, access to comprehensive datasets required for more accurate results is complex and costly. Consequently, big data analytics are mostly used by researchers and experts who are far removed from actual clinical practice. Hence, policy makers should allocate resources to encourage studies that clarify and simplify big data analytics so it can be used by non-experts (e.g., clinicians, practitioners and decision-makers who may not have advanced computer skills). It is also important to fund data collection and integration from various health IT, a pre-condition for any big data analytics project. Objectives: To methodologically clarify the rationale and logic behind several analytics algorithms to help non-expert users employ big data analytics by understanding how to implement relatively easy to use platforms as Azure ML. Methods: We demonstrate the predictive power of four known algorithms and compare their accuracy in predicting early mortality of Congestive Heart Failure (CHF) patients. Results: The results of our models outperform those reported in the literature, attesting to the strength of some of the models, and the utility of comprehensive data. Conclusions: The results support our call to policy makers to allocate resources to establishing comprehensive, integrated health IT systems, and to projects aimed at simplifying ML analytics.
KW - Boosted decision tree
KW - Congestive heart failure
KW - Logistic regression
KW - Machine learning
KW - Neural network
KW - Support vector machine
UR - http://www.scopus.com/inward/record.url?scp=85060136119&partnerID=8YFLogxK
U2 - 10.1016/j.hlpt.2018.12.003
DO - 10.1016/j.hlpt.2018.12.003
M3 - Article
AN - SCOPUS:85060136119
SN - 2211-8837
VL - 8
SP - 7
EP - 13
JO - Health Policy and Technology
JF - Health Policy and Technology
IS - 1
ER -