Abstract
In the intricate landscape of healthcare analytics, effective feature selection is a prerequisite for generating robust predictive models, especially given the common challenges of sample sizes and potential biases. Zoish uniquely addresses these issues by employing Shapley additive values - an idea rooted in cooperative game theory - to enable both transparent and automated feature selection. Unlike existing tools, Zoish is versatile, designed to seamlessly integrate with an array of machine learning libraries including scikit-learn, XGBoost, CatBoost, and imbalanced-learn. The distinct advantage of Zoish lies in its dual algorithmic approach for calculating Shapley values, allowing it to efficiently manage both large and small datasets. This adaptability renders it exceptionally suitable for a wide spectrum of healthcare-related tasks. The tool also places a strong emphasis on interpretability, providing comprehensive visualizations for analyzed features. Its customizable settings offer users fine-grained control over feature selection, thus optimizing for specific predictive objectives. This manuscript elucidates the mathematical framework underpinning Zoish and how it uniquely combines local and global feature selection into a single, streamlined process. To validate Zoishs efficiency and adaptability, we present case studies in breast cancer prediction and Montreal Cognitive Assessment (MoCA) prediction in Parkinsons disease, along with evaluations on 300 synthetic datasets. These applications underscore Zoishs unparalleled performance in diverse healthcare contexts and against its counterparts.
| Original language | English |
|---|---|
| Title of host publication | Pacific Symposium on Biocomputing 2024, PSB 2024 |
| Editors | Russ B. Altman, Lawrence Hunter, Marylyn D. Ritchie, Tiffany Murray, Teri E. Klein |
| Publisher | World Scientific |
| Pages | 81-95 |
| Number of pages | 15 |
| Edition | 2024 |
| ISBN (Electronic) | 9789811286414 |
| DOIs | |
| State | Published - 1 Jan 2024 |
| Externally published | Yes |
| Event | 29th Pacific Symposium on Biocomputing, PSB 2024 - Kohala Coast, United States Duration: 3 Jan 2024 → 7 Jan 2024 |
Conference
| Conference | 29th Pacific Symposium on Biocomputing, PSB 2024 |
|---|---|
| Country/Territory | United States |
| City | Kohala Coast |
| Period | 3/01/24 → 7/01/24 |
Keywords
- Feature Selectors
- SHapley Additive exPlanations
- Zoish
ASJC Scopus subject areas
- Biomedical Engineering
- Computational Theory and Mathematics