Abstract
Amyotrophic lateral sclerosis (ALS) is a devastating and incurable disease affecting motor neurons, leading to progressive paralysis and death, on average within three to five years of onset. The disease is characterized by highly variable patterns and rates of progression, which make it challenging to develop reliable and accurate ALS disease state prediction models for daily use in clinics with little data. To meet these challenges, we suggest domain adaptation from a large, but unfortunately biased, clinical trials database to that of a tertiary care ALS clinic. To evaluate the reliability and accuracy of the suggested paradigm, we compare a naïve approach, in which training is based only on the clinical trials data, with a domain adaptation approach, in which initial training on the same data is followed by fine-tuning on the clinic data. We also summarize the clinical longitudinal data to evaluate non-temporal models, e.g., random forest (RF), XGBoost (XGB), and multilayer perceptron (MLP), which only partially exploit the dynamic information hidden in patient clinical records, in comparison with the long short-term memory (LSTM) recurrent neural network, which fully exploits the temporal information in the data. First, we observe that XGB outperforms the RF and MLP in terms of ALS disease state prediction error and, surprisingly, also the LSTM, regardless of prediction time (up to 24 months ahead). We attribute the inferiority of the highly parametrized neural network to the curse of dimensionality. Second, we show that this error does not significantly increase when the model is trained using only the clinical trials data, especially for the LSTM at long prediction times. Finally, we demonstrate that fine-tuning the clinical trials-based pre-trained model using the clinic data improves the LSTM and MLP performance compared to using solely the clinical trials or clinic data.
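The contrast between the naïve approach and the fine-tuning approach described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: a one-feature linear model trained by gradient descent stands in for the LSTM and MLP, the data are synthetic, and every name (`fit`, `mse`, `src_x`, `tgt_train_x`, and so on) is hypothetical.

```python
# Illustrative only: a one-feature linear model stands in for the paper's
# LSTM/MLP, and all data below are synthetic, not from any ALS dataset.

def fit(xs, ys, w=0.0, b=0.0, lr=0.05, epochs=300):
    """Full-batch gradient descent on mean squared error for y = w*x + b."""
    n = len(xs)
    for _ in range(epochs):
        errs = [w * x + b - y for x, y in zip(xs, ys)]
        grad_w = 2.0 * sum(e * x for e, x in zip(errs, xs)) / n
        grad_b = 2.0 * sum(errs) / n
        w -= lr * grad_w
        b -= lr * grad_b
    return w, b

def mse(xs, ys, w, b):
    """Mean squared error of the line y = w*x + b on the given points."""
    return sum((w * x + b - y) ** 2 for x, y in zip(xs, ys)) / len(xs)

# Large "source" set (stand-in for clinical trials data) with a biased trend.
src_x = [i / 10 for i in range(50)]
src_y = [2.0 * x + 1.0 for x in src_x]

# Small "target" set (stand-in for clinic data) following a different trend.
tgt_train_x = [0.5, 1.5, 2.5, 3.5, 4.5]
tgt_train_y = [1.5 * x + 2.0 for x in tgt_train_x]
tgt_test_x = [1.0, 2.0, 3.0, 4.0]
tgt_test_y = [1.5 * x + 2.0 for x in tgt_test_x]

# Naive approach: train on the source data only, then evaluate on the target.
w_src, b_src = fit(src_x, src_y, epochs=2000)
naive_mse = mse(tgt_test_x, tgt_test_y, w_src, b_src)

# Domain adaptation: start from the source weights and fine-tune on the target.
w_ft, b_ft = fit(tgt_train_x, tgt_train_y, w=w_src, b=b_src, epochs=300)
finetuned_mse = mse(tgt_test_x, tgt_test_y, w_ft, b_ft)

print(f"naive: {naive_mse:.4f}  fine-tuned: {finetuned_mse:.4f}")
```

In this toy setting the fine-tuned model recovers the target trend because the synthetic target data are noiseless; with real, noisy clinic data the size of the gain would depend on how much target data is available and how far the two domains differ.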
Original language | English |
---|---|
Title of host publication | Proceedings - 19th IEEE International Conference on Machine Learning and Applications, ICMLA 2020 |
Editors | M. Arif Wani, Feng Luo, Xiaolin Li, Dejing Dou, Francesco Bonchi |
Publisher | Institute of Electrical and Electronics Engineers |
Pages | 539-544 |
Number of pages | 6 |
ISBN (Electronic) | 9781728184708 |
State | Published - 1 Dec 2020 |
Event | 19th IEEE International Conference on Machine Learning and Applications, ICMLA 2020 - Virtual, Miami, United States. Duration: 14 Dec 2020 → 17 Dec 2020 |
Conference
Conference | 19th IEEE International Conference on Machine Learning and Applications, ICMLA 2020 |
---|---|
Country/Territory | United States |
City | Virtual, Miami |
Period | 14/12/20 → 17/12/20 |
Keywords
- Amyotrophic lateral sclerosis (ALS)
- LSTM
- clinical trials data
- disease-state prediction
- domain adaptation
ASJC Scopus subject areas
- Artificial Intelligence
- Computer Science Applications
- Computer Vision and Pattern Recognition
- Hardware and Architecture