TY - GEN
T1 - ACKEM
T2 - Future of Information and Communication Conference, FICC 2021
AU - Zagagy, Ben
AU - Herman, Maya
AU - Levi, Ofer
N1 - Publisher Copyright:
© 2021, The Author(s), under exclusive license to Springer Nature Switzerland AG.
PY - 2021/1/1
Y1 - 2021/1/1
N2 - Over the last couple of years, Deep Learning (DL) methods for objects and features classification have been shown to overcome previous state-of-the-art classification techniques in multiple areas, such as image classification and speech recognition. In our previous paper MESRS – Model Ensemble Speech Recognition System, we have described a unique speech recognition system for automatic classification of voice commands. The work described in this paper, presents a novel method for classification that continues our previous work by extending the system-supported input to the image space, not just the audio space. Aside from supporting multiple input types, this paper also describes an automated method of models ensemble based on the K-Nearest Neighbors algorithm. The automatic method of ensemble selection was added in order to improve the system’s running times and achieve the highest possible accuracy results. The work in this paper shows that applying dynamic input-based classification over multiple architectures can significantly improve the final classification results. Since different models with different architectures could achieve different results on different inputs, the task of producing the best results could be achieved by selecting the best fitted model for the given input. This method was tested over multiple datasets including Chest X Ray Pneumonia Dataset, Malaria Cells Dataset, Road Potholes Dataset, and the Voice Commands Dataset which also served us in our previous work. This paper proves that our method works and has the ability to improve the classification quality on top of all of the above datasets. Our results were compared with previous results obtained by similar works on top of the above datasets and a significant improvement was shown for all of the tested datasets. These findings prove the effectiveness of our method and motivate us to develop it further, in order to achieve even better results in future work.
AB - Over the last couple of years, Deep Learning (DL) methods for objects and features classification have been shown to overcome previous state-of-the-art classification techniques in multiple areas, such as image classification and speech recognition. In our previous paper MESRS – Model Ensemble Speech Recognition System, we have described a unique speech recognition system for automatic classification of voice commands. The work described in this paper, presents a novel method for classification that continues our previous work by extending the system-supported input to the image space, not just the audio space. Aside from supporting multiple input types, this paper also describes an automated method of models ensemble based on the K-Nearest Neighbors algorithm. The automatic method of ensemble selection was added in order to improve the system’s running times and achieve the highest possible accuracy results. The work in this paper shows that applying dynamic input-based classification over multiple architectures can significantly improve the final classification results. Since different models with different architectures could achieve different results on different inputs, the task of producing the best results could be achieved by selecting the best fitted model for the given input. This method was tested over multiple datasets including Chest X Ray Pneumonia Dataset, Malaria Cells Dataset, Road Potholes Dataset, and the Voice Commands Dataset which also served us in our previous work. This paper proves that our method works and has the ability to improve the classification quality on top of all of the above datasets. Our results were compared with previous results obtained by similar works on top of the above datasets and a significant improvement was shown for all of the tested datasets. These findings prove the effectiveness of our method and motivate us to develop it further, in order to achieve even better results in future work.
KW - Data mining
KW - Deep Learning
KW - Ensemble classifier
KW - KNN
UR - http://www.scopus.com/inward/record.url?scp=85105909331&partnerID=8YFLogxK
U2 - 10.1007/978-3-030-73103-8_38
DO - 10.1007/978-3-030-73103-8_38
M3 - Conference contribution
AN - SCOPUS:85105909331
SN - 9783030731021
T3 - Advances in Intelligent Systems and Computing
SP - 536
EP - 557
BT - Advances in Information and Communication - Proceedings of the 2021 Future of Information and Communication Conference, FICC
A2 - Arai, Kohei
PB - Springer Science and Business Media Deutschland GmbH
Y2 - 29 April 2021 through 30 April 2021
ER -