TY - GEN
T1 - Using Recursive Feature Elimination Feature Selection based Machine Learning Classifier for Attack Classification on UNSW-NB 15 dataset
AU - Albasheer Mohamed, Fawzia Omer
AU - Agarwal, Mayank
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024/1/1
Y1 - 2024/1/1
N2 - In modern computer networks, intrusion detection systems (IDS) play a crucial role by detecting and notifying administrators about malicious activities. Ensuring network security and identifying unauthorized access to network resources is paramount in constructing an effective IDS. Traditionally, intrusion detection research relied heavily on the KDDCUP99 dataset, but its limitations in evaluating network intrusion detection systems (NIDS) compared to the UNSW-NB15 dataset have become apparent. This article introduces a novel approach that incorporates feature selection techniques to enhance the performance of three classification methods: K-Nearest Neighbors (KNN), Decision Trees (DT), and Logistic Regression (LR). By utilizing the recursive feature elimination (RFE) technique, irrelevant features are eliminated, allowing the model to focus on the most discriminative attributes for intrusion detection. This not only improves classification accuracy but also reduces computational complexity. The study compares the performance of KNN, DT, and LR algorithms using various metrics such as accuracy, precision, recall, and F1-score. The findings provide a comprehensive analysis of each algorithm's strengths and weaknesses, offering valuable guidance to practitioners and researchers when selecting appropriate algorithms and feature selection techniques for real-world intrusion detection in network environments.
AB - In modern computer networks, intrusion detection systems (IDS) play a crucial role by detecting and notifying administrators about malicious activities. Ensuring network security and identifying unauthorized access to network resources is paramount in constructing an effective IDS. Traditionally, intrusion detection research relied heavily on the KDDCUP99 dataset, but its limitations in evaluating network intrusion detection systems (NIDS) compared to the UNSW-NB15 dataset have become apparent. This article introduces a novel approach that incorporates feature selection techniques to enhance the performance of three classification methods: K-Nearest Neighbors (KNN), Decision Trees (DT), and Logistic Regression (LR). By utilizing the recursive feature elimination (RFE) technique, irrelevant features are eliminated, allowing the model to focus on the most discriminative attributes for intrusion detection. This not only improves classification accuracy but also reduces computational complexity. The study compares the performance of KNN, DT, and LR algorithms using various metrics such as accuracy, precision, recall, and F1-score. The findings provide a comprehensive analysis of each algorithm's strengths and weaknesses, offering valuable guidance to practitioners and researchers when selecting appropriate algorithms and feature selection techniques for real-world intrusion detection in network environments.
KW - DT
KW - IDS
KW - KNN
KW - LR
KW - Machine Learning
KW - Recursive feature selection
KW - UNSW-NB15 dataset
UR - http://www.scopus.com/inward/record.url?scp=85196830967&partnerID=8YFLogxK
U2 - 10.1109/I2CT61223.2024.10544076
DO - 10.1109/I2CT61223.2024.10544076
M3 - Conference contribution
AN - SCOPUS:85196830967
T3 - 2024 IEEE 9th International Conference for Convergence in Technology, I2CT 2024
BT - 2024 IEEE 9th International Conference for Convergence in Technology, I2CT 2024
PB - Institute of Electrical and Electronics Engineers
T2 - 9th IEEE International Conference for Convergence in Technology, I2CT 2024
Y2 - 5 April 2024 through 7 April 2024
ER -