Home > Published Issues > 2025 > Volume 16, No. 3, 2025 >
JAIT 2025 Vol.16(3): 283-302
doi: 10.12720/jait.16.3.283-302

Optimizing Intrusion Detection: Advanced Feature Selection and Machine Learning Techniques Using the CSE-CIC-IDS2018 Dataset

Qusay M. Alzubi 1,*, Sharif Naser Makhadmeh 2, and Yousef Sanjalawe 2
1. Department of Information Technology (Cybersecurity Program), Information Technology and Computer Science Faculty, Yarmouk University, Irbid, Jordan
2. Department of Information Technology, King Abdullah II School for Information Technology, University of Jordan (UJ), Amman, Jordan
Email: qusayz@yu.edu.jo (Q.M.A.); s_makhadmeh@ju.edu.jo (S.N.M.); y.sanjalawe@ju.edu.jo (Y.S.)
*Corresponding author

Manuscript received October 1, 2024; revised October 26, 2024; accepted December 2, 2024; published March 6, 2025.

Abstract—The escalation of cyber threats in large-scale local area networks necessitate advanced strategies for efficient anomaly detection and intrusion prevention. This paper explores the integration of sophisticated machine learning techniques and feature selection methods to enhance the performance of Network Intrusion Detection Systems. Focusing on the complex landscape of cyber threats, accentuated by the proliferation of technologies such as Internet of Things, 5G, and cloud computing, the proposed study evaluates the application of three advanced feature selection algorithms—Grey Wolf Optimizer, Bat Algorithm, and Pigeon-inspired Optimization—to identify an optimal subset of features that accurately differentiate between diverse cyberattacks and normal network traffic. Employing the comprehensive CSE-CIC-IDS2018 dataset, the experimental results demonstrate that the feature set was successfully reduced from 80 to subsets of 10, 6, and 7 features while maintaining a high detection accuracy close to 99%. This reduction in feature space significantly decreases computational overhead without compromising detection capability. This research contributes to the cybersecurity domain by presenting a scalable, efficient, and highly accurate model for intrusion detection, setting a foundation for future advancements in Network Intrusion Detection Systems optimization and the broader field of cyber defense mechanisms.
 
Keywords—feature selection algorithm, network intrusion detection systems, machine learning, cybersecurity, anomaly detection, cyber threats

Cite: Qusay M. Alzubi, Sharif Naser Makhadmeh, and Yousef Sanjalawe, "Optimizing Intrusion Detection: Advanced Feature Selection and Machine Learning Techniques Using the CSE-CIC-IDS2018 Dataset," Journal of Advances in Information Technology, Vol. 16, No. 3, pp. 283-302, 2025. doi: 10.12720/jait.16.3.283-302

Copyright © 2025 by the authors. This is an open access article distributed under the Creative Commons Attribution License which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited (CC BY 4.0).

Article Metrics in Dimensions