TY - GEN
T1 - Impact of Balancing Techniques on NIDS Classification Performance
T2 - 2025 International Conference on Computing and Applied Informatics, ICCAI 2025
AU - Elabd, Ahmed
AU - Pahlevanzadeh, Bahareh
AU - Nadi, Farhad
N1 - Publisher Copyright:
© 2025 IEEE.
PY - 2025
Y1 - 2025
N2 - Network Intrusion Detection Systems (NIDS) are essential for cybersecurity, but their effectiveness is often compromised by imbalanced datasets. This imbalance stems from factors such as the rarity of certain attacks, data collection biases, and ethical constraints. This research investigates the impact of dataset balancing techniques, including undersampling, oversampling, and class weighting, on the performance of NIDS. Eight machine learning models (Logistic Regression, Linear Discriminant Analysis, Classification and Regression Tree, Random Forest, Naïve Bayes, XGBoost, Convolutional Neural Network, and Long Short-Term Memory) were evaluated on two imbalanced datasets (CSE-CIC-IDS2017 and CSE-CIC-IDS2018). The study assessed the impact of balancing techniques on detection accuracy and false positive rates. The findings demonstrate the strengths and limitations of these techniques in addressing data imbalance and improving NIDS performance, thereby contributing to more robust cybersecurity defenses in dynamic threat environments.
AB - Network Intrusion Detection Systems (NIDS) are essential for cybersecurity, but their effectiveness is often compromised by imbalanced datasets. This imbalance stems from factors such as the rarity of certain attacks, data collection biases, and ethical constraints. This research investigates the impact of dataset balancing techniques, including undersampling, oversampling, and class weighting, on the performance of NIDS. Eight machine learning models (Logistic Regression, Linear Discriminant Analysis, Classification and Regression Tree, Random Forest, Naïve Bayes, XGBoost, Convolutional Neural Network, and Long Short-Term Memory) were evaluated on two imbalanced datasets (CSE-CIC-IDS2017 and CSE-CIC-IDS2018). The study assessed the impact of balancing techniques on detection accuracy and false positive rates. The findings demonstrate the strengths and limitations of these techniques in addressing data imbalance and improving NIDS performance, thereby contributing to more robust cybersecurity defenses in dynamic threat environments.
KW - cybersecurity
KW - Imbalanced dataset
KW - Network Intrusion Detection Systems
UR - https://www.scopus.com/pages/publications/105033565122
U2 - 10.1109/ICCAI65301.2025.11279325
DO - 10.1109/ICCAI65301.2025.11279325
M3 - Conference contribution
AN - SCOPUS:105033565122
T3 - 2025 International Conference on Computing and Applied Informatics, ICCAI 2025
BT - 2025 International Conference on Computing and Applied Informatics, ICCAI 2025
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 16 September 2025 through 17 September 2025
ER -