A hybrid machine learning model for intrusion detection in wireless sensor networks leveraging data balancing and dimensionality reduction

Abstract Intrusion detection systems are essential for securing wireless sensor networks (WSNs) and Internet of Things (IoT) environments against various threats. This study presents a novel hybrid machine learning (ML) model that integrates KMeans-SMOTE (KMS) for data balancing and principal compon...

Full description

Saved in:
Bibliographic Details
Main Authors: Md. Alamin Talukder, Majdi Khalid, Nasrin Sultana
Format: Article
Language:English
Published: Nature Portfolio 2025-02-01
Series:Scientific Reports
Subjects:
Online Access:https://doi.org/10.1038/s41598-025-87028-1
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1823862323615367168
author Md. Alamin Talukder
Majdi Khalid
Nasrin Sultana
author_facet Md. Alamin Talukder
Majdi Khalid
Nasrin Sultana
author_sort Md. Alamin Talukder
collection DOAJ
description Abstract Intrusion detection systems are essential for securing wireless sensor networks (WSNs) and Internet of Things (IoT) environments against various threats. This study presents a novel hybrid machine learning (ML) model that integrates KMeans-SMOTE (KMS) for data balancing and principal component analysis (PCA) for dimensionality reduction, evaluated using the WSN-DS and TON-IoT datasets. The model employs classifiers such as Decision Tree Classifier, Random Forest Classifier (RFC), and gradient boosting techniques like XGBoost (XGBC) to enhance detection accuracy and efficiency. The proposed hybrid (KMS + PCA + RFC) approach achieves remarkable performance, with an accuracy of 99.94% and an f1-score of 99.94% on the WSN-DS dataset. For the TON-IoT dataset, it achieves 99.97% accuracy and an f1-score of 99.97%, outperforming traditional SMOTE TomekLink and Generative Adversarial Network-based data balancing techniques. This hybrid approach addresses class imbalance and high-dimensionality challenges, providing scalable and robust intrusion detection. Complexity analysis reveals that the proposed model reduces training and prediction times, making it suitable for real-time applications.
format Article
id doaj-art-9b648893af954bcbbc42d7089fefe3b2
institution Kabale University
issn 2045-2322
language English
publishDate 2025-02-01
publisher Nature Portfolio
record_format Article
series Scientific Reports
spelling doaj-art-9b648893af954bcbbc42d7089fefe3b22025-02-09T12:34:12ZengNature PortfolioScientific Reports2045-23222025-02-0115112310.1038/s41598-025-87028-1A hybrid machine learning model for intrusion detection in wireless sensor networks leveraging data balancing and dimensionality reductionMd. Alamin Talukder0Majdi Khalid1Nasrin Sultana2Department of Computer Science and Engineering, International University of Business Agriculture and TechnologyDepartment of Computer Science and Artificial Intelligence, College of Computing, Umm Al-Qura UniversityDepartment of ICT, Future Technology, RMIT UniversityAbstract Intrusion detection systems are essential for securing wireless sensor networks (WSNs) and Internet of Things (IoT) environments against various threats. This study presents a novel hybrid machine learning (ML) model that integrates KMeans-SMOTE (KMS) for data balancing and principal component analysis (PCA) for dimensionality reduction, evaluated using the WSN-DS and TON-IoT datasets. The model employs classifiers such as Decision Tree Classifier, Random Forest Classifier (RFC), and gradient boosting techniques like XGBoost (XGBC) to enhance detection accuracy and efficiency. The proposed hybrid (KMS + PCA + RFC) approach achieves remarkable performance, with an accuracy of 99.94% and an f1-score of 99.94% on the WSN-DS dataset. For the TON-IoT dataset, it achieves 99.97% accuracy and an f1-score of 99.97%, outperforming traditional SMOTE TomekLink and Generative Adversarial Network-based data balancing techniques. This hybrid approach addresses class imbalance and high-dimensionality challenges, providing scalable and robust intrusion detection. Complexity analysis reveals that the proposed model reduces training and prediction times, making it suitable for real-time applications.https://doi.org/10.1038/s41598-025-87028-1Intrusion detection systemsWireless sensor networksInternet of ThingsHybrid machine learningModelDimensionality reduction
spellingShingle Md. Alamin Talukder
Majdi Khalid
Nasrin Sultana
A hybrid machine learning model for intrusion detection in wireless sensor networks leveraging data balancing and dimensionality reduction
Scientific Reports
Intrusion detection systems
Wireless sensor networks
Internet of Things
Hybrid machine learning
Model
Dimensionality reduction
title A hybrid machine learning model for intrusion detection in wireless sensor networks leveraging data balancing and dimensionality reduction
title_full A hybrid machine learning model for intrusion detection in wireless sensor networks leveraging data balancing and dimensionality reduction
title_fullStr A hybrid machine learning model for intrusion detection in wireless sensor networks leveraging data balancing and dimensionality reduction
title_full_unstemmed A hybrid machine learning model for intrusion detection in wireless sensor networks leveraging data balancing and dimensionality reduction
title_short A hybrid machine learning model for intrusion detection in wireless sensor networks leveraging data balancing and dimensionality reduction
title_sort hybrid machine learning model for intrusion detection in wireless sensor networks leveraging data balancing and dimensionality reduction
topic Intrusion detection systems
Wireless sensor networks
Internet of Things
Hybrid machine learning
Model
Dimensionality reduction
url https://doi.org/10.1038/s41598-025-87028-1
work_keys_str_mv AT mdalamintalukder ahybridmachinelearningmodelforintrusiondetectioninwirelesssensornetworksleveragingdatabalancinganddimensionalityreduction
AT majdikhalid ahybridmachinelearningmodelforintrusiondetectioninwirelesssensornetworksleveragingdatabalancinganddimensionalityreduction
AT nasrinsultana ahybridmachinelearningmodelforintrusiondetectioninwirelesssensornetworksleveragingdatabalancinganddimensionalityreduction
AT mdalamintalukder hybridmachinelearningmodelforintrusiondetectioninwirelesssensornetworksleveragingdatabalancinganddimensionalityreduction
AT majdikhalid hybridmachinelearningmodelforintrusiondetectioninwirelesssensornetworksleveragingdatabalancinganddimensionalityreduction
AT nasrinsultana hybridmachinelearningmodelforintrusiondetectioninwirelesssensornetworksleveragingdatabalancinganddimensionalityreduction