Identification of Plasma Proteins Associated with Alzheimer's Disease Using Feature Selection Techniques and Machine Learning Algorithms

Alzheimer’s disease (AD) is a chronic, progressive neurodegenerative disorder that typically affects elderly individuals. Detecting Alzheimer’s using plasma proteins is a critical step toward improving treatment results for this disease. This study aims to use computational algorithms to explore th...

Full description

Saved in:
Bibliographic Details
Main Authors: Zakaria Mokadem, Mohamed Djerioui, Bilal Attallah, Youcef Brik
Format: Article
Language:English
Published: IMS Vogosca 2025-02-01
Series:Science, Engineering and Technology
Subjects:
Online Access:https://www.setjournal.com/SET/article/view/189
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1823865126695993344
author Zakaria Mokadem
Mohamed Djerioui
Bilal Attallah
Youcef Brik
author_facet Zakaria Mokadem
Mohamed Djerioui
Bilal Attallah
Youcef Brik
author_sort Zakaria Mokadem
collection DOAJ
description Alzheimer’s disease (AD) is a chronic, progressive neurodegenerative disorder that typically affects elderly individuals. Detecting Alzheimer’s using plasma proteins is a critical step toward improving treatment results for this disease. This study aims to use computational algorithms to explore the relationship between plasma proteins and AD progression by identifying a panel of plasma proteins that can serve as biomarkers for tracking and diagnosing AD. We applied two feature selection methods, Sequential Backward Feature Selection (SBFS) and Analysis of Variance (ANOVA) to extract significant proteins from a dataset of 146  proteins. The data was collected from the plasma of 566 individuals, comprising both Alzheimer’s patients and healthy controls. The SBFS technique generated all possible combinations of protein groups from the 146 proteins, which were then trained and tested using five machine learning models: Decision Tree, Random Forest, Extremely Randomized Trees, Extreme Gradient Boosting, and Adaptive Boosting. Subsequently, ANOVA was applied to refine and reduce the selected panel size. Finally, we used XGBoost and AdaBoost models to validate the final panel. The findings introduce a plasma protein panel consisting of A2Macro, BNP, BTC, PPP, and PYY proteins for diagnosing AD. This panel achieved a sensitivity of 88.88%, a specificity of 66.66%, and an AUC of 0.85. These results demonstrate that plasma protein biomarkers can facilitate timely interventions, potentially slowing disease progression and improving patient outcomes. This non-invasive and affordable diagnostic method has the potential to make Alzheimer’s screening accessible to a broader population.
format Article
id doaj-art-a8cf233653fd4e858ac70bbd6d0c368f
institution Kabale University
issn 2831-1043
2744-2527
language English
publishDate 2025-02-01
publisher IMS Vogosca
record_format Article
series Science, Engineering and Technology
spelling doaj-art-a8cf233653fd4e858ac70bbd6d0c368f2025-02-08T11:45:04ZengIMS VogoscaScience, Engineering and Technology2831-10432744-25272025-02-015110.54327/set2025/v5.i1.189Identification of Plasma Proteins Associated with Alzheimer's Disease Using Feature Selection Techniques and Machine Learning AlgorithmsZakaria Mokadem0Mohamed Djerioui1Bilal Attallah2Youcef Brik3LASS Laboratory, Faculty of Technology, University of M’sila, University Pole Road Bordj Bou Arreridj, M’sila 28000, Algeria.LASS Laboratory, Faculty of Technology, University of M’sila, University Pole Road Bordj Bou Arreridj, M’sila 28000, Algeria.LASS Laboratory, Faculty of Technology, University of M’sila, University Pole Road Bordj Bou Arreridj, M’sila 28000, Algeria.LASS Laboratory, Faculty of Technology, University of M’sila, University Pole Road Bordj Bou Arreridj, M’sila 28000, Algeria. Alzheimer’s disease (AD) is a chronic, progressive neurodegenerative disorder that typically affects elderly individuals. Detecting Alzheimer’s using plasma proteins is a critical step toward improving treatment results for this disease. This study aims to use computational algorithms to explore the relationship between plasma proteins and AD progression by identifying a panel of plasma proteins that can serve as biomarkers for tracking and diagnosing AD. We applied two feature selection methods, Sequential Backward Feature Selection (SBFS) and Analysis of Variance (ANOVA) to extract significant proteins from a dataset of 146  proteins. The data was collected from the plasma of 566 individuals, comprising both Alzheimer’s patients and healthy controls. The SBFS technique generated all possible combinations of protein groups from the 146 proteins, which were then trained and tested using five machine learning models: Decision Tree, Random Forest, Extremely Randomized Trees, Extreme Gradient Boosting, and Adaptive Boosting. Subsequently, ANOVA was applied to refine and reduce the selected panel size. Finally, we used XGBoost and AdaBoost models to validate the final panel. The findings introduce a plasma protein panel consisting of A2Macro, BNP, BTC, PPP, and PYY proteins for diagnosing AD. This panel achieved a sensitivity of 88.88%, a specificity of 66.66%, and an AUC of 0.85. These results demonstrate that plasma protein biomarkers can facilitate timely interventions, potentially slowing disease progression and improving patient outcomes. This non-invasive and affordable diagnostic method has the potential to make Alzheimer’s screening accessible to a broader population. https://www.setjournal.com/SET/article/view/189Alzheimer’s disease ANOVAblood biomarkerFeature selectionMachine learningPlasma proteins
spellingShingle Zakaria Mokadem
Mohamed Djerioui
Bilal Attallah
Youcef Brik
Identification of Plasma Proteins Associated with Alzheimer's Disease Using Feature Selection Techniques and Machine Learning Algorithms
Science, Engineering and Technology
Alzheimer’s disease
ANOVA
blood biomarker
Feature selection
Machine learning
Plasma proteins
title Identification of Plasma Proteins Associated with Alzheimer's Disease Using Feature Selection Techniques and Machine Learning Algorithms
title_full Identification of Plasma Proteins Associated with Alzheimer's Disease Using Feature Selection Techniques and Machine Learning Algorithms
title_fullStr Identification of Plasma Proteins Associated with Alzheimer's Disease Using Feature Selection Techniques and Machine Learning Algorithms
title_full_unstemmed Identification of Plasma Proteins Associated with Alzheimer's Disease Using Feature Selection Techniques and Machine Learning Algorithms
title_short Identification of Plasma Proteins Associated with Alzheimer's Disease Using Feature Selection Techniques and Machine Learning Algorithms
title_sort identification of plasma proteins associated with alzheimer s disease using feature selection techniques and machine learning algorithms
topic Alzheimer’s disease
ANOVA
blood biomarker
Feature selection
Machine learning
Plasma proteins
url https://www.setjournal.com/SET/article/view/189
work_keys_str_mv AT zakariamokadem identificationofplasmaproteinsassociatedwithalzheimersdiseaseusingfeatureselectiontechniquesandmachinelearningalgorithms
AT mohameddjerioui identificationofplasmaproteinsassociatedwithalzheimersdiseaseusingfeatureselectiontechniquesandmachinelearningalgorithms
AT bilalattallah identificationofplasmaproteinsassociatedwithalzheimersdiseaseusingfeatureselectiontechniquesandmachinelearningalgorithms
AT youcefbrik identificationofplasmaproteinsassociatedwithalzheimersdiseaseusingfeatureselectiontechniquesandmachinelearningalgorithms