Risk factor analysis for stunting incidence using sparse categorical principal component logistic regression

The risk factors for stunting incidence involve categorical data in both the response and predictor variables. Therefore, we developed a sparse categorical principal component logistic regression model capable of handling data with multicollinearity. The parameters of the sparse categorical principa...

Full description

Saved in:
Bibliographic Details
Main Authors: Anna Islamiyati, Muhammad Nur, Abdul Salam, Wan Zuki Azman Wan Muhamad, Dwi Auliyah
Format: Article
Language:English
Published: Elsevier 2025-06-01
Series:MethodsX
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S2215016125000342
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1825206841676660736
author Anna Islamiyati
Muhammad Nur
Abdul Salam
Wan Zuki Azman Wan Muhamad
Dwi Auliyah
author_facet Anna Islamiyati
Muhammad Nur
Abdul Salam
Wan Zuki Azman Wan Muhamad
Dwi Auliyah
author_sort Anna Islamiyati
collection DOAJ
description The risk factors for stunting incidence involve categorical data in both the response and predictor variables. Therefore, we developed a sparse categorical principal component logistic regression model capable of handling data with multicollinearity. The parameters of the sparse categorical principal component logistic regression model were estimated using the maximum likelihood method and the Newton-Raphson iterative approach. The analysis yielded a likelihood ratio value of 144.81 and a chi-square statistic value of 11.07, indicating that all factors included in the model are statistically significant. The results highlight that medical history, inadequate complementary feeding, formula feeding, lack of complementary feeding programs, and lack of iron supplementation for mothers are highly associated with the risk of stunting in toddlers. This emphasizes the need for attention to maternal nutrition from pregnancy through breastfeeding, as well as the nutrition of the toddler. Some important points proposed in this method are: • Stunting data consists of categorical variables containing multicollinearity. • The method applied is sparse logistic regression combined with categorical principal component analysis. • Analysis of risk factors for stunting in toddlers is based on the child's own condition, as well as parental factors, namely age, education, and intake of additional food and supplementary tablets during pregnancy.
format Article
id doaj-art-02616e66b8ec4880831ec95e0bb6dd93
institution Kabale University
issn 2215-0161
language English
publishDate 2025-06-01
publisher Elsevier
record_format Article
series MethodsX
spelling doaj-art-02616e66b8ec4880831ec95e0bb6dd932025-02-07T04:47:41ZengElsevierMethodsX2215-01612025-06-0114103186Risk factor analysis for stunting incidence using sparse categorical principal component logistic regressionAnna Islamiyati0Muhammad Nur1Abdul Salam2Wan Zuki Azman Wan Muhamad3Dwi Auliyah4Department of Statistics, Faculty of Mathematical and Natural Sciences, Hasanuddin University, Makassar 90245, Indonesia; Corresponding author.Department of Mathematics, Faculty of Mathematical and Natural Sciences, Hasanuddin University, Makassar 90245, IndonesiaDepartment of Nutrition, Faculty of Public Health, Hasanuddin University, Makassar 90245, IndonesiaInstitute of Engineering Mathematics, Universiti Malaysia Perlis, Arau 02600, MalaysiaDepartment of Statistics, Faculty of Mathematical and Natural Sciences, Hasanuddin University, Makassar 90245, IndonesiaThe risk factors for stunting incidence involve categorical data in both the response and predictor variables. Therefore, we developed a sparse categorical principal component logistic regression model capable of handling data with multicollinearity. The parameters of the sparse categorical principal component logistic regression model were estimated using the maximum likelihood method and the Newton-Raphson iterative approach. The analysis yielded a likelihood ratio value of 144.81 and a chi-square statistic value of 11.07, indicating that all factors included in the model are statistically significant. The results highlight that medical history, inadequate complementary feeding, formula feeding, lack of complementary feeding programs, and lack of iron supplementation for mothers are highly associated with the risk of stunting in toddlers. This emphasizes the need for attention to maternal nutrition from pregnancy through breastfeeding, as well as the nutrition of the toddler. Some important points proposed in this method are: • Stunting data consists of categorical variables containing multicollinearity. • The method applied is sparse logistic regression combined with categorical principal component analysis. • Analysis of risk factors for stunting in toddlers is based on the child's own condition, as well as parental factors, namely age, education, and intake of additional food and supplementary tablets during pregnancy.http://www.sciencedirect.com/science/article/pii/S2215016125000342Sparse Categorical Principal Component Logistic Regression
spellingShingle Anna Islamiyati
Muhammad Nur
Abdul Salam
Wan Zuki Azman Wan Muhamad
Dwi Auliyah
Risk factor analysis for stunting incidence using sparse categorical principal component logistic regression
MethodsX
Sparse Categorical Principal Component Logistic Regression
title Risk factor analysis for stunting incidence using sparse categorical principal component logistic regression
title_full Risk factor analysis for stunting incidence using sparse categorical principal component logistic regression
title_fullStr Risk factor analysis for stunting incidence using sparse categorical principal component logistic regression
title_full_unstemmed Risk factor analysis for stunting incidence using sparse categorical principal component logistic regression
title_short Risk factor analysis for stunting incidence using sparse categorical principal component logistic regression
title_sort risk factor analysis for stunting incidence using sparse categorical principal component logistic regression
topic Sparse Categorical Principal Component Logistic Regression
url http://www.sciencedirect.com/science/article/pii/S2215016125000342
work_keys_str_mv AT annaislamiyati riskfactoranalysisforstuntingincidenceusingsparsecategoricalprincipalcomponentlogisticregression
AT muhammadnur riskfactoranalysisforstuntingincidenceusingsparsecategoricalprincipalcomponentlogisticregression
AT abdulsalam riskfactoranalysisforstuntingincidenceusingsparsecategoricalprincipalcomponentlogisticregression
AT wanzukiazmanwanmuhamad riskfactoranalysisforstuntingincidenceusingsparsecategoricalprincipalcomponentlogisticregression
AT dwiauliyah riskfactoranalysisforstuntingincidenceusingsparsecategoricalprincipalcomponentlogisticregression