Optimization of multiple sampling for solving network boundary specification problem

Abstract Missing data caused by boundary specification has a detrimental effect on the analysis of network structures, and designing optimal sampling methods is crucial for conducting network investigations. The present study discusses the boundary specification problem in multiple surveys, and prop...

Full description

Saved in:
Bibliographic Details
Main Author: Ruochen Zhang
Format: Article
Language:English
Published: Nature Portfolio 2025-02-01
Series:Scientific Reports
Subjects:
Online Access:https://doi.org/10.1038/s41598-025-87760-8
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1823862240169689088
author Ruochen Zhang
author_facet Ruochen Zhang
author_sort Ruochen Zhang
collection DOAJ
description Abstract Missing data caused by boundary specification has a detrimental effect on the analysis of network structures, and designing optimal sampling methods is crucial for conducting network investigations. The present study discusses the boundary specification problem in multiple surveys, and proposes a mathematical model for optimizing the sampling strategy in each independent survey. A memetic algorithm that maximizes the sample representativeness is proposed as well, and experiments have proved the effectiveness and efficiency of the proposed algorithm. Zachary’s Karate Club network and three networks of migrant workers are also performed to explain the social meaning of the optimal sampling method.
format Article
id doaj-art-d0b06006ba1a44ab8c513993c388b5f3
institution Kabale University
issn 2045-2322
language English
publishDate 2025-02-01
publisher Nature Portfolio
record_format Article
series Scientific Reports
spelling doaj-art-d0b06006ba1a44ab8c513993c388b5f32025-02-09T12:37:26ZengNature PortfolioScientific Reports2045-23222025-02-0115111710.1038/s41598-025-87760-8Optimization of multiple sampling for solving network boundary specification problemRuochen Zhang0School of Economics and Management, Xi’an Shiyou UniversityAbstract Missing data caused by boundary specification has a detrimental effect on the analysis of network structures, and designing optimal sampling methods is crucial for conducting network investigations. The present study discusses the boundary specification problem in multiple surveys, and proposes a mathematical model for optimizing the sampling strategy in each independent survey. A memetic algorithm that maximizes the sample representativeness is proposed as well, and experiments have proved the effectiveness and efficiency of the proposed algorithm. Zachary’s Karate Club network and three networks of migrant workers are also performed to explain the social meaning of the optimal sampling method.https://doi.org/10.1038/s41598-025-87760-8OptimizationSocial networksAlgorithmsBoundary specification
spellingShingle Ruochen Zhang
Optimization of multiple sampling for solving network boundary specification problem
Scientific Reports
Optimization
Social networks
Algorithms
Boundary specification
title Optimization of multiple sampling for solving network boundary specification problem
title_full Optimization of multiple sampling for solving network boundary specification problem
title_fullStr Optimization of multiple sampling for solving network boundary specification problem
title_full_unstemmed Optimization of multiple sampling for solving network boundary specification problem
title_short Optimization of multiple sampling for solving network boundary specification problem
title_sort optimization of multiple sampling for solving network boundary specification problem
topic Optimization
Social networks
Algorithms
Boundary specification
url https://doi.org/10.1038/s41598-025-87760-8
work_keys_str_mv AT ruochenzhang optimizationofmultiplesamplingforsolvingnetworkboundaryspecificationproblem