A data pipeline for secure extraction and sharing of social determinants of health.

<h4>Objectives</h4>Linking neighborhood- and patient-level data provides valuable information about the influence of upstream social determinants of health (SDOH). However, sharing of these data across health systems presents challenges. We set out to develop a pipeline to acquire, deide...

Full description

Saved in:
Bibliographic Details
Main Authors: Tyler Schappe, Lisa M McElroy, Moronke Ogundolie, Roland Matsouaka, Ursula Rogers, Nrupen A Bhavsar
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2025-01-01
Series:PLoS ONE
Online Access:https://doi.org/10.1371/journal.pone.0317215
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1825206814789074944
author Tyler Schappe
Lisa M McElroy
Moronke Ogundolie
Roland Matsouaka
Ursula Rogers
Nrupen A Bhavsar
author_facet Tyler Schappe
Lisa M McElroy
Moronke Ogundolie
Roland Matsouaka
Ursula Rogers
Nrupen A Bhavsar
author_sort Tyler Schappe
collection DOAJ
description <h4>Objectives</h4>Linking neighborhood- and patient-level data provides valuable information about the influence of upstream social determinants of health (SDOH). However, sharing of these data across health systems presents challenges. We set out to develop a pipeline to acquire, deidentify, and share neighborhood-level SDOH data across multiple health systems.<h4>Methods</h4>We created a pipeline centered around Decentralized Geomarker Assessment for Multi-Site Studies (DeGAUSS) that utilizes containerization to geocode patient addresses and obtain neighborhood-level SDOH variables. We compared DeGAUSS to a third-party vendor geocoding tool available at Duke Health using a cohort of adult patients referred for abdominal transplant from January 1, 2016, to December 31, 2022. We calculated Cohen's Kappa and percent disagreement at census block group and tract levels, and by Area Deprivation Index, urbanicity, and year.<h4>Results</h4>The pipeline successfully generated SDOH data for 97.8% of addresses. There was high concordance between DeGAUSS and the vendor tool at the census block group (0.93) and tract levels (0.95). At the block group level, disagreement proportion differed by year and urbanicity, with larger disagreement in the rural category than in micropolitan and metropolitan categories (13%, 7%, 6.2%, respectively).<h4>Discussion and conclusion</h4>We describe a novel pipeline that can facilitate the secure acquisition and sharing of neighborhood-level SDOH without sharing PHI. The pipeline can be scaled to include additional social, climate, and environmental variables, and can be extended to an unlimited number of health systems.
format Article
id doaj-art-abd885711dd0450599ce149a55a68e6e
institution Kabale University
issn 1932-6203
language English
publishDate 2025-01-01
publisher Public Library of Science (PLoS)
record_format Article
series PLoS ONE
spelling doaj-art-abd885711dd0450599ce149a55a68e6e2025-02-07T05:30:42ZengPublic Library of Science (PLoS)PLoS ONE1932-62032025-01-01201e031721510.1371/journal.pone.0317215A data pipeline for secure extraction and sharing of social determinants of health.Tyler SchappeLisa M McElroyMoronke OgundolieRoland MatsouakaUrsula RogersNrupen A Bhavsar<h4>Objectives</h4>Linking neighborhood- and patient-level data provides valuable information about the influence of upstream social determinants of health (SDOH). However, sharing of these data across health systems presents challenges. We set out to develop a pipeline to acquire, deidentify, and share neighborhood-level SDOH data across multiple health systems.<h4>Methods</h4>We created a pipeline centered around Decentralized Geomarker Assessment for Multi-Site Studies (DeGAUSS) that utilizes containerization to geocode patient addresses and obtain neighborhood-level SDOH variables. We compared DeGAUSS to a third-party vendor geocoding tool available at Duke Health using a cohort of adult patients referred for abdominal transplant from January 1, 2016, to December 31, 2022. We calculated Cohen's Kappa and percent disagreement at census block group and tract levels, and by Area Deprivation Index, urbanicity, and year.<h4>Results</h4>The pipeline successfully generated SDOH data for 97.8% of addresses. There was high concordance between DeGAUSS and the vendor tool at the census block group (0.93) and tract levels (0.95). At the block group level, disagreement proportion differed by year and urbanicity, with larger disagreement in the rural category than in micropolitan and metropolitan categories (13%, 7%, 6.2%, respectively).<h4>Discussion and conclusion</h4>We describe a novel pipeline that can facilitate the secure acquisition and sharing of neighborhood-level SDOH without sharing PHI. The pipeline can be scaled to include additional social, climate, and environmental variables, and can be extended to an unlimited number of health systems.https://doi.org/10.1371/journal.pone.0317215
spellingShingle Tyler Schappe
Lisa M McElroy
Moronke Ogundolie
Roland Matsouaka
Ursula Rogers
Nrupen A Bhavsar
A data pipeline for secure extraction and sharing of social determinants of health.
PLoS ONE
title A data pipeline for secure extraction and sharing of social determinants of health.
title_full A data pipeline for secure extraction and sharing of social determinants of health.
title_fullStr A data pipeline for secure extraction and sharing of social determinants of health.
title_full_unstemmed A data pipeline for secure extraction and sharing of social determinants of health.
title_short A data pipeline for secure extraction and sharing of social determinants of health.
title_sort data pipeline for secure extraction and sharing of social determinants of health
url https://doi.org/10.1371/journal.pone.0317215
work_keys_str_mv AT tylerschappe adatapipelineforsecureextractionandsharingofsocialdeterminantsofhealth
AT lisammcelroy adatapipelineforsecureextractionandsharingofsocialdeterminantsofhealth
AT moronkeogundolie adatapipelineforsecureextractionandsharingofsocialdeterminantsofhealth
AT rolandmatsouaka adatapipelineforsecureextractionandsharingofsocialdeterminantsofhealth
AT ursularogers adatapipelineforsecureextractionandsharingofsocialdeterminantsofhealth
AT nrupenabhavsar adatapipelineforsecureextractionandsharingofsocialdeterminantsofhealth
AT tylerschappe datapipelineforsecureextractionandsharingofsocialdeterminantsofhealth
AT lisammcelroy datapipelineforsecureextractionandsharingofsocialdeterminantsofhealth
AT moronkeogundolie datapipelineforsecureextractionandsharingofsocialdeterminantsofhealth
AT rolandmatsouaka datapipelineforsecureextractionandsharingofsocialdeterminantsofhealth
AT ursularogers datapipelineforsecureextractionandsharingofsocialdeterminantsofhealth
AT nrupenabhavsar datapipelineforsecureextractionandsharingofsocialdeterminantsofhealth