TepiSense: A Social Computing-Based Real-Time Epidemic Surveillance System Using Artificial Intelligence

Artificial Intelligence (AI) technologies have enabled researchers to develop tools to monitor real-world events and user behavior using social media platforms. Twitter is particularly useful for gathering invaluable information related to diseases and public health to build real-time disease survei...

Full description

Saved in:
Bibliographic Details
Main Authors: Bilal Tahir, Muhammad Amir Mehmood
Format: Article
Language:English
Published: IEEE 2025-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/10858732/
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1823859639045849088
author Bilal Tahir
Muhammad Amir Mehmood
author_facet Bilal Tahir
Muhammad Amir Mehmood
author_sort Bilal Tahir
collection DOAJ
description Artificial Intelligence (AI) technologies have enabled researchers to develop tools to monitor real-world events and user behavior using social media platforms. Twitter is particularly useful for gathering invaluable information related to diseases and public health to build real-time disease surveillance systems. Such systems offer a cost-effective and efficient alternative to the passive, expensive, and time-consuming process of using data from healthcare organizations and hospitals. In this paper, we propose a novel system of TepiSense to automatically perform disease surveillance of epidemic-prone diseases. Our system classifies tweets related to diseases and further identifies ‘indication’ tweets that highlight the presence of patients. Our system consists of four distinct modules of pre-processor, feature extractor, classifier, and evaluator. TepiSense compares the performance of 3 feature extraction techniques, 9 machine/deep learning models, and 3 Large Language Models (LLMs). To test the performance of our system, we build a dataset of Twitter Epidemic Surveillance Corpus (TESC) containing 23.9K English and 13K labelled Urdu tweets related to six diseases: COVID19, hepatitis, malaria, flu, dengue, and HIV/AIDS. Our results show that mBERT LLM achieves the highest F-measure values of 0.96 and 0.83 for topic and indication tweets classification, respectively. Furthermore, we compute the correlation of signals generated by our system with real-world cases to test the efficacy on COVID19 disease. We notice that real-world cases have a correlation of 0.58-0.63 with the indication category tweets. Finally, we develop an interactive and user-friendly dashboard to disseminate the analytics of our system. Overall, our system offers a powerful tool for real-time disease surveillance using social media with potential implications for public health policy and decision-making.
format Article
id doaj-art-bd29c09b61c44e74b2b2f7de73eed897
institution Kabale University
issn 2169-3536
language English
publishDate 2025-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj-art-bd29c09b61c44e74b2b2f7de73eed8972025-02-11T00:01:17ZengIEEEIEEE Access2169-35362025-01-0113238162383210.1109/ACCESS.2025.353716810858732TepiSense: A Social Computing-Based Real-Time Epidemic Surveillance System Using Artificial IntelligenceBilal Tahir0https://orcid.org/0000-0002-4907-0988Muhammad Amir Mehmood1https://orcid.org/0000-0002-6652-5104Al-Khawarizmi Institute of Computer Science, University of Engineering and Technology, Lahore, PakistanFaculty of Computer and Information Systems, Islamic University of Madinah, Madinah, Saudi ArabiaArtificial Intelligence (AI) technologies have enabled researchers to develop tools to monitor real-world events and user behavior using social media platforms. Twitter is particularly useful for gathering invaluable information related to diseases and public health to build real-time disease surveillance systems. Such systems offer a cost-effective and efficient alternative to the passive, expensive, and time-consuming process of using data from healthcare organizations and hospitals. In this paper, we propose a novel system of TepiSense to automatically perform disease surveillance of epidemic-prone diseases. Our system classifies tweets related to diseases and further identifies ‘indication’ tweets that highlight the presence of patients. Our system consists of four distinct modules of pre-processor, feature extractor, classifier, and evaluator. TepiSense compares the performance of 3 feature extraction techniques, 9 machine/deep learning models, and 3 Large Language Models (LLMs). To test the performance of our system, we build a dataset of Twitter Epidemic Surveillance Corpus (TESC) containing 23.9K English and 13K labelled Urdu tweets related to six diseases: COVID19, hepatitis, malaria, flu, dengue, and HIV/AIDS. Our results show that mBERT LLM achieves the highest F-measure values of 0.96 and 0.83 for topic and indication tweets classification, respectively. Furthermore, we compute the correlation of signals generated by our system with real-world cases to test the efficacy on COVID19 disease. We notice that real-world cases have a correlation of 0.58-0.63 with the indication category tweets. Finally, we develop an interactive and user-friendly dashboard to disseminate the analytics of our system. Overall, our system offers a powerful tool for real-time disease surveillance using social media with potential implications for public health policy and decision-making.https://ieeexplore.ieee.org/document/10858732/Natural language processingepidemic intelligencepublic healthdata miningsmart citye-health
spellingShingle Bilal Tahir
Muhammad Amir Mehmood
TepiSense: A Social Computing-Based Real-Time Epidemic Surveillance System Using Artificial Intelligence
IEEE Access
Natural language processing
epidemic intelligence
public health
data mining
smart city
e-health
title TepiSense: A Social Computing-Based Real-Time Epidemic Surveillance System Using Artificial Intelligence
title_full TepiSense: A Social Computing-Based Real-Time Epidemic Surveillance System Using Artificial Intelligence
title_fullStr TepiSense: A Social Computing-Based Real-Time Epidemic Surveillance System Using Artificial Intelligence
title_full_unstemmed TepiSense: A Social Computing-Based Real-Time Epidemic Surveillance System Using Artificial Intelligence
title_short TepiSense: A Social Computing-Based Real-Time Epidemic Surveillance System Using Artificial Intelligence
title_sort tepisense a social computing based real time epidemic surveillance system using artificial intelligence
topic Natural language processing
epidemic intelligence
public health
data mining
smart city
e-health
url https://ieeexplore.ieee.org/document/10858732/
work_keys_str_mv AT bilaltahir tepisenseasocialcomputingbasedrealtimeepidemicsurveillancesystemusingartificialintelligence
AT muhammadamirmehmood tepisenseasocialcomputingbasedrealtimeepidemicsurveillancesystemusingartificialintelligence