Comparison of algorithms for the recognition of ChatGPT paraphrased texts

Abstract The rapid development of artificial intelligence, especially AI assistants, is leading to new forms of plagiarism that are difficult to detect using existing methods. Paraphrasing tools make this problem even more complex and challenging especially in minor languages with inadequate resourc...

Full description

Saved in:
Bibliographic Details
Main Authors: Aleksandar Kartelj, Miljana Mladenović, Staša Vujičić Stanković
Format: Article
Language:English
Published: SpringerOpen 2025-02-01
Series:Journal of Big Data
Subjects:
Online Access:https://doi.org/10.1186/s40537-025-01082-0
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1823862010822000640
author Aleksandar Kartelj
Miljana Mladenović
Staša Vujičić Stanković
author_facet Aleksandar Kartelj
Miljana Mladenović
Staša Vujičić Stanković
author_sort Aleksandar Kartelj
collection DOAJ
description Abstract The rapid development of artificial intelligence, especially AI assistants, is leading to new forms of plagiarism that are difficult to detect using existing methods. Paraphrasing tools make this problem even more complex and challenging especially in minor languages with inadequate resources and tools. This study explores strategies to help detect plagiarism generated by ChatGPT 4.0 and altered by paraphrasing tools. We propose two new datasets consisting of abstracts of doctoral theses in English and Serbian. Both datasets were subjected to ChatGPT paraphrasing, which allowed us to form two classes of texts: human-written and AI-generated, i.e., AI-paraphrased. We then comprehensively compare 19 widely used classification algorithms based on two feature sets: word unigrams and character multigrams. In addition, we compare these to the results of a commercially available pre-trained ChatGPT content detector, ZeroGPT. The results on the English corpus turn out to be very accurate, achieving an accuracy of 95% or more. In contrast, the results on the Serbian corpus were less accurate, achieving an accuracy of just over 85%. Syntax analysis of the training datasets has shown that in Serbian GPT-paraphrased texts, 33.2% of sentences remain the same, and they are found in 63% of documents. GPT-paraphrased English texts showed that 3.2% of sentences remain the same, and they are found in 16% of documents. Syntax analysis of the test datasets has shown that the change of the model temperature influences syntactic features (average number of words and sentences) in English texts and slightly or not in Serbian texts. We attribute all these differences to GPT’s lower paraphrasing ability in minor languages such as Serbian. Presented findings underscore the necessity for making persistent effort in developing tools made for detecting AI-paraphrased texts in academic and professional settings, particularly for minor languages with limited NLP resources, to preserve content integrity and authenticity.
format Article
id doaj-art-e41fa5df348c41b7aa9df76d9db53dc1
institution Kabale University
issn 2196-1115
language English
publishDate 2025-02-01
publisher SpringerOpen
record_format Article
series Journal of Big Data
spelling doaj-art-e41fa5df348c41b7aa9df76d9db53dc12025-02-09T12:41:17ZengSpringerOpenJournal of Big Data2196-11152025-02-0112111710.1186/s40537-025-01082-0Comparison of algorithms for the recognition of ChatGPT paraphrased textsAleksandar Kartelj0Miljana Mladenović1Staša Vujičić Stanković2Faculty of Mathematics, University of BelgradeFaculty of Education, University of NišFaculty of Mathematics, University of BelgradeAbstract The rapid development of artificial intelligence, especially AI assistants, is leading to new forms of plagiarism that are difficult to detect using existing methods. Paraphrasing tools make this problem even more complex and challenging especially in minor languages with inadequate resources and tools. This study explores strategies to help detect plagiarism generated by ChatGPT 4.0 and altered by paraphrasing tools. We propose two new datasets consisting of abstracts of doctoral theses in English and Serbian. Both datasets were subjected to ChatGPT paraphrasing, which allowed us to form two classes of texts: human-written and AI-generated, i.e., AI-paraphrased. We then comprehensively compare 19 widely used classification algorithms based on two feature sets: word unigrams and character multigrams. In addition, we compare these to the results of a commercially available pre-trained ChatGPT content detector, ZeroGPT. The results on the English corpus turn out to be very accurate, achieving an accuracy of 95% or more. In contrast, the results on the Serbian corpus were less accurate, achieving an accuracy of just over 85%. Syntax analysis of the training datasets has shown that in Serbian GPT-paraphrased texts, 33.2% of sentences remain the same, and they are found in 63% of documents. GPT-paraphrased English texts showed that 3.2% of sentences remain the same, and they are found in 16% of documents. Syntax analysis of the test datasets has shown that the change of the model temperature influences syntactic features (average number of words and sentences) in English texts and slightly or not in Serbian texts. We attribute all these differences to GPT’s lower paraphrasing ability in minor languages such as Serbian. Presented findings underscore the necessity for making persistent effort in developing tools made for detecting AI-paraphrased texts in academic and professional settings, particularly for minor languages with limited NLP resources, to preserve content integrity and authenticity.https://doi.org/10.1186/s40537-025-01082-0Plagiarism detectionChatGPTClassificationLarge language models
spellingShingle Aleksandar Kartelj
Miljana Mladenović
Staša Vujičić Stanković
Comparison of algorithms for the recognition of ChatGPT paraphrased texts
Journal of Big Data
Plagiarism detection
ChatGPT
Classification
Large language models
title Comparison of algorithms for the recognition of ChatGPT paraphrased texts
title_full Comparison of algorithms for the recognition of ChatGPT paraphrased texts
title_fullStr Comparison of algorithms for the recognition of ChatGPT paraphrased texts
title_full_unstemmed Comparison of algorithms for the recognition of ChatGPT paraphrased texts
title_short Comparison of algorithms for the recognition of ChatGPT paraphrased texts
title_sort comparison of algorithms for the recognition of chatgpt paraphrased texts
topic Plagiarism detection
ChatGPT
Classification
Large language models
url https://doi.org/10.1186/s40537-025-01082-0
work_keys_str_mv AT aleksandarkartelj comparisonofalgorithmsfortherecognitionofchatgptparaphrasedtexts
AT miljanamladenovic comparisonofalgorithmsfortherecognitionofchatgptparaphrasedtexts
AT stasavujicicstankovic comparisonofalgorithmsfortherecognitionofchatgptparaphrasedtexts