Comprehensive Study on Zero-Shot Text Classification Using Category Mapping

Existing zero-shot text classification methods based on large pre-trained models with added prompts exhibit strong representational capacity and scalability but have relatively poor commercial applicability. Approaches that fine-tune smaller models using label mappings and existing datasets for zero...

Full description

Saved in:
Bibliographic Details
Main Authors: Kai Zhang, Qiuxia Zhang, Chung-Che Wang, Jyh-Shing Roger Jang
Format: Article
Language:English
Published: IEEE 2025-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/10870154/
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1823859628457328640
author Kai Zhang
Qiuxia Zhang
Chung-Che Wang
Jyh-Shing Roger Jang
author_facet Kai Zhang
Qiuxia Zhang
Chung-Che Wang
Jyh-Shing Roger Jang
author_sort Kai Zhang
collection DOAJ
description Existing zero-shot text classification methods based on large pre-trained models with added prompts exhibit strong representational capacity and scalability but have relatively poor commercial applicability. Approaches that fine-tune smaller models using label mappings and existing datasets for zero-shot classification are simpler but suffer from weaker generalization capabilities. This paper employs three strategies to improve the accuracy and generalization of pre-trained models in zero-shot text classification tasks: 1) Utilizing a pre-trained model that transforms inputs into a standardized multiple-choice format. 2) Constructing a text classification training set using Wikipedia text data to fine-tune the pre-trained model; 3) Proposing a zero-shot category mapping method based on GloVe text similarity, using Wikipedia categories as substitutes for text labels. Without fine-tuning on the target labels, this method achieves performance comparable to the best models fine-tuned with target labels.
format Article
id doaj-art-9876ddcdd6cf46d39f5087f41c52b46f
institution Kabale University
issn 2169-3536
language English
publishDate 2025-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj-art-9876ddcdd6cf46d39f5087f41c52b46f2025-02-11T00:01:02ZengIEEEIEEE Access2169-35362025-01-0113235262354610.1109/ACCESS.2025.353810310870154Comprehensive Study on Zero-Shot Text Classification Using Category MappingKai Zhang0https://orcid.org/0009-0004-1688-1815Qiuxia Zhang1Chung-Che Wang2https://orcid.org/0000-0002-1414-9848Jyh-Shing Roger Jang3Yiwu Industrial and Commercial College, Yiwu, Zhejiang, ChinaDepartment of Computer Science and Information Engineering, National Taiwan University, Taipei, TaiwanDepartment of Computer Science and Information Engineering, National Taiwan University, Taipei, TaiwanDepartment of Computer Science and Information Engineering, National Taiwan University, Taipei, TaiwanExisting zero-shot text classification methods based on large pre-trained models with added prompts exhibit strong representational capacity and scalability but have relatively poor commercial applicability. Approaches that fine-tune smaller models using label mappings and existing datasets for zero-shot classification are simpler but suffer from weaker generalization capabilities. This paper employs three strategies to improve the accuracy and generalization of pre-trained models in zero-shot text classification tasks: 1) Utilizing a pre-trained model that transforms inputs into a standardized multiple-choice format. 2) Constructing a text classification training set using Wikipedia text data to fine-tune the pre-trained model; 3) Proposing a zero-shot category mapping method based on GloVe text similarity, using Wikipedia categories as substitutes for text labels. Without fine-tuning on the target labels, this method achieves performance comparable to the best models fine-tuned with target labels.https://ieeexplore.ieee.org/document/10870154/Natural language processingpre-trained language modelzero-shot text classificationclassificationGloVe
spellingShingle Kai Zhang
Qiuxia Zhang
Chung-Che Wang
Jyh-Shing Roger Jang
Comprehensive Study on Zero-Shot Text Classification Using Category Mapping
IEEE Access
Natural language processing
pre-trained language model
zero-shot text classification
classification
GloVe
title Comprehensive Study on Zero-Shot Text Classification Using Category Mapping
title_full Comprehensive Study on Zero-Shot Text Classification Using Category Mapping
title_fullStr Comprehensive Study on Zero-Shot Text Classification Using Category Mapping
title_full_unstemmed Comprehensive Study on Zero-Shot Text Classification Using Category Mapping
title_short Comprehensive Study on Zero-Shot Text Classification Using Category Mapping
title_sort comprehensive study on zero shot text classification using category mapping
topic Natural language processing
pre-trained language model
zero-shot text classification
classification
GloVe
url https://ieeexplore.ieee.org/document/10870154/
work_keys_str_mv AT kaizhang comprehensivestudyonzeroshottextclassificationusingcategorymapping
AT qiuxiazhang comprehensivestudyonzeroshottextclassificationusingcategorymapping
AT chungchewang comprehensivestudyonzeroshottextclassificationusingcategorymapping
AT jyhshingrogerjang comprehensivestudyonzeroshottextclassificationusingcategorymapping