Research on YOLOv5 Oracle Recognition Algorithm Based on Multi-Module Fusion
The recognition of oracle bone script is of significant importance for understanding the evolution of Chinese characters, their morphological features, and semantic changes. However, traditional methods and some deep learning models have limited ability to capture the complex forms and fine details...
Saved in:
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2025-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/10858166/ |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
_version_ | 1823859587348955136 |
---|---|
author | Xinhang Zhang Zhenhua Ma Yaru Zhang Huiying Ru |
author_facet | Xinhang Zhang Zhenhua Ma Yaru Zhang Huiying Ru |
author_sort | Xinhang Zhang |
collection | DOAJ |
description | The recognition of oracle bone script is of significant importance for understanding the evolution of Chinese characters, their morphological features, and semantic changes. However, traditional methods and some deep learning models have limited ability to capture the complex forms and fine details of oracle bone script, which makes it difficult to fully detect subtle differences between characters. Additionally, models trained on such data tend to struggle with recognizing rare or unseen characters, often leading to recognition errors. Therefore, improving the robustness of these models is essential. This paper presents a novel recognition algorithm based on YOLOv5, incorporating BiFPN-SDI, C3-DAttention, and Detect_Efficient to significantly enhance detection performance. BiFPN-SDI enables more precise feature fusion and attention mechanisms, improving the detection of small targets. C3-DAttention combines channel and spatial attention mechanisms to enhance feature extraction in deep convolutional neural networks. Detect_Efficient further improves the model’s detection and recognition capabilities. Experimental results show that the proposed improvements lead to a 0.7% increase in precision, a 1.1% increase in recall, and a 0.3% improvement in MAP@50. Furthermore, the model’s parameter count is reduced to 1,009,668, and its processing speed is increased to 90 fps, significantly improving the ability to extract and recognize features in oracle bone script. |
format | Article |
id | doaj-art-005cb876fffa4cbbb7c57029aedde772 |
institution | Kabale University |
issn | 2169-3536 |
language | English |
publishDate | 2025-01-01 |
publisher | IEEE |
record_format | Article |
series | IEEE Access |
spelling | doaj-art-005cb876fffa4cbbb7c57029aedde7722025-02-11T00:00:54ZengIEEEIEEE Access2169-35362025-01-0113243582436710.1109/ACCESS.2025.353655310858166Research on YOLOv5 Oracle Recognition Algorithm Based on Multi-Module FusionXinhang Zhang0https://orcid.org/0009-0006-1831-1650Zhenhua Ma1https://orcid.org/0000-0002-2916-7117Yaru Zhang2Huiying Ru3https://orcid.org/0000-0002-6875-4525School of Science, Hebei University of Architecture, Zhangjiakou, Hebei, ChinaSchool of Science, Hebei University of Architecture, Zhangjiakou, Hebei, ChinaSchool of Science, Hebei University of Architecture, Zhangjiakou, Hebei, ChinaSchool of Science, Hebei University of Architecture, Zhangjiakou, Hebei, ChinaThe recognition of oracle bone script is of significant importance for understanding the evolution of Chinese characters, their morphological features, and semantic changes. However, traditional methods and some deep learning models have limited ability to capture the complex forms and fine details of oracle bone script, which makes it difficult to fully detect subtle differences between characters. Additionally, models trained on such data tend to struggle with recognizing rare or unseen characters, often leading to recognition errors. Therefore, improving the robustness of these models is essential. This paper presents a novel recognition algorithm based on YOLOv5, incorporating BiFPN-SDI, C3-DAttention, and Detect_Efficient to significantly enhance detection performance. BiFPN-SDI enables more precise feature fusion and attention mechanisms, improving the detection of small targets. C3-DAttention combines channel and spatial attention mechanisms to enhance feature extraction in deep convolutional neural networks. Detect_Efficient further improves the model’s detection and recognition capabilities. Experimental results show that the proposed improvements lead to a 0.7% increase in precision, a 1.1% increase in recall, and a 0.3% improvement in MAP@50. Furthermore, the model’s parameter count is reduced to 1,009,668, and its processing speed is increased to 90 fps, significantly improving the ability to extract and recognize features in oracle bone script.https://ieeexplore.ieee.org/document/10858166/YOLOv5feature extractionC3_DAttentionBiFPN_SDIdetect_efficientoracle recognition |
spellingShingle | Xinhang Zhang Zhenhua Ma Yaru Zhang Huiying Ru Research on YOLOv5 Oracle Recognition Algorithm Based on Multi-Module Fusion IEEE Access YOLOv5 feature extraction C3_DAttention BiFPN_SDI detect_efficient oracle recognition |
title | Research on YOLOv5 Oracle Recognition Algorithm Based on Multi-Module Fusion |
title_full | Research on YOLOv5 Oracle Recognition Algorithm Based on Multi-Module Fusion |
title_fullStr | Research on YOLOv5 Oracle Recognition Algorithm Based on Multi-Module Fusion |
title_full_unstemmed | Research on YOLOv5 Oracle Recognition Algorithm Based on Multi-Module Fusion |
title_short | Research on YOLOv5 Oracle Recognition Algorithm Based on Multi-Module Fusion |
title_sort | research on yolov5 oracle recognition algorithm based on multi module fusion |
topic | YOLOv5 feature extraction C3_DAttention BiFPN_SDI detect_efficient oracle recognition |
url | https://ieeexplore.ieee.org/document/10858166/ |
work_keys_str_mv | AT xinhangzhang researchonyolov5oraclerecognitionalgorithmbasedonmultimodulefusion AT zhenhuama researchonyolov5oraclerecognitionalgorithmbasedonmultimodulefusion AT yaruzhang researchonyolov5oraclerecognitionalgorithmbasedonmultimodulefusion AT huiyingru researchonyolov5oraclerecognitionalgorithmbasedonmultimodulefusion |