A lightweight wheat ear counting model in UAV images based on improved YOLOv8
Wheat (Triticum aestivum L.) is one of the significant food crops in the world, and the number of wheat ears serves as a critical indicator of wheat yield. Accurate quantification of wheat ear counts is crucial for effective scientific management of wheat fields. To address the challenges of missed...
Saved in:
Main Authors: | , , , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Frontiers Media S.A.
2025-02-01
|
Series: | Frontiers in Plant Science |
Subjects: | |
Online Access: | https://www.frontiersin.org/articles/10.3389/fpls.2025.1536017/full |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
_version_ | 1823859176300871680 |
---|---|
author | Ruofan Li Xiaohua Sun Kun Yang Zhenxue He Zhenxue He Xinxin Wang Xinxin Wang Chao Wang Chao Wang Bin Wang Fushun Wang Fushun Wang Hongquan Liu Hongquan Liu |
author_facet | Ruofan Li Xiaohua Sun Kun Yang Zhenxue He Zhenxue He Xinxin Wang Xinxin Wang Chao Wang Chao Wang Bin Wang Fushun Wang Fushun Wang Hongquan Liu Hongquan Liu |
author_sort | Ruofan Li |
collection | DOAJ |
description | Wheat (Triticum aestivum L.) is one of the significant food crops in the world, and the number of wheat ears serves as a critical indicator of wheat yield. Accurate quantification of wheat ear counts is crucial for effective scientific management of wheat fields. To address the challenges of missed detections, false detections, and diminished detection accuracy arising from the dense distribution, small size, and high overlap of wheat ears in Unmanned Aerial Vehicle (UAV) imagery, we propose a lightweight model, PSDS-YOLOv8 (P2-SPD-DySample-SCAM-YOLOv8), on the basis of the improved YOLOv8 framework, for the accurate detection of wheat ears in UAV images. First, the high resolution micro-scale detection layer (P2) is introduced to enhance the model’s ability to recognize and localize small targets, while the large-scale detection layer (P5) is eliminated to minimize computational redundancy. Then, the Spatial Pyramid Dilated Convolution (SPD-Conv) module is employed to improve the ability of the network to learn features, thereby enhancing the representation of weak features of small targets and preventing information loss caused by low image resolution or small target sizes. Additionally, a lightweight dynamic upsampler, Dynamic Sample (DySample), is introduced to decrease computational complexity of the upsampling process by dynamically adjusting interpolation positions. Finally, the lightweight module Spatial Context-Aware Module (SCAM) is utilized to accurately map the connection between small targets and global features, enhancing the discrimination of small targets from the background. Experimental results demonstrate that the improved PSDS-YOLOv8 model achieves Mean Average Precision(mAP) 50 and mAP50:95 scores of 96.5% and 55.2%, which increases by 2.8% and 4.4%, while the number of parameters is reduced by 40.6% in comparison with the baseline YOLOv8 model. Compared to YOLOv5, YOLOv7, YOLOv9, YOLOv10, YOLOv11, Faster RCNN, SSD, and RetinaNet, the improved model demonstrates superior accuracy and fewer parameters, exhibiting the best overall performance. The methodology proposed in this study enhances model accuracy while concurrently reducing resource consumption and effectively addressing the issues of missed and false detections of wheat ears, thereby providing technical support and theoretical guidance for intelligent counting of wheat ears in UAV imagery. |
format | Article |
id | doaj-art-5eba3140280b486a91e9e00f8d078dd9 |
institution | Kabale University |
issn | 1664-462X |
language | English |
publishDate | 2025-02-01 |
publisher | Frontiers Media S.A. |
record_format | Article |
series | Frontiers in Plant Science |
spelling | doaj-art-5eba3140280b486a91e9e00f8d078dd92025-02-11T06:59:34ZengFrontiers Media S.A.Frontiers in Plant Science1664-462X2025-02-011610.3389/fpls.2025.15360171536017A lightweight wheat ear counting model in UAV images based on improved YOLOv8Ruofan Li0Xiaohua Sun1Kun Yang2Zhenxue He3Zhenxue He4Xinxin Wang5Xinxin Wang6Chao Wang7Chao Wang8Bin Wang9Fushun Wang10Fushun Wang11Hongquan Liu12Hongquan Liu13College of Information Science and Technology, Hebei Agricultural University, Baoding, ChinaDepartment of Digital Media, Hebei Software Institute, Baoding, ChinaCollege of Information Science and Technology, Hebei Agricultural University, Baoding, ChinaCollege of Information Science and Technology, Hebei Agricultural University, Baoding, ChinaHebei Key Laboratory of Agricultural Big Data, Hebei Agricultural University, Baoding, ChinaCollege of Horticulture, Hebei Agricultural University, Baoding, ChinaAgricultural Engineering Technology Research Center of National North Mountainous Area, Hebei Agricultural University, Baoding, ChinaCollege of Information Science and Technology, Hebei Agricultural University, Baoding, ChinaHebei Key Laboratory of Agricultural Big Data, Hebei Agricultural University, Baoding, ChinaCollege of Information Science and Technology, Hebei Agricultural University, Baoding, ChinaCollege of Information Science and Technology, Hebei Agricultural University, Baoding, ChinaHebei Key Laboratory of Agricultural Big Data, Hebei Agricultural University, Baoding, ChinaCollege of Urban and Rural Construction, Hebei Agricultural University, Baoding, ChinaState Key Laboratory of North China Crop Improvement and Regulation, Hebei Agricultural University, Baoding, ChinaWheat (Triticum aestivum L.) is one of the significant food crops in the world, and the number of wheat ears serves as a critical indicator of wheat yield. Accurate quantification of wheat ear counts is crucial for effective scientific management of wheat fields. To address the challenges of missed detections, false detections, and diminished detection accuracy arising from the dense distribution, small size, and high overlap of wheat ears in Unmanned Aerial Vehicle (UAV) imagery, we propose a lightweight model, PSDS-YOLOv8 (P2-SPD-DySample-SCAM-YOLOv8), on the basis of the improved YOLOv8 framework, for the accurate detection of wheat ears in UAV images. First, the high resolution micro-scale detection layer (P2) is introduced to enhance the model’s ability to recognize and localize small targets, while the large-scale detection layer (P5) is eliminated to minimize computational redundancy. Then, the Spatial Pyramid Dilated Convolution (SPD-Conv) module is employed to improve the ability of the network to learn features, thereby enhancing the representation of weak features of small targets and preventing information loss caused by low image resolution or small target sizes. Additionally, a lightweight dynamic upsampler, Dynamic Sample (DySample), is introduced to decrease computational complexity of the upsampling process by dynamically adjusting interpolation positions. Finally, the lightweight module Spatial Context-Aware Module (SCAM) is utilized to accurately map the connection between small targets and global features, enhancing the discrimination of small targets from the background. Experimental results demonstrate that the improved PSDS-YOLOv8 model achieves Mean Average Precision(mAP) 50 and mAP50:95 scores of 96.5% and 55.2%, which increases by 2.8% and 4.4%, while the number of parameters is reduced by 40.6% in comparison with the baseline YOLOv8 model. Compared to YOLOv5, YOLOv7, YOLOv9, YOLOv10, YOLOv11, Faster RCNN, SSD, and RetinaNet, the improved model demonstrates superior accuracy and fewer parameters, exhibiting the best overall performance. The methodology proposed in this study enhances model accuracy while concurrently reducing resource consumption and effectively addressing the issues of missed and false detections of wheat ears, thereby providing technical support and theoretical guidance for intelligent counting of wheat ears in UAV imagery.https://www.frontiersin.org/articles/10.3389/fpls.2025.1536017/fullwheat ear detectionunmanned aerial vehiclesmall targetYOLOv8lightweight |
spellingShingle | Ruofan Li Xiaohua Sun Kun Yang Zhenxue He Zhenxue He Xinxin Wang Xinxin Wang Chao Wang Chao Wang Bin Wang Fushun Wang Fushun Wang Hongquan Liu Hongquan Liu A lightweight wheat ear counting model in UAV images based on improved YOLOv8 Frontiers in Plant Science wheat ear detection unmanned aerial vehicle small target YOLOv8 lightweight |
title | A lightweight wheat ear counting model in UAV images based on improved YOLOv8 |
title_full | A lightweight wheat ear counting model in UAV images based on improved YOLOv8 |
title_fullStr | A lightweight wheat ear counting model in UAV images based on improved YOLOv8 |
title_full_unstemmed | A lightweight wheat ear counting model in UAV images based on improved YOLOv8 |
title_short | A lightweight wheat ear counting model in UAV images based on improved YOLOv8 |
title_sort | lightweight wheat ear counting model in uav images based on improved yolov8 |
topic | wheat ear detection unmanned aerial vehicle small target YOLOv8 lightweight |
url | https://www.frontiersin.org/articles/10.3389/fpls.2025.1536017/full |
work_keys_str_mv | AT ruofanli alightweightwheatearcountingmodelinuavimagesbasedonimprovedyolov8 AT xiaohuasun alightweightwheatearcountingmodelinuavimagesbasedonimprovedyolov8 AT kunyang alightweightwheatearcountingmodelinuavimagesbasedonimprovedyolov8 AT zhenxuehe alightweightwheatearcountingmodelinuavimagesbasedonimprovedyolov8 AT zhenxuehe alightweightwheatearcountingmodelinuavimagesbasedonimprovedyolov8 AT xinxinwang alightweightwheatearcountingmodelinuavimagesbasedonimprovedyolov8 AT xinxinwang alightweightwheatearcountingmodelinuavimagesbasedonimprovedyolov8 AT chaowang alightweightwheatearcountingmodelinuavimagesbasedonimprovedyolov8 AT chaowang alightweightwheatearcountingmodelinuavimagesbasedonimprovedyolov8 AT binwang alightweightwheatearcountingmodelinuavimagesbasedonimprovedyolov8 AT fushunwang alightweightwheatearcountingmodelinuavimagesbasedonimprovedyolov8 AT fushunwang alightweightwheatearcountingmodelinuavimagesbasedonimprovedyolov8 AT hongquanliu alightweightwheatearcountingmodelinuavimagesbasedonimprovedyolov8 AT hongquanliu alightweightwheatearcountingmodelinuavimagesbasedonimprovedyolov8 AT ruofanli lightweightwheatearcountingmodelinuavimagesbasedonimprovedyolov8 AT xiaohuasun lightweightwheatearcountingmodelinuavimagesbasedonimprovedyolov8 AT kunyang lightweightwheatearcountingmodelinuavimagesbasedonimprovedyolov8 AT zhenxuehe lightweightwheatearcountingmodelinuavimagesbasedonimprovedyolov8 AT zhenxuehe lightweightwheatearcountingmodelinuavimagesbasedonimprovedyolov8 AT xinxinwang lightweightwheatearcountingmodelinuavimagesbasedonimprovedyolov8 AT xinxinwang lightweightwheatearcountingmodelinuavimagesbasedonimprovedyolov8 AT chaowang lightweightwheatearcountingmodelinuavimagesbasedonimprovedyolov8 AT chaowang lightweightwheatearcountingmodelinuavimagesbasedonimprovedyolov8 AT binwang lightweightwheatearcountingmodelinuavimagesbasedonimprovedyolov8 AT fushunwang lightweightwheatearcountingmodelinuavimagesbasedonimprovedyolov8 AT fushunwang lightweightwheatearcountingmodelinuavimagesbasedonimprovedyolov8 AT hongquanliu lightweightwheatearcountingmodelinuavimagesbasedonimprovedyolov8 AT hongquanliu lightweightwheatearcountingmodelinuavimagesbasedonimprovedyolov8 |