
Detecting abnormal surface features is an important method for identifying abnormal fish. However, existing methods suffer from excessive subjectivity, limited accuracy and poor real-time performance. To address these challenges, a real-time and accurate detection model for abnormal surface features of in-water fish is proposed, based on an improved YOLOv5s. The specific enhancements are: 1) We optimize the complete intersection over union loss and non-maximum suppression through the normalized Gaussian Wasserstein distance metric to improve the model's ability to detect tiny targets. 2) We design the DenseOne module to enhance the reusability of abnormal surface features and introduce MobileViTv2 to improve detection speed; both are integrated into the feature extraction network. 3) Following the ACmix principle, we fuse omni-dimensional dynamic convolution and the convolutional block attention module to address the challenge of extracting deep features within complex backgrounds. We carried out comparative experiments on a validation set of 160 images of in-water abnormal fish, achieving a precision of 99.5%, recall of 99.1%, mAP50 of 99.1%, mAP50:95 of 73.9% and a detection speed of 88 frames per second (FPS). These results surpass the baseline by 1.4, 1.2, 3.2 and 8.2 percentage points and 1 FPS, respectively. Moreover, the improved model outperforms other state-of-the-art models on comprehensive evaluation indexes.
Citation: Zheng Zhang, Xiang Lu, Shouqi Cao. An efficient detection model based on improved YOLOv5s for abnormal surface features of fish[J]. Mathematical Biosciences and Engineering, 2024, 21(2): 1765-1790. doi: 10.3934/mbe.2024076
Aquaculture provides humans with a wealth of nutrients and has become an important part of the global agricultural economy. According to statistics, 88% of global annual fishery production is directly consumed by humans [1,2]. With population growth and economic development, the demand for fish continues to increase, and the scale of aquaculture is gradually expanding, which brings huge challenges to fish farming [3]. During fish farming, abnormalities such as diseases and parasites may occur in fish, resulting in reduced fish quality and welfare. Fish abnormality detection helps farmers adjust breeding strategies in a timely manner, prevent disease outbreaks and improve breeding efficiency [4,5]. In the past, manual visual inspection was the primary method for abnormal fish detection. However, this method suffers from low efficiency, a high missed detection rate and strong subjectivity. Detecting abnormal features on the surface of fish is an important basis for distinguishing abnormal fish. Therefore, rapid and accurate detection of abnormal surface features of fish has become a hot issue in aquaculture.
Computer vision technology is an effective, cost-efficient and non-invasive detection technique, carrying substantial significance in driving the automation and intelligence of aquaculture [6]. It has great potential for abnormal fish detection in aquaculture [7]. With the development of artificial intelligence, such as computer vision and deep learning, especially in object detection, image classification and image segmentation, researchers have begun to detect abnormal fish surface features by applying neural networks to distinguish abnormal fish.
Yasruddin et al. [8] used computer vision and deep convolutional neural networks to detect fish diseases and used Faster-RCNN to train on the surface features of diseased fishes. The results showed that the recognition accuracy was satisfactory. Ashraf and Atia [9] used a transfer learning model to learn two different shrimp disease signatures and distinguish diseased shrimps from normal shrimps. Wang et al. [10] proposed a computer vision-based detection method for abnormal surface features of Penaeus vannamei. Rapid detection of Penaeus vannamei diseases is achieved through image enhancement methods such as denoising and feature enhancement, as well as the LeNet model. The accuracy of the deep learning model used reaches approximately 96.1%. Chen et al. [11] proposed a two-stage ImageNet deep learning model with a convolutional neural network structure. The model was able to classify three abnormal appearances of grouper, achieving a high average accuracy of 98.94%. Gupta et al. [12] used a convolutional neural network based on VGG19 for fish wound detection, which can classify normal fish and abnormal fish, and the recognition accuracy reached 96.7%. In this body of research, although deep learning techniques have shown promising results for abnormal fish detection, there are still certain limitations: 1) The detection of abnormal fish in complex backgrounds presents challenges of missed and inaccurate detection. 2) The fish abnormal surface feature data sets used were collected on the workbench and are therefore not suitable for abnormal fish detection in underwater scenes. 3) Previous convolutional neural network enhancements have drawbacks such as insufficient feature extraction and complex network structures, making it difficult to balance model complexity, detection speed and detection accuracy.
You only look once (YOLO) [13,14,15,16,17,18] is an advanced single-stage object detection algorithm, which can be used for, for example, small target detection in aquaculture, detection of key components of power transmission lines and detection of cigarette appearance defects. Due to its exceptional performance, it has found extensive applications in land-based recirculating aquaculture systems. Yu et al. [19] proposed a fish skin disease detection model based on the YOLOv4 model, combined with depth-separable convolution and an optimized feature extraction network and activation function. The proposed model has high learning ability and is lightweight. Compared with the baseline, its mean average precision (mAP) and detection speed are increased by 12.39% and 19.31 FPS, respectively. Wang et al. [20] proposed a diseased fish detection model based on improved YOLOv5s, using the C3 structure instead of the cross-stage partial (CSP) structure, replacing all 3 × 3 convolutions in the backbone network with a convolution kernel group composed of parallel 3 × 3, 1 × 3 and 3 × 1 kernels, and introducing the convolutional block attention module (CBAM) attention mechanism; the model achieved an average accuracy of 99.38%. Prasetyo et al. [21] enhanced the YOLOv4-tiny model for the determination of fish freshness, species classification and biomass estimation. Their approach involved the integration of novel techniques such as the wing convolutional layer (WCL) and tiny spatial pyramid pooling (Tiny-SPP) to refine and balance diverse feature representations. They effectively optimized computational resources by employing bottleneck and expansion convolution (BEC) for feature fusion. To further improve the model's detection accuracy, they introduced an additional small object detector. Zhao et al. [22] proposed a high-precision lightweight model that uses an improved YOLOv4 to detect dead fish, significantly reducing the number of model parameters and the computational load. Li et al. [23] introduced a real-time detection approach for identifying abnormal fish behaviors, which combines images of mosaic pixel points with an enhanced version of YOLOv5s, referred to as BCS-YOLOv5. Their proposed method not only improves the extraction of positional information for abnormal fish, but also enables quantitative detection of similar abnormal behaviors. Based on image fusion, BCS-YOLOv5 achieved an impressive inference accuracy of 96.69% on their dataset. The majority of the aforementioned studies have focused on enhancing YOLOv5 for specific detection tasks, resulting in notable improvements and commendable evaluation metrics.
The above-mentioned studies show that good accuracy has been achieved in detecting obvious abnormal fish surface features. However, there are certain limitations in extracting abnormal surface features for small targets and complex scenes. Therefore, this study presents an enhanced YOLOv5-based detection model designed for abnormal surface features. Several novel improvements are introduced in our method, distinguishing it from prior research, as outlined below:
● We introduce the normalized Gaussian Wasserstein distance (NWD) metric to optimize the loss function and non-maximum suppression (NMS) of YOLOv5s to enhance the model's ability to detect small targets and speed up the model's convergence speed.
● We introduce the lightweight MobileViTv2 module and the designed DenseOne module. These enhancements improve detection accuracy while reducing the model size and parameters for resource-constrained edge devices.
● According to the ACmix principle, we obtain the ODC-CBAM module by fusing omni-dimensional dynamic convolution (ODConv) and CBAM, and further integrate it into the feature extraction network, which reduces the missed detection rate and false detection rate of abnormal surface features located in complex scenes.
The rest of this article is organized as follows. Section Ⅱ describes the structure of YOLOv5s and details the proposed improvements. Section Ⅲ describes experimental data collection, data set construction and other experimental details. Section Ⅳ analyzes the experimental results. Section Ⅴ summarizes the work of this article.
After reviewing the literature, related research and interviews with relevant breeders, we identified the following challenges in discerning abnormal fish by the detection of surface features: 1) Since abnormal surface features of fish are an occasional phenomenon in aquaculture, data are scarce. Moreover, annotating the abnormal surface features requires a lot of time and resources. Therefore, it is difficult to construct a data set of abnormal fish surface features. 2) As shown in Figure 1, the abnormal surface features of longsnout catfish are clearly visible on the workbench. In-water environments differ from those on the workbench and often exhibit phenomena such as reflection, which can result in unclear fish images. Longsnout catfish in the water may also exhibit pixel blur, small apparent size due to variations in distance and serious overlap. 3) Although current convolutional neural networks exhibit good detection accuracy, they have shortcomings such as weak learning ability for abnormal surface features of small targets and slow detection speed.
Inspired by these challenges and previous work on the detection of abnormal fish surface features, this study proposes an object detection model based on improved YOLOv5s. The data set is fed into a backbone built around MobileViTv2 and ODC-CBAM, which extracts the abnormal surface features of longsnout catfish against complex backgrounds. Then, the designed DenseOne module improves the reusability of the features and reduces the overall network complexity. Finally, the NWD metric is introduced to optimize the loss function and NMS of the baseline, enhancing the model's ability to detect small targets and accelerating convergence. The method flow chart is shown in Figure 2.
YOLOv5, introduced in 2020, represents a notable enhancement over YOLOv4. YOLOv5 has five different versions: YOLOv5n, YOLOv5s, YOLOv5m, YOLOv5l and YOLOv5x. The difference between the five models is the depth and width of the network [24]. To select a suitable baseline from the five versions, we trained them on the 1280 training images of in-water abnormal fish. The training results are shown in Table 1; the detection accuracy of YOLOv5s, YOLOv5m, YOLOv5l and YOLOv5x is nearly identical. To balance detection precision and model size for edge devices in actual scenarios, YOLOv5s is selected as the baseline to detect the abnormal surface features of fish.
Models | P (%) | R (%) | mAP50 (%) | mAP50:95 (%) | Model Size (MB) | FLOPs (G) | FPS |
YOLOv5n | 0.911 | 0.917 | 0.922 | 0.574 | 3.8 | 4.1 | 101 |
YOLOv5s | 0.949 | 0.945 | 0.964 | 0.661 | 14.3 | 15.8 | 87 |
YOLOv5m | 0.949 | 0.948 | 0.962 | 0.679 | 42.1 | 47.9 | 76 |
YOLOv5l | 0.948 | 0.946 | 0.964 | 0.675 | 92.7 | 107.6 | 68 |
YOLOv5x | 0.951 | 0.944 | 0.959 | 0.688 | 173 | 203.8 | 61 |
The network structure of YOLOv5s is illustrated in Figure 3. YOLOv5s encompasses four main parts: Input, Backbone, Neck and Head. Regarding the Input, YOLOv5s retains the mosaic data augmentation technique and adaptive image scaling of YOLOv4. Furthermore, YOLOv5s integrates adaptive anchor box calculation into the program, enabling the selection of optimal anchor box values for different data sets. Compared to YOLOv4, several enhancements were introduced to the YOLOv5s Backbone, including the Focus module, the CSPDarkNet53 module and the spatial pyramid pooling fast (SPPF) module. These modules expand the network's receptive field and further enhance its feature extraction capabilities. The CSPDarkNet53 module includes the CSPNet module, the Bottleneck module and the C3 module. The Neck of YOLOv5s adopts the feature pyramid network (FPN) and path aggregation network (PAN) structures. FPN employs top-down lateral connections to extract multi-scale features and construct the feature pyramid. PAN adds a bottom-up route and strengthens the localization information carried by high-level features. FPN and PAN aggregate parameters across different layers, raising the accuracy of object detection. At the Head, YOLOv5 generates multi-scale prediction results based on the outputs of the different Neck levels. The bounding box loss function is the complete intersection over union (CIoU) loss [25], which builds upon the generalized intersection over union (GIoU) loss by considering information about the position, scale and shape of the target boxes.
YOLOv5s is one of the smaller network structures in the YOLOv5 family. It has obvious advantages in model size and detection speed compared with wider and deeper networks, but inevitably sacrifices some detection accuracy. According to the definition of small targets in the COCO data set, small targets are those with resolutions of less than 32 × 32 pixels. The data set of this study is collected from the abnormal surface features of longsnout catfish in a recirculating aquaculture laboratory. In the process of data collection, certain inherent challenges such as blurred images, complex backgrounds and small targets were identified. The performance of YOLOv5s is notably inadequate when applied to downstream tasks in this domain. As a result, several advanced methods are proposed for the vanilla YOLOv5s model, consisting of four main parts:
1) Introduce a new NWD metric [26] to replace the CIoU loss function and NMS of YOLOv5s. We model the bounding boxes as 2D Gaussian distributions and compute their similarity using the NWD between the two distributions. NWD enhances the detection ability of the model for small targets, optimizes the convergence speed of the model and reduces the occurrence of false positives (FP).
2) The MobileViTv2 [27] module is utilized to replace the 6th layer of the Backbone. This substitution elevates the feature representation ability and computing efficiency of the model.
3) Supplant the C3 module of the PAN part with the designed DenseOne module. The DenseOne module is derived from DenseNet [28] and incorporates three additional 1 × 1 convolution operations. By establishing shortcut connections to reuse features, this reduces the model size and number of parameters.
4) The ODC-CBAM module is embedded into both the Backbone and the PAN. Following the principles of ACmix [29], ODConv and CBAM are fused. The module incorporates ODConv [30], which adapts the convolution kernel dynamically according to the shape and scale of the longsnout catfish. Simultaneously, CBAM [31] assists the model in emphasizing the abnormal surface features of longsnout catfish while suppressing interference from complex backgrounds.
The improved YOLOv5s structure is shown in Figure 4.
In YOLOv5s, intersection over union (IoU) and its extensions are employed as evaluation metrics for the loss function and NMS. Nonetheless, IoU exhibits certain limitations and disadvantages in these applications, as outlined below:
1) The original model employs the CIoU loss function to compute the bounding box localization loss. This loss function extends the concept of IoU and incorporates three geometric properties: bounding box overlap area, centroid distance and aspect ratio. The calculation of the CIoU loss formulas is as follows:
$L_{CIoU} = 1 - IoU + \dfrac{\rho^2(b, b^{gt})}{c^2} + \alpha v$ | (1)
$\alpha = \dfrac{v}{(1 - IoU) + v}$ | (2)
$v = \dfrac{4}{\pi^2}\left(\arctan\dfrac{w^{gt}}{h^{gt}} - \arctan\dfrac{w}{h}\right)^2$ | (3)
where $IoU$ is the ratio of the intersection to the union of the predicted bounding box and the ground-truth bounding box, $\rho(b, b^{gt})$ represents the Euclidean distance between the centroids of the predicted and ground-truth boxes, and $c$ represents the diagonal length of the smallest enclosing box covering both. $\alpha$ is a weight factor and $v$ measures the consistency of the aspect ratios of the two boxes.
Equations (1)–(3) indicate that the CIoU loss function enhances the detection accuracy of the model by incorporating a penalty term based on the aspect ratio while regressing the predicted bounding boxes. Nevertheless, the CIoU loss function may exhibit reduced sensitivity when dealing with targets that possess extremely large or small aspect ratios. Consequently, this can lead to poor detection performance for small targets, as the loss function may not provide adequate gradients for optimizing the network in these scenarios. A minimal reference implementation of Eqs (1)–(3) is sketched after this list.
2) NMS is a widely employed post-processing technique in object detection. Its primary purpose is to suppress redundant predicted bounding boxes, ensuring that each object is associated with only the most accurate and optimal predicted bounding box. However, the selection of the IoU threshold greatly impacts the final detection result. If the threshold is set too high, there is a risk of erroneously rejecting small targets.
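For reference, the following is a minimal, self-contained implementation of the CIoU loss in Eqs (1)–(3) for axis-aligned boxes given as (cx, cy, w, h) tuples. It is an illustrative sketch rather than the exact implementation used inside YOLOv5s.

```python
import math

def ciou_loss(pred, gt, eps=1e-7):
    """CIoU loss per Eqs (1)-(3); boxes are (cx, cy, w, h) tuples."""
    px, py, pw, ph = pred
    gx, gy, gw, gh = gt
    # Corner coordinates of both boxes.
    px1, py1, px2, py2 = px - pw / 2, py - ph / 2, px + pw / 2, py + ph / 2
    gx1, gy1, gx2, gy2 = gx - gw / 2, gy - gh / 2, gx + gw / 2, gy + gh / 2
    # Intersection over union.
    iw = max(0.0, min(px2, gx2) - max(px1, gx1))
    ih = max(0.0, min(py2, gy2) - max(py1, gy1))
    inter = iw * ih
    iou = inter / (pw * ph + gw * gh - inter + eps)
    # Squared center distance over squared diagonal of the enclosing box.
    rho2 = (px - gx) ** 2 + (py - gy) ** 2
    c2 = (max(px2, gx2) - min(px1, gx1)) ** 2 + (max(py2, gy2) - min(py1, gy1)) ** 2 + eps
    # Aspect-ratio consistency term (Eq 3) and its weight (Eq 2).
    v = (4 / math.pi ** 2) * (math.atan(gw / gh) - math.atan(pw / ph)) ** 2
    alpha = v / (1 - iou + v + eps)
    return 1 - iou + rho2 / c2 + alpha * v

print(ciou_loss((10, 10, 6, 4), (11, 10, 6, 5)))  # small loss for well-aligned boxes
```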
Hence, this investigation introduces the NWD metric and incorporates it into YOLOv5s by replacing the CIoU loss function and NMS. Considering that the foreground pixels of small longsnout catfish concentrate at the center of the bounding box while background pixels concentrate near its boundary, this study models the bounding box as a two-dimensional Gaussian distribution. The highest weight is assigned to the center pixel of the bounding box, gradually decreasing towards the border. The NWD metric is then employed to assess the similarity between the distributions modeled for two bounding boxes, enabling a comprehensive evaluation of their likeness.
The object bounding box $R = (cx, cy, w, h)$ can be modeled as a two-dimensional Gaussian distribution $\mathcal{N}(\mu, \Sigma)$, whose parameters are given in Eq (4). The second-order Wasserstein distance between two such Gaussians is shown in Eq (5), and the NWD metric is shown in Eq (6).
$\mu = \begin{bmatrix} cx \\ cy \end{bmatrix}, \quad \Sigma = \begin{bmatrix} \dfrac{w^2}{4} & 0 \\ 0 & \dfrac{h^2}{4} \end{bmatrix}$ | (4)
$W_2^2(\mathcal{N}_a, \mathcal{N}_b) = \left\| \left[cx_a, cy_a, \dfrac{w_a}{2}, \dfrac{h_a}{2}\right]^{\mathrm{T}} - \left[cx_b, cy_b, \dfrac{w_b}{2}, \dfrac{h_b}{2}\right]^{\mathrm{T}} \right\|_2^2$ | (5)
$NWD(\mathcal{N}_a, \mathcal{N}_b) = \exp\left(-\dfrac{\sqrt{W_2^2(\mathcal{N}_a, \mathcal{N}_b)}}{C}\right)$ | (6)
where $\mu$ and $\Sigma$ denote the mean vector and the covariance matrix of the Gaussian distribution, and $(cx, cy)$, $w$ and $h$ denote the center coordinates, width and height of the box, respectively. $\|\cdot\|$ represents the Frobenius norm. $C$ is a constant closely related to the data set, and we set $C$ to 5 (the average absolute target size of our data set). In the detection of small target longsnout catfish, the NWD metric offers several advantages over IoU:
1) Modeling the target bounding box as a two-dimensional Gaussian distribution presents a more effective approach for capturing the continuous and variable position deviation within the bounding box. Furthermore, by assigning weights and normalizing the pixels in different regions of the bounding box, we achieve improved performance. In comparison to IoU and its extension, the NWD method offers substantial advantages in terms of scale invariance and smoothness in handling position deviations.
2) By employing the NWD to measure the similarity between the predicted bounding box and the ground truth box, we can effectively address the issue of sensitivity in CIoU to small position deviations of the target. This approach proves beneficial even when there is no overlap or containment relationship between the two bounding boxes.
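As a concrete illustration of Eqs (4)–(6), the sketch below computes the NWD similarity between two boxes; the example boxes are hypothetical, and C = 5 follows the setting described above.

```python
import math

def nwd(box_a, box_b, C=5.0):
    """Normalized Gaussian Wasserstein distance (Eqs 4-6); boxes are (cx, cy, w, h)."""
    cxa, cya, wa, ha = box_a
    cxb, cyb, wb, hb = box_b
    # Squared 2-Wasserstein distance between the two Gaussians N(mu, Sigma) (Eq 5).
    w2 = ((cxa - cxb) ** 2 + (cya - cyb) ** 2
          + ((wa - wb) / 2) ** 2 + ((ha - hb) / 2) ** 2)
    # Normalize into a (0, 1] similarity score (Eq 6).
    return math.exp(-math.sqrt(w2) / C)

# Two slightly offset 8 x 8 boxes still receive a meaningful similarity score,
# even though their IoU is close to zero.
print(nwd((10.0, 10.0, 8.0, 8.0), (16.0, 12.0, 8.0, 8.0)))
```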
The C3 module, integrated into the backbone of YOLOv5s, serves as a convolutional neural network module specifically designed for feature extraction. It employs multiple convolutional kernels with varying scales to extract a more comprehensive range of feature information, thereby enhancing the model's ability to accurately detect objects of different sizes. However, the incorporation of the C3 module expands both the depth (number of convolutional layers) and width (number of channels) of the backbone network. Consequently, the model experiences a substantial increase in computational requirements due to the presence of multiple convolutional layers, resulting in higher latency during deployment on resource-constrained edge devices.
Mehta and Rastegari [27] proposed a light-weight and mobile-friendly hybrid network called MobileViTv2. MobileViTv2 replaces the multi-head self-attention (MHA) mechanism utilized in MobileViTv1 [32] with a separable attention method. MobileViTv2 initially applies depth-wise separable convolution and a 1 × 1 convolutional layer to process the input feature map, facilitating the extraction of local information. It then employs a transformer module with separable attention to extract global information. The separable attention method computes the context scores of the input tokens with respect to a latent token L, reweights the input tokens with these scores and aggregates them into global information. The transformer module with separable self-attention is implemented with element-wise operations, which reduces the computational complexity. Lastly, the module incorporates a 1 × 1 convolutional layer to integrate local information, perform dimensional transformation and merge features. The utilization of depth-wise separable convolutions allows for efficient capture of spatial information within the input feature maps while maintaining computational efficiency. Therefore, this study utilizes the MobileViTv2 module to replace the C3 module in the sixth layer of the original model. This substitution aims to enhance the model's reasoning speed and alleviate the computational complexity resulting from feature extraction; refer to Figure 5 for details.
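To make the separable attention idea more concrete, the following simplified sketch follows the published description (one context score per token, a pooled context vector, element-wise reweighting of the values); it is not the exact MobileViTv2 implementation, and the projection layout is an assumption.

```python
import torch
import torch.nn as nn

class SeparableSelfAttention(nn.Module):
    """Separable self-attention in the spirit of MobileViTv2 (illustrative sketch).

    Instead of the O(k^2) token-to-token attention map of multi-head attention,
    one context score per token is computed, the keys are pooled into a single
    context vector, and the values are reweighted element-wise -- O(k) cost.
    """
    def __init__(self, dim):
        super().__init__()
        self.to_scores = nn.Linear(dim, 1)   # I branch: one score per token
        self.to_key = nn.Linear(dim, dim)    # K branch
        self.to_value = nn.Linear(dim, dim)  # V branch
        self.proj = nn.Linear(dim, dim)

    def forward(self, x):                    # x: (B, k tokens, dim)
        scores = torch.softmax(self.to_scores(x), dim=1)               # (B, k, 1)
        context = (scores * self.to_key(x)).sum(dim=1, keepdim=True)   # (B, 1, dim)
        out = torch.relu(self.to_value(x)) * context                   # broadcast over tokens
        return self.proj(out)
```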
Traditional convolutional networks with L layers have L connections (one connection between each layer and subsequent layer). DenseNet contains shortcut connections between input layers and output layers. DenseNet has L(L+1)/2 direct connections. The output of traditional convolutional networks at the Lth layer is shown in Eq (7). The output of DenseNet at the Lth layer is shown in Eq (8).
$x_L = H_L(x_{L-1})$ | (7)
$x_L = H_L([x_0, x_1, \cdots, x_{L-1}])$ | (8)
where $H_L(\cdot)$ is a non-linear transformation function and $x_L$ is the output of the $L$th layer of the network.
DenseNet enhances feature maps propagation by short connections, establishing direct connections between each layer and subsequent layers. This approach effectively improves the model's detection capability by encouraging the reuse of feature maps. Additionally, the input feature maps undergo processing through the transition layer, which includes a batch normalization layer, a 1 × 1 convolution layer and a 2 × 2 average pooling layer. Furthermore, a 1 × 1 convolution is applied in the bottleneck layer to reduce the dimensionality of the input feature map, resulting in a significant decrease in the number of parameters. Alongside improved parameter efficiency, DenseNet offers several advantages, including enhanced information flow and gradient propagation throughout the entire network. Moreover, it serves as a regularization technique to address overfitting problems in downstream tasks, particularly when dealing with data sets that have limited samples. Figure 6(a) illustrates the details of the DenseNet.
We designed DenseOne based on CSPNet [33] and DenseNet. First, feature extraction is performed on the input feature map through two 1 × 1 convolutions, which increases the number of gradient paths in the network. Owing to the CSP strategy, the drawbacks caused by explicit feature map copies for concatenation are alleviated. To balance the computation of the dense branch, the dimensionally reduced feature map fed to the subsequent DenseB module has half the channels of the original feature map, so the computational bottleneck is reduced by nearly half. Then, the features of the first branch are reused in DenseB so that each layer in the network shares the global information in the feature map. In addition, a concat operation merges the channels of the feature maps from the two branches. Finally, a 1 × 1 convolution recombines the connected features. Compared with DenseNet, DenseOne not only increases the gradient paths and reduces the computational cost, but also provides stronger feature expression capability. The structure of DenseOne is shown in Figure 6(b), and a minimal sketch of this kind of module follows.
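The sketch below shows one way such a CSP-style dense module can be organized in PyTorch. The growth rate, number of dense layers and activation are illustrative assumptions, not the exact DenseOne configuration.

```python
import torch
import torch.nn as nn

def conv_bn_act(c_in, c_out, k=1):
    return nn.Sequential(
        nn.Conv2d(c_in, c_out, k, padding=k // 2, bias=False),
        nn.BatchNorm2d(c_out), nn.SiLU(inplace=True))

class DenseB(nn.Module):
    """Small dense block: every layer sees the concatenation of all earlier outputs."""
    def __init__(self, c_in, growth=32, n_layers=2):
        super().__init__()
        self.layers = nn.ModuleList(
            conv_bn_act(c_in + i * growth, growth, k=3) for i in range(n_layers))
        self.c_out = c_in + n_layers * growth

    def forward(self, x):
        feats = [x]
        for layer in self.layers:
            feats.append(layer(torch.cat(feats, dim=1)))
        return torch.cat(feats, dim=1)

class DenseOneSketch(nn.Module):
    """CSP-style wrapper: split the input with two 1x1 convs, densely connect one
    branch, concatenate both branches, then fuse with a final 1x1 convolution."""
    def __init__(self, c_in, c_out, growth=32, n_layers=2):
        super().__init__()
        c_half = c_in // 2
        self.branch_a = conv_bn_act(c_in, c_half, k=1)   # goes through DenseB
        self.branch_b = conv_bn_act(c_in, c_half, k=1)   # shortcut branch
        self.dense = DenseB(c_half, growth, n_layers)
        self.fuse = conv_bn_act(self.dense.c_out + c_half, c_out, k=1)

    def forward(self, x):
        return self.fuse(torch.cat([self.dense(self.branch_a(x)), self.branch_b(x)], dim=1))
```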
Convolution and self-attention could enable the model to make more precise predictions, and they are usually considered as two peer approaches that are distinct from each other. ACmix is a mixed feature extractor that enjoys the benefit of both self-attention and convolution. For a detailed representation of the ACmix approach, refer to Figure 7.
ACmix can be divided into two stages. At stage Ⅰ, ACmix applies 1 × 1 convolution operations to the input feature map, obtaining a rich set of intermediate features containing 3 × C feature maps (H × W × C → H × W × 3C, where C stands for the number of channels and H × W stands for the feature size). At stage Ⅱ, the intermediate feature maps are used by a convolution path and a self-attention path. For a convolution kernel of size k, the convolution path first utilizes a fully-connected (FC) layer to project the channels so that their number equals the number of shift directions, and features are then generated via shifting and aggregation. In the self-attention path, the intermediate features obtained in stage Ⅰ are split into queries, keys and values, following the traditional multi-headed self-attention module. Finally, the outputs from the convolution path and self-attention path are added together, with their relative strengths controlled by two learnable weights.
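A minimal sketch of this final aggregation step is given below; the shared 1 × 1 projections of stage Ⅰ and the two paths themselves are omitted, and only the learnable weighting of the two path outputs is shown.

```python
import torch
import torch.nn as nn

class PathFusion(nn.Module):
    """ACmix-style stage-II aggregation: combine the convolution-path output and
    the self-attention-path output with two learnable scalar weights."""
    def __init__(self):
        super().__init__()
        self.alpha = nn.Parameter(torch.ones(1))  # strength of the convolution path
        self.beta = nn.Parameter(torch.ones(1))   # strength of the self-attention path

    def forward(self, conv_out, attn_out):
        return self.alpha * conv_out + self.beta * attn_out
```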
ACmix introduces 1 × 1 convolutions for the weight-mapping part of both the convolution and self-attention mechanisms to achieve correlation between the two at the underlying level. ACmix integrates the respective characteristics of convolution operations and self-attention while reducing computational overhead. However, the convolution in ACmix ignores the spatial and channel information of the convolution kernel, making it difficult for the model to accurately fit features. Moreover, self-attention causes the model to converge slowly and cannot quickly locate the regions of useful features. Therefore, we employed the fusion of ODConv and CBAM to enhance the network's ability to learn deep features within complex factory farming environments.
The Dynamic Convolution achieves attention-based dynamic weighting of M parallel convolution kernels. The parallelism of M kernels not only maintains the network's width and depth, but also enhances its representation capability. However, existing dynamic convolutions overlook the other three dimensions of the convolutional kernel space: the size of each kernel's spatial dimension, the input channel number and the output channel number. To enable the model to learn more complex features, ODConv utilizes a novel multi-dimensional attention mechanism and parallel strategy to learn complementary attention for convolution kernels across all four dimensions of the kernel space.
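The following simplified, self-contained sketch illustrates this idea: four attention heads modulate M parallel kernels along the spatial, input-channel, output-channel and kernel dimensions, and the aggregated sample-specific kernel is applied with a grouped convolution. The reduction ratio, initialization and head design are assumptions rather than the exact ODConv implementation.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class SimpleODConv(nn.Module):
    """Simplified omni-dimensional dynamic convolution (illustrative only)."""
    def __init__(self, c_in, c_out, k=3, num_kernels=4, reduction=4):
        super().__init__()
        self.c_in, self.c_out, self.k, self.M = c_in, c_out, k, num_kernels
        hidden = max(c_in // reduction, 4)
        self.gap = nn.AdaptiveAvgPool2d(1)
        self.fc = nn.Sequential(nn.Linear(c_in, hidden), nn.ReLU(inplace=True))
        # One attention head per kernel-space dimension.
        self.attn_spatial = nn.Linear(hidden, k * k)
        self.attn_in = nn.Linear(hidden, c_in)
        self.attn_out = nn.Linear(hidden, c_out)
        self.attn_kernel = nn.Linear(hidden, num_kernels)
        # M parallel candidate kernels.
        self.weight = nn.Parameter(torch.randn(num_kernels, c_out, c_in, k, k) * 0.02)

    def forward(self, x):
        b, _, h, w = x.shape
        ctx = self.fc(self.gap(x).flatten(1))                                   # (B, hidden)
        a_s = torch.sigmoid(self.attn_spatial(ctx)).view(b, 1, 1, 1, self.k, self.k)
        a_i = torch.sigmoid(self.attn_in(ctx)).view(b, 1, 1, self.c_in, 1, 1)
        a_o = torch.sigmoid(self.attn_out(ctx)).view(b, 1, self.c_out, 1, 1, 1)
        a_k = torch.softmax(self.attn_kernel(ctx), dim=1).view(b, self.M, 1, 1, 1, 1)
        # Aggregate the M candidate kernels into one sample-specific kernel.
        w_dyn = (a_k * a_s * a_i * a_o * self.weight.unsqueeze(0)).sum(dim=1)   # (B, Cout, Cin, k, k)
        # Grouped-convolution trick: fold the batch into the channel dimension.
        x = x.reshape(1, b * self.c_in, h, w)
        w_dyn = w_dyn.reshape(b * self.c_out, self.c_in, self.k, self.k)
        y = F.conv2d(x, w_dyn, padding=self.k // 2, groups=b)
        return y.reshape(b, self.c_out, h, w)
```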
The CBAM is a lightweight attention module that finds extensive usage in various convolutional neural networks. It comprises two components: The channel attention module (CAM) [34] and the spatial attention module (SAM) [35]. The CAM gathers valuable spatial information from feature maps through average pooling and maximum pooling operations. It produces average pooling and maximum pooling features, which are then fed into a multi-layer perceptron (MLP) to generate key features consisting of multiple perceptual layers. Ultimately, channel attention maps are obtained. On the other hand, the SAM generates 2D spatial attention maps by applying average pooling and max pooling operations across channels. By leveraging the channel attention sub-module and the spatial self-attention module, CBAM dynamically learns and adjusts the weight distribution of channels and spatial dimensions in the feature map. This adaptive learning enhances the network's ability to express distinctive features effectively.
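A compact sketch of CBAM as described above is given below; the reduction ratio (16) and the spatial kernel size (7) follow commonly used defaults and are assumptions here.

```python
import torch
import torch.nn as nn

class ChannelAttention(nn.Module):
    def __init__(self, channels, reduction=16):
        super().__init__()
        self.mlp = nn.Sequential(
            nn.Linear(channels, channels // reduction), nn.ReLU(inplace=True),
            nn.Linear(channels // reduction, channels))

    def forward(self, x):
        avg = self.mlp(x.mean(dim=(2, 3)))   # average-pooled channel descriptor
        mx = self.mlp(x.amax(dim=(2, 3)))    # max-pooled channel descriptor
        return torch.sigmoid(avg + mx)[:, :, None, None]

class SpatialAttention(nn.Module):
    def __init__(self, kernel_size=7):
        super().__init__()
        self.conv = nn.Conv2d(2, 1, kernel_size, padding=kernel_size // 2)

    def forward(self, x):
        avg = x.mean(dim=1, keepdim=True)
        mx = x.amax(dim=1, keepdim=True)
        return torch.sigmoid(self.conv(torch.cat([avg, mx], dim=1)))

class CBAM(nn.Module):
    """Channel attention followed by spatial attention, applied sequentially."""
    def __init__(self, channels, reduction=16, kernel_size=7):
        super().__init__()
        self.ca = ChannelAttention(channels, reduction)
        self.sa = SpatialAttention(kernel_size)

    def forward(self, x):
        x = x * self.ca(x)
        return x * self.sa(x)
```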
While ODConv and the CBAM attention module are often regarded as distinct paradigms, it has been demonstrated that the extensive calculations involved in both paradigms are essentially accomplished through the same operations. An ODConv with a kernel size of k × k can be divided into two stages: the first stage, where the ODConv is decomposed into k² individual 1 × 1 convolutions, and the second stage, which performs shifting and summation operations. Similarly, the CBAM attention module is also accomplished in two stages: the first stage projects the queries, keys and values of the attention module with different 1 × 1 convolution kernels, and the second stage calculates and aggregates the attention weights. Consequently, the first stages of both paradigms involve similar computational operations.
Therefore, there is a possibility of fusing the two paradigms: the ODConv module and the CBAM attention module. In this study, the fusion weight is 0.5 for both the ODConv and CBAM attention modules. The ODC-CBAM module is less computationally complex than pure convolution or a pure attention mechanism, and it can obtain better performance than either paradigm alone. Therefore, we embedded the ODC-CBAM module in front of the output of YOLOv5s to improve the model's ability to identify difficult samples and enhance the detection accuracy. In this study, the ODC-CBAM module was used to replace conventional convolution modules in the original model, and the results are shown in Table 2. The best evaluation metrics were achieved with Replacement 4 (ODC-CBAM replacing layer 7 of the Backbone and the regular convolution in front of the Head).
Models | P (%) | R (%) | mAP50 (%) | mAP50:95 (%) | Model Size (MB)
Replacement1 | 0.986 | 0.984 | 0.993 | 0.73 | 15.1 |
Replacement2 | 0.982 | 0.986 | 0.991 | 0.733 | 14.7 |
Replacement3 | 0.977 | 0.995 | 0.989 | 0.721 | 14.5 |
Replacement4 | 0.989 | 0.990 | 0.993 | 0.741 | 14.2 |
Note: Replacement 1 is ODC-CBAM replacing the regular convolution of layers 1, 3, 5 and 7 of the Backbone and the front of the Head. Replacement 2 is ODC-CBAM replacing the regular convolution of layers 3, 5 and 7 of the Backbone and the front of the Head. Replacement 3 is ODC-CBAM replacing the regular convolution of layers 5 and 7 of the Backbone and the front of the Head. Replacement 4 is ODC-CBAM replacing layer 7 of the Backbone and the regular convolution in front of the Head. |
The experimental data were collected from December 15th to December 22nd, 2022, at the Genetic Breeding Center for Longsnout Catfish of the Ministry of Agriculture and Rural Affairs in Pudong New Area, Shanghai. The experimental fish used in this study are diseased longsnout catfish, provided by the College of Fisheries and Life Science at Shanghai Ocean University. The experimental fish comprise 20 individuals with a weight range of 50–100 g and a body length of 10–15 cm. The water temperature in the aquaculture environment is maintained at (25 ± 1) ℃, with a dissolved oxygen level of (5 ± 0.3) mg/L and a pH value of (7 ± 0.5). When the disease occurs, only a few white spots appear on the surface of the longsnout catfish in the early stages; over time, the fish develop large areas of abnormal surface features. A fish image acquisition system was developed to obtain raw data more efficiently, capturing in-water images. As shown in Figure 8, this system consists of a BARLUS camera (S97K8F-8D6X10), a support bracket and a circular fish-rearing tank. The rearing tank has a radius of 76 cm, a height of 85.5 cm and a water depth of 40 cm. The BARLUS camera captures 24-bit RGB true-color images with a resolution of 3840 (pixels) × 2160 (pixels) and a frame rate of 60 FPS.
The College of Fisheries and Life Science at Shanghai Ocean University manually screened the videos, resulting in 38 segments containing abnormal surface features of longsnout catfish. The average duration of each segment is approximately 9 seconds. The initial step of this study involves reading each frame from the 38 collected video segments. Then, one frame is extracted every five frames, yielding 4104 images of abnormal longsnout catfish. We employ the structural similarity index (SSIM) algorithm [36] to further screen the original dataset obtained from the video streams. This screening aims to eliminate redundant and noisy images. The SSIM algorithm assesses the similarity of a pair of images based on three main image features: luminance, contrast and structure. It computes a comprehensive SSIM index by weighting and summing these similarity measurements. One of the advantages of SSIM is its consideration of structural information in images, making it robust against lighting variations, noise and distortions. The computation formula for SSIM is shown in Eq (9).
$SSIM(X,Y) = \dfrac{(2\mu_X\mu_Y + C_1)(2\sigma_{XY} + C_2)}{(\mu_X^2 + \mu_Y^2 + C_1)(\sigma_X^2 + \sigma_Y^2 + C_2)}$ | (9)
where $SSIM(X,Y)$ is a metric used to measure the similarity between images $X$ and $Y$. $\mu_X$ and $\mu_Y$ denote the mean values of images $X$ and $Y$, respectively, and their standard deviations are represented by $\sigma_X$ and $\sigma_Y$. The covariance of $X$ and $Y$ is represented by $\sigma_{XY}$. $C_1$ and $C_2$ are small constants that can be set as needed to stabilize the division.
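A minimal sketch of this screening step is shown below, using OpenCV and scikit-image's structural_similarity; the 0.9 similarity threshold is an assumed value, since the exact cutoff used for discarding redundant frames is not stated.

```python
import cv2
from skimage.metrics import structural_similarity

def screen_frames(frame_paths, threshold=0.9):
    """Keep a frame only if it is sufficiently dissimilar from the last kept frame."""
    kept, last_gray = [], None
    for path in frame_paths:
        gray = cv2.cvtColor(cv2.imread(path), cv2.COLOR_BGR2GRAY)
        if last_gray is None or structural_similarity(gray, last_gray) < threshold:
            kept.append(path)
            last_gray = gray
    return kept
```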
After applying the SSIM algorithm for screening, the final dataset consists of 1600 in-water surface images of longsnout catfish extracted from the video streams. These 1600 images were manually annotated using LabelImg, following the principle of annotating the entire fish. The annotation process generated XML files containing coordinate information, image size, a total of 27,200 annotated bounding boxes and the label name (disease). These XML files were saved in the VOC2007 dataset format, creating the abnormal longsnout catfish in-water dataset used in this experiment.
To split the dataset for training, validation and testing, we followed an 8:1:1 ratio and ensured that the training set and test set do not come from the same video sequence. Thus, the dataset was divided into 1280 images for the training set and 160 images each for the validation and testing sets. Some sample images from this study's dataset are shown in Figure 9. As observed from Figure 9, the abnormal longsnout catfish in-water dataset contains challenging samples that pose difficulties in detecting abnormal fish. The specific challenges are as follows:
1) Pixel blur: The fast swimming speed of longsnout catfish poses a challenge of accurate target capture.
2) Serious overlap: Due to the biological characteristic of longsnout catfish liking to gather in groups, the targets being tested in the collected images are heavily overlapped.
3) Similarity between features: In the early stages, the features of abnormal longsnout catfish are similar to those of healthy longsnout catfish, making it necessary for the model to have a strong feature learning ability.
4) Small target: Most abnormal longsnout catfish are young fish that are far away from the image collection system, making them small or tiny targets with few pixels and insufficient features. Therefore, the model's ability to detect small targets needs improvement.
5) Light attenuation: When longsnout catfish are disturbed, the mucous cells on their surface secrete a large amount of acidic mucus, causing the aquaculture water to become a gel-like substance, which leads to a certain attenuation of the light reflected to the camera. This makes it more difficult to identify surface features.
6) External interference: There are uncontrollable factors, such as the operation of motors in the circulating water aquaculture laboratory where longsnout catfish are raised, causing slight water surface fluctuations in the aquaculture tanks. As a result, the collected images may contain phenomena such as reflections and inverted images.
In this paper, the improved model is experimented on a deep learning server with the configuration shown in Table 3.
Configuration | Parameter |
CPU | Intel(R) Xeon(R) W-2223
GPU | Nvidia GeForce RTX 2080ti |
Operating system | Windows10 |
Accelerated environment | CUDA11.7 and Cudnn8.0.5 |
Interpreter setting | Python3.8 and torch1.13.1 |
Some of the training hyperparameters of the improved model are as follows: the input image size is 640 × 640; the optimizer is SGD with weight decay and a momentum of 0.937; and a warm-up strategy, learning rate decay, L2 regularization and data preprocessing techniques are used during training. The maximum learning rate is 0.01 and gradually decreases. The batch size is 16 to reduce the computing pressure, and training runs for a total of 500 epochs. The training hyperparameters are shown in Table 4.
Parameter | Value |
Image size | 640 × 640 |
Optimizer | SGD |
Learning rate | 0.01 |
Momentum | 0.937 |
Epoch | 500 |
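The settings in Table 4 correspond roughly to the following optimizer and schedule setup in PyTorch; the weight-decay value and the cosine decay shape are assumptions, since they are not stated explicitly.

```python
import torch

def build_optimizer(model, epochs=500, lr0=0.01, momentum=0.937, weight_decay=5e-4):
    """SGD with momentum 0.937 and L2 regularization, lr decaying from 0.01 over 500 epochs."""
    optimizer = torch.optim.SGD(model.parameters(), lr=lr0,
                                momentum=momentum, weight_decay=weight_decay)
    # Learning rate gradually decreases from lr0 over the training run.
    scheduler = torch.optim.lr_scheduler.CosineAnnealingLR(optimizer, T_max=epochs)
    return optimizer, scheduler
```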
The aim of this study is to develop an abnormal longsnout catfish surface feature detection model that balances both detection accuracy and speed. Mean average precision (mAP) is a commonly used evaluation metric in object detection models. It is calculated based on the Precision-Recall (PR) curve, which is composed of precision and recall [37]. mAP50 and mAP50:95 can comprehensively evaluate the model's ability to detect targets of different sizes and shapes and more objectively reflect the accuracy of the model. Correspondingly, four indexes are used to evaluate the accuracy of the model: precision, recall, mAP50, and mAP50:95.
FPS is the number of detected frames per second, and an FPS of 30 is sufficient for real-time detection. For practical applications in the field of aquaculture, real-time detection of abnormal fish is very important. FPS, as one of the performance evaluation indicators, can show the advantage of the model in processing speed. The formulae for precision (P), recall (R), mAP50 and mAP50:95 are shown as:
$P = \dfrac{TP}{TP + FP}$ | (10)
$R = \dfrac{TP}{TP + FN}$ | (11)
$AP = \int_0^1 P(R)\,dR$ | (12)
$mAP = \dfrac{1}{k}\sum_{i=1}^{k} AP_i$ | (13)
where TP is the number of true positive samples; FP is the number of false positive samples; FN is the number of false negative samples; AP is the average precision of a category; and k is the number of categories. The difference between mAP50 and mAP50:95: mAP50 refers to the average AP at an IoU threshold of 0.5, and mAP50:95 refers to the average AP over a range of IoU thresholds, typically from 0.5 to 0.95, in steps of 0.05.
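For reference, the small sketch below shows how these quantities can be computed from detection counts; AP is approximated by trapezoidal integration of a precision-recall curve, and the example numbers are hypothetical.

```python
def precision_recall(tp, fp, fn):
    """Precision and recall from detection counts (Eqs 10 and 11)."""
    p = tp / (tp + fp) if (tp + fp) else 0.0
    r = tp / (tp + fn) if (tp + fn) else 0.0
    return p, r

def average_precision(recalls, precisions):
    """Approximate AP (Eq 12) by trapezoidal integration of P(R); recalls ascending."""
    ap = 0.0
    for i in range(1, len(recalls)):
        ap += (recalls[i] - recalls[i - 1]) * (precisions[i] + precisions[i - 1]) / 2
    return ap

print(precision_recall(tp=95, fp=5, fn=3))
print(average_precision([0.0, 0.5, 1.0], [1.0, 0.9, 0.7]))
```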
This study's training results are shown in Figure 10. From Figure 10(a), it can be seen that the precision of the model rose rapidly to 95.53% within the initial 70 training epochs. Subsequently, the precision reached a stable level of approximately 99.5% as the training progressed. Examining Figure 10(b), it can be observed that the recall of the model demonstrated a swift increase to 0.976 within the first 50 epochs of training. With further training, the model's recall stabilized at around 99.3%. Figure 10(c) displays a notable trend in the loss value, wherein a significant decrease occurred within the initial 50 epochs of training, followed by a stable pattern after 300 epochs. The training results in Figure 10 demonstrate that the enhanced model performs well in detecting abnormal surface features of longsnout catfish. The decreasing loss indicates that the model has reached a state of convergence. According to the timestamp function, our model takes about 80 seconds per epoch, and the training time for 500 epochs is about 11.1 hours, which allows us to optimize the training time of the model based on the specific needs of actual applications. This time frame may vary depending on the complexity of the model, the size of the data set and the computing resources used.
To assess the overall performance of the enhanced model, this study conducts specific ablation experiments on each component of the improvement and analyzes their respective effects. It is crucial to ensure that the ablation experiments are conducted using the same data set and hyperparameters. The training results are presented in Table 5. From the table, it can be observed that the model's performance can be improved by employing the NWD metric, DenseOne module, ODC-CBAM module and MobileViTv2 module individually. Notably, the ODC-CBAM module exhibits the most significant impact, surpassing the baseline mAP50 and mAP50:95 by 2.1 and 3.2%, respectively, with minimal increase in parameters. This can be attributed to the ODC-CBAM module's integration of ODConv and CBAM modules based on the ACmix principle, which leverages the strengths of both modules. As a result, it can effectively learn useful features from complex backgrounds and suppress irrelevant background features, thereby enhancing the model's capability to represent features, especially for challenging samples. The DenseOne module achieves parameter reduction while improving the detection accuracy of the model. Compared to the baseline (Model 1), our proposed model (Model 8) achieves a maximum increase of 2.9% in mAP50 on the abnormal longsnout catfish dataset, with a remarkable growth of 12.25% in mAP50:95. However, it is important to note that the FPS significantly decreases after reaching the highest mAP50:95 value. In the detection of abnormal longsnout catfish, faster reasoning speed facilitates timely identification of affected specimens, reducing unnecessary economic losses and environmental pollution. MobileViTv2 utilizes depth-separable convolution, separable self-attention and element-wise operation to improve the inference speed of the model. Notably, our method maintains nearly the same detection speed as the baseline while comprehensively enhancing the detection accuracy of abnormal longsnout catfish, albeit with a slight increase in parameters (parameters are usually included as part of the memory access and do not affect the inference speed of the model), model size and GFLOPs.
Models | 1 | 2 | 3 | 4 | Parameters | mAP50 (%) | mAP50:95 (%) | Models Size (MB) | FLOPs (G) | FPS |
1 | × | × | × | × | 7022326 | 0.964 | 0.661 | 14.3 | 15.8 | 87 |
2 | √ | × | × | × | 7022326 | 0.987 | 0.676 | 14.0 | 14.9 | 85 |
3 | × | √ | × | × | 6965935 | 0.985 | 0.680 | 14.3 | 15.4 | 67 |
4 | × | × | √ | × | 7074005 | 0.987 | 0.693 | 14.5 | 15.1 | 41 |
5 | × | × | × | √ | 7361878 | 0.980 | 0.685 | 15.1 | 16.9 | 77 |
6 | √ | √ | × | × | 6965935 | 0.992 | 0.707 | 14.3 | 15.3 | 66 |
7 | √ | √ | √ | × | 7021301 | 0.993 | 0.742 | 15.0 | 16.6 | 39 |
8 | √ | √ | √ | √ | 7369077 | 0.993 | 0.741 | 15.2 | 16.2 | 88 |
Note: "1" represents NWD improvement. "2" represents DenseOne improvement. "3" represents ODC-CBAM improvement. "4" represents MobileViTv2 improvement. "×" representatives do not introduce this improvement strategy. "√" representatives introduce this improvement strategy. |
To verify the detection performance of the enhanced model for abnormal longsnout catfish, the predetermined test set was input to both the pre-improvement and post-improvement models. The visualization of the model detection results, as depicted in Figure 11, provides insights into the comparison. Observing Figure 11(a) and (b), the baseline model is prone to missed detection. In contrast, the improved model tackles this issue by substituting the original model's NMS and CIoU loss function with the NWD metric. By employing two-dimensional Gaussian modeling of the target bounding box and utilizing the normalized Wasserstein distance to calculate similarity, the NWD metric effectively eliminates the sensitivity of IoU and its extensions to small deviations in object position. Figure 11(c) and (d) visually demonstrate the superiority of the improved model over YOLOv5s. This notable improvement can be attributed to the challenges posed by uncontrollable factors such as pixel blur resulting from the high-speed motion of the longsnout catfish and complex backgrounds involving lighting and reflection. Introducing the ODC-CBAM in the backbone network and at the front end of the Head allows for the extraction of valuable abnormal surface features from complex backgrounds and suppresses the interference of useless features, such as the background. Moreover, replacing the C3 module in the PAN with DenseOne enhances feature reuse, facilitates feature propagation and augments feature expression capability. Consequently, our proposed method not only enhances the detection confidence score for abnormal longsnout catfish, but also effectively mitigates false detection issues.
In order to better evaluate the feasibility of the improved model in the field of aquaculture, we performed a statistical analysis of the validation results. The confidence threshold (Conf_thres) is 0.001, the IoU threshold (IoU_thres) is 0.6 and the batch size is 32. The results are shown in Table 6. It is worth noting that the improved model's precision, recall, mAP50 and mAP50:95 have increased by 1.4, 1.2, 3.2 and 8.2%, respectively. Our method has an extremely low missed detection rate and false detection rate compared with the baseline. Our inference speed increased by 1 FPS compared with the baseline, while the model weight size only increased by 0.9 MB. The proposed model is suitable for the detection of fish with abnormal surface features in real aquaculture and is easy to deploy to edge devices and web interfaces.
Models | P (%) | R (%) | mAP50 (%) | mAP50:95 (%) | Model Size (MB) | FPS |
YOLOv5s | 98.1 | 97.9 | 95.9 | 65.7 | 14.3 | 87 |
improved YOLOv5s | 99.5 | 99.1 | 99.1 | 73.9 | 15.2 | 88 |
As can be seen from Figure 12, the improved model still produces detection errors when facing extremely small targets. This is because the IoU between the predicted box and the real box falls below the threshold set by the model, which leads to missed detection. In addition, our model is prone to false positives when faced with severe occlusion between targets.
Gradient-weighted class activation mapping (Grad-CAM) [38] is a technique that enhances model interpretability by visualizing the input regions crucial for predictions, providing visual explanations without requiring architectural modifications or retraining. In this study, two random images of abnormal longsnout catfish from the test set were selected to generate visualized heat maps using the Grad-CAM method for the YOLOv5s before and after the enhancement. The results are presented in Figure 13 (In the color spectrum, regions closer to blue indicate a lower proportion of features, while redder regions denote a higher proportion. A higher feature proportion implies greater importance in detecting abnormal longsnout catfish.). Figure 13 reveals that the red areas in the original model mainly correspond to the healthy parts and the background of the abnormal fish, whereas the improved model precisely identifies the abnormal surface features of the longsnout catfish.
The introduction of the ODC-CBAM in the backbone allowed for the extraction of important features related to the abnormal surface features of longsnout catfish in terms of the convolutional kernel space, input or output channels and more, inhibiting the learning of background and healthy part features. Additionally, the integration of MobileViTv2 into the Backbone facilitated the extraction and integration of local and global information from the features of the abnormal longsnout catfish, resulting in more comprehensive feature extraction. In the PAN part, the C3 module was replaced with the DenseOne module, enhancing feature reuse through short connections and enabling the model to learn a more complete information flow. As a result, the improved model exhibits improved accuracy in detecting abnormal longsnout catfish and mitigates the interference caused by complex backgrounds and other uncontrollable factors.
To validate the superior performance of our proposed method on the abnormal longsnout catfish dataset, we conducted comparative experiments with the state-of-the-art methods, including mainstream one-stage object detection methods: YOLOv4, YOLOv5, YOLOv7, YOLOv8, SSD and the mainstream two-stage object detection algorithm Faster R-CNN. These comparative experiments were meticulously carried out under identical hardware environments, datasets, hyperparameters and training epochs. The results of the comparison are presented in Table 7, while Figure 14 illustrates the comparison of the PR curves of the seven algorithms.
Models | Parameters (M) | mAP50 (%) | mAP50:95 (%) | Model Size (MB) | FLOPs (G) | FPS
YOLOv4 | 52.49 | 0.776 | 0.571 | 105.4 | 118.9 | 16 |
YOLOv7 | 37.19 | 0.914 | 0.636 | 291.4 | 103.2 | 34 |
YOLOv8 | 11.13 | 0.921 | 0.642 | 22.5 | 28.4 | 43 |
YOLOv5s | 7.02 | 0.964 | 0.661 | 14.3 | 15.8 | 87 |
Ours | 7.36 | 0.993 | 0.741 | 15.2 | 16.4 | 88 |
SSD | 23.61 | 0.770 | 0.556 | 90.6 | 273.74 | 12 |
Faster R-CNN | 28.05 | 0.881 | 0.597 | 108.0 | 947.28 | 8 |
The area enclosed by the PR curve reflects the algorithm's performance. From Figure 14 and Table 7, it is evident that our proposed method surpasses the other models in the downstream task of detecting abnormal surface features of longsnout catfish. First, Table 7 reveals that the evaluation metrics of YOLOv4 and SSD are not strong. Compared with the one-stage object detection methods SSD and YOLOv4, Faster R-CNN shows a certain increase in mAP50 and mAP50:95, but its detection speed is poor; because one-stage object detection algorithms pursue faster detection speed, their detection accuracy tends to be inferior to that of two-stage algorithms. YOLOv7 and YOLOv8 fail to exhibit superior performance compared to the baseline, and their large numbers of parameters, model sizes and GFLOPs make them unsuitable for deployment on resource-constrained IoT devices. Notably, the parameters, model size and FLOPs of our proposed model are 7.36 M, 15.2 MB and 16.4 G, respectively, ranking second only to the baseline. This proves that our method does not require expensive hardware support. In the comparative experiments, our method outperforms the other algorithms in terms of detection accuracy, with an impressive mAP50 reaching 99.3%. Finally, in terms of FPS, the improved algorithm achieves a remarkable 88 FPS, meeting the real-time detection needs of factory farming and outperforming the baseline by 1 FPS, thus surpassing the other algorithms in detection speed.
We visually compare the proposed method with the six mainstream target detection models mentioned above, as shown in Figure 15. As can be seen from Figure 15, YOLOv8 and YOLOv7 have good detection capabilities for individuals with abnormal surface features. However, there is one false positive, two missed detections and one false detection in the detection of aggregated fish with abnormal surface features. This shows that the ability of these two models to distinguish background interference is weak, and they cannot accurately allocate prediction boxes to the aggregated abnormal surface features of longsnout catfish. As a two-stage target detection algorithm, Faster R-CNN has higher detection accuracy than the one-stage algorithms SSD and YOLOv4, but these three have poor recognition capabilities for fish with abnormal surface features in complex scenes. Our proposed method not only leads the other models in confidence scores and has extremely low missed and false detection rates, but it also achieves higher precision and recall for dense small targets in low-light environments and environments with background distractors.
This study proposes an improved YOLOv5s target detection model for the automatic monitoring of abnormal surface features of fish. Compared with previous manual detection methods, our model is not affected by factors such as emotion, fatigue or subjectivity. It avoids the impact of individual differences or supervisor bias on detection results, can process large amounts of data in a shorter time and provides more consistent detection results. In the model, we introduce a notable enhancement by substituting the CIoU loss function and NMS with the NWD metric. This improvement enhances the model's ability to detect small targets and speeds up the model's convergence. The MobileViTv2 module is added to the Backbone to improve the feature representation ability and computing efficiency of the model. In addition, we design the DenseOne module to improve detection accuracy while reducing the model size and parameters for edge devices. Based on the above improvements, the ODC-CBAM modules are integrated into the Backbone and the PAN part of the network, which reduces the missed detection rate and false detection rate for abnormal surface features located in complex scenes. The improved model was evaluated on the validation set, with a precision of 99.5% and a recall of 99.1%, which are 1.4 and 1.2% higher than the baseline, respectively. While the inference speed is increased by 1 FPS, the model size is only increased by 0.9 MB, achieving a balance between detection speed, model size and detection accuracy.
The experimental results show that the proposed model can detect abnormal surface features of fish quickly and effectively, but it has certain limitations: 1) Single data type: only one fish species with abnormal surface features was studied, and no experiments were carried out on other species with abnormal surface features, so the model still needs to be validated on them in future work. 2) Fixed fish density: the model was not verified in high-density breeding scenarios. 3) Scarcity of extremely small targets in fish schools: in scenarios with an extreme lack of abnormal surface feature information, model performance may degrade. Therefore, in future work we will study abnormal fish detection for different species and aquaculture scenarios. Moreover, we will combine multi-modality and transfer learning and construct datasets of abnormal fish surface features for various downstream tasks.
Despite these shortcomings, the model detects abnormal surface features of in-water longsnout catfish quickly and accurately, and its size and parameter count remain relatively lightweight. It can be deployed on embedded devices and web platforms. This study therefore provides new ideas for realizing smart aquaculture.
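As a sketch of how such a deployment might begin, the snippet below exports a trained detector to ONNX, a common first step toward embedded or web inference. The checkpoint path, output file name and opset version are placeholders, and the snippet assumes a YOLOv5-style checkpoint that stores the network under the "model" key; other training pipelines may save weights differently.

```python
import torch

# Hypothetical checkpoint path; assumes a YOLOv5-style checkpoint dict
# whose "model" entry holds the network.
ckpt = torch.load("improved_yolov5s.pt", map_location="cpu")
model = ckpt["model"].float().eval()

dummy = torch.zeros(1, 3, 640, 640)          # matches the 640 x 640 training size
torch.onnx.export(
    model, dummy, "improved_yolov5s.onnx",
    opset_version=12,
    input_names=["images"],
    output_names=["predictions"],
)
```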
Fish Welfare: We recognize that any technology affecting fish health and welfare must be applied with caution. Our research aims to help monitor abnormal fish to improve farming efficiency, and we emphasize the need to minimize the impact of these techniques on the fish.
Privacy Issues: We take data protection seriously during data collection and image processing. We try to minimize disruption to and impact on individual fish and take steps to ensure data security and privacy protection.
The authors declare they have not used Artificial Intelligence (AI) tools in the creation of this article.
This work was supported in part by the National Key R & D Program of China (grant number: 2023YFD2401304), Shanghai Collaborative Innovation Center for Cultivating Elite Breeds and Green-culture of Aquaculture animals (grant number: 2021-KJ-02-12) and the Shanghai Chongming Agricultural Science and Technology Innovation Project (grant number: 2021CNKC-05-06).
The authors declare there is no conflict of interest.
Models | P (%) | R (%) | mAP50 (%) | mAP50:95 (%) | Model Size (MB) | FLOPs (G) | FPS
YOLOv5n | 91.1 | 91.7 | 92.2 | 57.4 | 3.8 | 4.1 | 101
YOLOv5s | 94.9 | 94.5 | 96.4 | 66.1 | 14.3 | 15.8 | 87
YOLOv5m | 94.9 | 94.8 | 96.2 | 67.9 | 42.1 | 47.9 | 76
YOLOv5l | 94.8 | 94.6 | 96.4 | 67.5 | 92.7 | 107.6 | 68
YOLOv5x | 95.1 | 94.4 | 95.9 | 68.8 | 173 | 203.8 | 61
Models | P (%) | R (%) | mAP50 (%) | mAP50:95 (%) | Model Size (MB)
Replacement1 | 98.6 | 98.4 | 99.3 | 73.0 | 15.1
Replacement2 | 98.2 | 98.6 | 99.1 | 73.3 | 14.7
Replacement3 | 97.7 | 99.5 | 98.9 | 72.1 | 14.5
Replacement4 | 98.9 | 99.0 | 99.3 | 74.1 | 14.2
Note: Replacement 1 is ODC-CBAM replacing the regular convolution of layers 1, 3, 5 and 7 of the Backbone and the front of the Head. Replacement 2 is ODC-CBAM replacing the regular convolution of layers 3, 5 and 7 of the Backbone and the front of the Head. Replacement 3 is ODC-CBAM replacing the regular convolution of layers 5 and 7 of the Backbone and the front of the Head. Replacement 4 is ODC-CBAM replacing layer 7 of the Backbone and the regular convolution in front of the Head. |
Configuration | Parameter |
CPU | Intel(R) Xeon(R) W-2223
GPU | Nvidia GeForce RTX 2080 Ti
Operating system | Windows 10
Accelerated environment | CUDA 11.7 and cuDNN 8.0.5
Interpreter setting | Python 3.8 and torch 1.13.1
Parameter | Value |
Image size | 640 × 640 |
Optimizer | SGD |
Learning rate | 0.01 |
Momentum | 0.937 |
Epoch | 500 |
Models | 1 | 2 | 3 | 4 | Parameters | mAP50 (%) | mAP50:95 (%) | Model Size (MB) | FLOPs (G) | FPS
1 | × | × | × | × | 7022326 | 96.4 | 66.1 | 14.3 | 15.8 | 87
2 | √ | × | × | × | 7022326 | 98.7 | 67.6 | 14.0 | 14.9 | 85
3 | × | √ | × | × | 6965935 | 98.5 | 68.0 | 14.3 | 15.4 | 67
4 | × | × | √ | × | 7074005 | 98.7 | 69.3 | 14.5 | 15.1 | 41
5 | × | × | × | √ | 7361878 | 98.0 | 68.5 | 15.1 | 16.9 | 77
6 | √ | √ | × | × | 6965935 | 99.2 | 70.7 | 14.3 | 15.3 | 66
7 | √ | √ | √ | × | 7021301 | 99.3 | 74.2 | 15.0 | 16.6 | 39
8 | √ | √ | √ | √ | 7369077 | 99.3 | 74.1 | 15.2 | 16.2 | 88
Note: "1" represents NWD improvement. "2" represents DenseOne improvement. "3" represents ODC-CBAM improvement. "4" represents MobileViTv2 improvement. "×" representatives do not introduce this improvement strategy. "√" representatives introduce this improvement strategy. |
Models | P (%) | R (%) | mAP50 (%) | mAP50:95 (%) | Model Size (MB) | FPS |
YOLOv5s | 98.1 | 97.9 | 95.9 | 65.7 | 14.3 | 87 |
improved YOLOv5s | 99.5 | 99.1 | 99.1 | 73.9 | 15.2 | 88 |
Models | Parameters (M) | mAP50 (%) | mAP50:95 (%) | Model Size (MB) | FLOPs (G) | FPS
YOLOv4 | 52.49 | 77.6 | 57.1 | 105.4 | 118.9 | 16
YOLOv7 | 37.19 | 91.4 | 63.6 | 291.4 | 103.2 | 34
YOLOv8 | 11.13 | 92.1 | 64.2 | 22.5 | 28.4 | 43
YOLOv5s | 7.02 | 96.4 | 66.1 | 14.3 | 15.8 | 87
Ours | 7.36 | 99.3 | 74.1 | 15.2 | 16.4 | 88
SSD | 23.61 | 77.0 | 55.6 | 90.6 | 273.74 | 12
Faster R-CNN | 28.05 | 88.1 | 59.7 | 108.0 | 947.28 | 8