Improved YOLOv7-based steel surface defect detection algorithm

Yinghong Xie; Biao Yin; Xiaowei Han; Yan Hao; Yinghong Xie; Biao Yin; Xiaowei Han; Yan Hao

doi:10.3934/mbe.2024016

Mathematical Biosciences and Engineering

2024, Volume 21, Issue 1: 346-368. doi: 10.3934/mbe.2024016

Previous Article Next Article

Research article Special Issues

Improved YOLOv7-based steel surface defect detection algorithm

1.
School of Information Engineering, Shenyang University, Shenyang 110003, China
2.
Institute for Science, Technology and Innovation, Shenyang University, Shenyang 110003, China

Academic Editor: Shangce Gao

Received: 09 October 2023 Revised: 27 November 2023 Accepted: 28 November 2023 Published: 13 December 2023

In response to the limited detection ability and low model generalization ability of the YOLOv7 algorithm for small targets, this paper proposes a detection algorithm based on the improved YOLOv7 algorithm for steel surface defect detection. First, the Transformer-InceptionDWConvolution (TI) module is designed, which combines the Transformer module and InceptionDWConvolution to increase the network's ability to detect small objects. Second, the spatial pyramid pooling fast cross-stage partial channel (SPPFCSPC) structure is introduced to enhance the network training performance. Third, a global attention mechanism (GAM) attention mechanism is designed to optimize the network structure, weaken the irrelevant information in the defect image, and increase the algorithm's ability to detect small defects. Meanwhile, the Mish function is used as the activation function of the feature extraction network to improve the model's generalization ability and feature extraction ability. Finally, a minimum partial distance intersection over union (MPDIoU) loss function is designed to locate the loss and solve the mismatch problem between the complete intersection over union (CIoU) prediction box and the real box directions. The experimental results show that on the Northeastern University Defect Detection (NEU-DET) dataset, the improved YOLOv7 network model improves the mean Average precision (mAP) performance by 6% when compared to the original algorithm, while on the VOC2012 dataset, the mAP performance improves by 2.6%. These results indicate that the proposed algorithm can effectively improve the small defect detection performance on steel surface defects.

Keywords:

Citation: Yinghong Xie, Biao Yin, Xiaowei Han, Yan Hao. Improved YOLOv7-based steel surface defect detection algorithm[J]. Mathematical Biosciences and Engineering, 2024, 21(1): 346-368. doi: 10.3934/mbe.2024016

Related Papers:

[1]	Lili Wang, Chunhe Song, Guangxi Wan, Shijie Cui . A surface defect detection method for steel pipe based on improved YOLO. Mathematical Biosciences and Engineering, 2024, 21(2): 3016-3036. doi: 10.3934/mbe.2024134
[2]	Yiwen Jiang . Surface defect detection of steel based on improved YOLOv5 algorithm. Mathematical Biosciences and Engineering, 2023, 20(11): 19858-19870. doi: 10.3934/mbe.2023879
[3]	Guozhen Dong . A pixel-wise framework based on convolutional neural network for surface defect detection. Mathematical Biosciences and Engineering, 2022, 19(9): 8786-8803. doi: 10.3934/mbe.2022408
[4]	Fang Luo, Yuan Cui, Xu Wang, Zhiliang Zhang, Yong Liao . Adaptive rotation attention network for accurate defect detection on magnetic tile surface. Mathematical Biosciences and Engineering, 2023, 20(9): 17554-17568. doi: 10.3934/mbe.2023779
[5]	Kangjian Sun, Ju Huo, Qi Liu, Shunyuan Yang . An infrared small target detection model via Gather-Excite attention and normalized Wasserstein distance. Mathematical Biosciences and Engineering, 2023, 20(11): 19040-19064. doi: 10.3934/mbe.2023842
[6]	Xian Fu, Xiao Yang, Ningning Zhang, RuoGu Zhang, Zhuzhu Zhang, Aoqun Jin, Ruiwen Ye, Huiling Zhang . Bearing surface defect detection based on improved convolutional neural network. Mathematical Biosciences and Engineering, 2023, 20(7): 12341-12359. doi: 10.3934/mbe.2023549
[7]	Naigong Yu, Hongzheng Li, Qiao Xu . A full-flow inspection method based on machine vision to detect wafer surface defects. Mathematical Biosciences and Engineering, 2023, 20(7): 11821-11846. doi: 10.3934/mbe.2023526
[8]	Chen Chen, Guowu Yuan, Hao Zhou, Yutang Ma, Yi Ma . Optimized YOLOv7-tiny model for smoke detection in power transmission lines. Mathematical Biosciences and Engineering, 2023, 20(11): 19300-19319. doi: 10.3934/mbe.2023853
[9]	Yong Hua, Hongzhen Xu, Jiaodi Liu, Longzhe Quan, Xiaoman Wu, Qingli Chen . A peanut and weed detection model used in fields based on BEM-YOLOv7-tiny. Mathematical Biosciences and Engineering, 2023, 20(11): 19341-19359. doi: 10.3934/mbe.2023855
[10]	Yong Tian, Tian Zhang, Qingchao Zhang, Yong Li, Zhaodong Wang . Feature fusion–based preprocessing for steel plate surface defect recognition. Mathematical Biosciences and Engineering, 2020, 17(5): 5672-5685. doi: 10.3934/mbe.2020305

Abstract

1. Introduction

Steel is the foundation and support of modern construction. The steel production industry is a crucial foundational sector of the national economy, serving as a vital support for the construction of a modern and powerful nation. Moreover, it is a key area for achieving green and low-carbon development. In the process of steel production, various factors can lead to surface defects in steel materials. Steel surface defect detection can effectively screen out unqualified steel and prevent it from entering the market. It can help enterprises identify and solve problems in the production process in a timely manner and improve production processes and equipment. Therefore, the detection of surface defects in steel is a critical technology to ensure the quality of steel products, enhance production efficiency, reduce costs, ensure safety, and maintain credibility. In the steel production process, the detection of defects in steel is an indispensable step.

The traditional manual visual inspection method has a low efficiency, poor stability, high labor intensity, and a high cost, making it difficult to meet the needs of the modern manufacturing industry. To address the limitations of manual inspection, some researchers have explored the use of image processing techniques for steel surface defect detection. In recent years, many scholars have integrated deep learning frameworks with defect detection methods, thus achieving promising results. Shuang et al. ^[1] leveraged the strengths of convolutional neural networks (CNNs) and autoencoders (AEs) to learn the normal patterns in images and use reconstruction error to detect defects. He et al. ^[2] introduced a defect detection framework using regressions and classifications. Additionally, they presented a high-performance deep network architecture and a label generation algorithm to capture defect severity information in data annotations.

In recent years, numerous researchers have integrated deep learning frameworks with defect detection methods to develop end-to-end network models for defect detection, thereby achieving remarkable results. Luo et al. ^[3] proposed a decoupled two-stage object detection framework based on CNNs to address the challenges in detecting surface defects on flexible printed circuit boards (FPCB). They introduced a multilevel hierarchical aggregation (MHA) module as a feature enhancement module for precise defect localization and a local non-local (LNL) module for enhancing spatial encoding features (SEF) in defect classification tasks, thereby effectively localizing defects. Shao et al. ^[4] introduced a method for the pixel-wise, semi-supervised detection of textile defects by integrating a multi-task mean teacher (MT) framework. They established a multi-task detection network (ST-CNN) for detecting defect contours, defect areas, and defect distance maps, thereby aiding in defect segmentation. This model served both as a student network and a teacher network, thereby enabling the effective detection of textile defects with limited annotated samples. Chen et al. ^[5] proposed a genetic algorithm-based Gabor faster region-based CNN (Faster GG R-CNN), which incorporates Gabor kernels into Faster R-CNN. They devised a two-stage training methodology that combined a genetic algorithm (GA) and a backpropagation to train the Faster GG R-CNN model, thereby enabling the effective detection of textile defects under varying backgrounds, positions, and sizes.

In recent years, several YOLO-based algorithms ^[6,7,8,9] have been introduced, thus making significant progress in the domain of defect detection. For instance, Qian et al. ^[10] modified the YOLOv3 algorithm by replacing the original DarkNet53 with ShuffleNetv2. Moreover, they proposed the lightweight feature pyramid network (LFPN) network to enhance feature fusion for an improved efficiency in handling multi-scale features. Yang ^[11] integrated an improved YOLOv5 network into the domain of steel surface defects. They added a convolutional block attention module to the YOLOv5 network, thereby improving the detection accuracy by emphasizing crucial information. Wang et al. ^[12] proposed a defect detection algorithm based on an enhanced YOLOv7 model. They utilized a weight-dismantled weighted bi-directional feature pyramid network (BiFPN) structure to maximize the feature information fusion and to reduce the feature loss during the convolution process. Wang ^[13] introduced an improvement to the model based on You Only Look Once X (YOLOX). They embedded coordinate attention blocks within the backbone to enhance the modeling capabilities, employed zooming loss to address the foreground-background class imbalance, and predicted intersection over union-aware (IoU-aware) classification scores as a detection ranking criteria. Furthermore, they applied complete intersection over union (CIoU) loss to the regression branch to enhance the defect localization performance. Li et al. ^[14] introduced the YOLOv6 series, which incorporated replicated visual geometry group (ReVGG) for structural reparameterization and adopted the scale-invariant intersection over union (SIoU) loss function for a superior detection performance. Wang et al. ^[15] developed the YOLOv7 series, which leveraged an efficient long-range aggregation network and a cascaded model scaling strategy to effectively enhance the algorithm's detection capabilities.

The detection of small target defects is a common challenge in defect detection. Fityanul Akhyar ^[16] and others introduced a novel approach using RetinaNet to streamline defect detection from a two-stage to a more efficient single-stage process. Vikanksh Nath ^[17] proposed a hybrid model, S2D2Net, for an efficient and robust surface defect detection in steel materials during the manufacturing process. S2D2Net employed pre-trained ImageNet models as feature extractors and learned capsule networks on the extracted features.

The existing mainstream YOLOv7 series algorithms have faced persistent technical challenges in the detection of small objects, making them less suitable for steel inspection. To improve the detection accuracy of small defects in steel, this paper proposed an improved YOLOv7 algorithm. The proposed algorithm aims to overcome the limitations of current technologies in detecting small objects within the context of steel defect inspection. This paper's contributions include the following:

1) We propose the transformer-inception (TI) module, which is a novel fusion of the TransformerBlock ^[18] and the InceptionDWConvolution ^[19]. We seamlessly integrate the TI module into the YOLOv7 architecture, thus significantly improving the network's accuracy in detecting small target defects.

2) To optimize the network structure, we introduce the global attention mechanism (GAM) ^[20] module. This module is seamlessly fused with the YOLOv7 backbone network. It effectively extracts features at different scales while suppressing irrelevant information. The inclusion of the GAM attention mechanism substantially boosts the algorithm's proficiency in small target defect detection.

3) In contrast to the traditional spatial pyramid pooling cross-stage partial channel (SPPCSPC) module, we introduce the spatial pyramid pooling with feature context spatial pyramid convolution (SPPFCSPC) module. This innovative module not only maintains the algorithm's speed enhancements, but also ensures that the receptive field remains at the desired level. This adaptation significantly improves the algorithm's capacity.

The content overview of the subsequent sections is as follows. In Section 2, a detailed introduction to the YOLOv7 network is provided. Section 3 introduces the enhanced YOLOv7 network, thereby outlining the functions and characteristics of each module within the network. Section 4 offers an overview of the specific experimental design, encompassing the dataset introduction, the experimental environment and parameter settings, the evaluation criteria, the ablation experiments, and the comparative trials. Section 5 comprehensively summarizes the paper by emphasizing the algorithm's strengths and weaknesses, along with any potential future solutions and directions.

2. YOLOv7 network structure

In this paper, we adopted the YOLOv7 network framework as the fundamental architecture for steel surface defect detection. YOLOv7 represents an evolution of the YOLOv5 model by incorporating several improvements and innovations in object detection. It was introduced in July 2022, and has demonstrated a superior performance across a wide range of frame rates, from 5 frames per second (FPS) to 160 FPS, thereby surpassing all known object detectors. The network structure of YOLOv7 is depicted in Figure 1. YOLOv7 consists of three main components:

Figure 1. YOLOv7 network structure.

Hyper parameters	Value
Learning rate	0.001
Momentum	0.937
Weight decay	0.0005
Optimizer	SGD
Batch size	16
Total epochs	300
Frozen epochs	50
Box	0.05
Cls	0.5
Cls_pw	1.0
Obj	1.0
Obj_pw	1.0

YOLOv7	Mish	SPPFCSPC	GAM	TI	MPDIoU	mAP/%	AP/%
YOLOv7	Mish	SPPFCSPC	GAM	TI	MPDIoU	mAP/%	Cr	Rs	Sc	In	Pa	Pi
√						76.4	50.1	75.0	83.9	78.5	94.0	77.7
√	√					76.8	50.3	75.0	85.6	79.5	94.0	78.2
√	√	√				77.5	50.3	74.8	87.4	82.1	95.3	77.6
√	√	√	√			79.8	52.6	76.5	91.2	82.3	97.2	81.1
√	√	√	√	√		81.1	53.5	79.6	92.0	87.6	97.7	81.1
√	√	√	√	√	√	82.4	56.1	80.3	92.0	89.7	97.7	81.1

	mAP/%	Precision/%	FPS
YOLOv7	66.7	65.2	47.6
YOLOv5	67.2	67.5	46.3
YOLOX	67.5	67.2	49.6
proposed YOLOv7	69.3	68.9	26.8

[1]	S. Mei, Y. D. Wang, G. J. Wen, Automatic fabric defect detection with a multi-scale convolutional denoising autoencoder network model, Sensors, 18 (2018), 1064. http://doi.org/10.3390/S18041064 doi: 10.3390/S18041064
[2]	Z. Q. He, Q. F. Liu, Deep regression neural network for industrial surface defect detection, IEEE Access, 8 (2020), 35583–35591. http://doi.org/10.1109/ACCESS.2020.2975030 doi: 10.1109/ACCESS.2020.2975030
[3]	J. X. Luo, Z. Y. Yang, S. P. Li, Y. Wu, FPCB surface defect detection: a decoupled two-stage object detection framework, IEEE Trans. Instrum. Meas., 70 (2021). http://doi.org/10.1109/TIM.2021.3092510 doi: 10.1109/TIM.2021.3092510
[4]	L. H. Shao, E. R. Zhang, Q. R. Ma, M. Li, Pixel-wise semisupervised fabric defect detection method combined with multitask mean teacher, IEEE Trans. Instrum. Meas., 71 (2022). http://doi.org/10.1109/TIM.2022.3162286 doi: 10.1109/TIM.2022.3162286
[5]	M. Q. Chen, L. J. Yu, C. Zhi, R. Sun, S. Zhu, Z. Gao, et al., Improved faster R-CNN for fabric defect detection based on Gabor filter with genetic algorithm optimization, Comput. Ind., 134 (2022). http://doi.org/10.1016/j.compind.2021.103551 doi: 10.1016/j.compind.2021.103551
[6]	J. Redmon, S. Divvala, R. Girshick, A. Farhadi, You only look once: Unified, real-time object detection, in 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), (2016), 27–30. http://doi.org/10.1109/CVPR.2016.91
[7]	J. Redmon, A. Farhadi, YOLO9000: Better, faster, stronger, in 30th IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), (2017), 21–26. http://doi.org/10.1109/CVPR.2017.690
[8]	J. Redmon, A. Farhadi, YOLOv3: An incremental improvement, preprint, arXiv: 180402767.
[9]	A. Bochkovskiy, C. Y. Wang, H. Y. M. Liao, YOLOv4: Optimal speed and accuracy of object detection, preprint, arXiv: 200410934.
[10]	X. H. Qian, X. Wang, S. Y. Yang, J. Lei, LFF-YOLO: A YOLO algorithm with lightweight feature fusion network for multi-scale defect detection, IEEE Access, 10 (2022), 130339–130349. http://doi.org/10.1109/ACCESS.2022.3227205 doi: 10.1109/ACCESS.2022.3227205
[11]	N. Yang, W. Guo, Application of improved YOLOv5 model for strip surface defect detection, in 2022 Global Reliability and Prognostics and Health Management (PHM-Yantai), (2022), 1–5. http://doi.org/10.1109/PHM-Yantai55411.2022.9942194
[12]	Y. Wan, H. Y. Wang, Z. H. Xin, Efficient detection model of steel strip surface defects based on YOLO-V7, IEEE Access, 10 (2022), 133936–133944. http://doi.org/10.1109/ACCESS.2022.3230894 doi: 10.1109/ACCESS.2022.3230894
[13]	X. Wang, K. Zhuang, An improved YOLOX method for surface defect detection of steel strips, in 2023 IEEE 3rd International Conference on Power, Electronics and Computer Applications (ICPECA), (2022), 152–157. http://doi.org/10.1109/ICPECA56706.2023.10075827
[14]	C. Li, L. Li, H. Jiang, K. Weng, Y. Geng, L. Li, et al., YOLOv6: A single-stage object detection framework for industrial applications, preprint, arXiv: 220902976.
[15]	C. Y. Wang, A. Bochkovskiy, H. Y. M. Liao, YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors, in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, (2023), 7464–7475. http://doi.org/10.48550/arXiv.2207.02696
[16]	F. Akhyar, C. Y. Lin, K. Muchtar, T. Y. Wu, H. F. Ng, High efficient single-stage steel surface defect detection, in 16th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS), (2019), 18–21. http://doi.org/10.1109/AVSS.2019.8909834
[17]	V. Nath, C. Chattopadhyay, S2D2Net: An improved approach for robust steel surface defects diagnosis with small sample learning, in IEEE International Conference on Image Processing (ICIP), (2021), 1199–1203. http://doi.org/10.26599/TST.2018.9010090
[18]	A. Vaswani, N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A. N. Gomez, et al., Attention is all you need, in Advances in Neural Information Processing Systems, 30 (2017). http://doi.org/10.1109/ICIP42928.2021.9506405
[19]	W. Yu, P. Zhou, S. Yan, X. Wang, Inceptionnext: When inception meets convnext, preprint, arXiv: 230316900.
[20]	Y. Liu, Z. Shao, N. Hoffmann, Global attention mechanism: Retain information to enhance channel-spatial interactions, preprint, arXiv: 211205561.

1.	Yuxin Ma, Jiaxing Yin, Feng Huang, Qipeng Li, Surface defect inspection of industrial products with object detection deep networks: a systematic review, 2024, 57, 1573-7462, 10.1007/s10462-024-10956-3
2.	Jianbo Lu, MiaoMiao Yu, Junyu Liu, Lightweight strip steel defect detection algorithm based on improved YOLOv7, 2024, 14, 2045-2322, 10.1038/s41598-024-64080-x
3.	Haiqiang Xu, Renzheng Xue, Zifeng Zhang, Shijie Hua, MGDE-YOLO: An Improved Lightweight Algorithm for Personnel Departure Detection Based on YOLOv7, 2024, 12, 2169-3536, 150592, 10.1109/ACCESS.2024.3480040
4.	Shaoxu Li, Honggui Deng, Fengyun Zhou, Yitao Zheng, DEC-YOLO: Surface Defect Detection Algorithm for Laser Nozzles, 2025, 14, 2079-9292, 1279, 10.3390/electronics14071279

Mathematical Biosciences and Engineering

Improved YOLOv7-based steel surface defect detection algorithm

Related Papers:

Abstract

1. Introduction

2. YOLOv7 network structure

3. The proposed YOLOv7 algorithm network structure

3.1. TI module

3.1.1. Transformer block

3.1.2. InceptionDWConvolution

3.2. SPPFCSPC

3.3. GAM global attention mechanism

3.4. Mish

3.5. MPDIoU

4. Experiment and result analysis

4.1. Datasets

4.2. Settings

4.3. Evaluation metric

4.4. Ablation

4.5. Contrast test

4.6. Result analysis

5. Conclusions

Use of AI tools declaration

Acknowledgments

Conflict of interest

References

This article has been cited by:

Reader Comments

通讯作者: 陈斌, bchen63@163.com

Metrics

Figures and Tables

Other Articles By Authors

Related pages

Tools

Export File

Citation

Format

Content

Catalog