Research article

Improved MViTv2-T model for insulator defect detection


Insulators play a crucial role in transmission lines. Insulators exposed to natural environments are prone to various malfunctions. These faults seriously affect the safety and stability of power grid operation, so intelligent detection of insulator defects has become increasingly important. This paper presents an insulator defect detection model based on an improved MViTv2-T (Multiscale Vision Transformers Version 2 Tiny). The new model replaces the batched non-maximum suppression (NMS) algorithm of the original model with the score penalty mechanism (SPM) cluster NMS algorithm. Additionally, it introduces the stage query recollection method, which integrates high-level and low-level module queries within each stage, and experiments with various integration functions between the two. The experimental results indicate that the improved MViTv2-T model attains an mAP (mean average precision)@0.5:0.95 of 76.1%, mAP@0.5 of 96.1%, and mAR@0.5 of 97.2% in insulator defect detection. Compared to the original model, there is a 1.8% increase in mAP@0.5:0.95 and a 17% decrease in the detection error rate at an Intersection over Union (IoU) threshold of 0.5. Furthermore, when compared to standard two-stage detection models and YOLO series models, the improved MViTv2-T model also exhibits distinct performance advantages.

    Citation: Fuhong Meng, Guowu Yuan, Hao Zhou, Hao Wu, Yi Ma. Improved MViTv2-T model for insulator defect detection[J]. AIMS Electronics and Electrical Engineering, 2025, 9(1): 1-25. doi: 10.3934/electreng.2025001




With the rapid industrial development in China, the demand for electricity has steadily increased, leading to the expansion of various power facilities such as transmission lines. Insulators play critical roles in these lines, providing essential electrical insulation and mechanical support. Exposed to the natural environment, insulators are prone to damage such as defects and breakage caused by diverse and harsh weather conditions. These issues can significantly compromise the safety and stability of the power grid system, emphasizing the vital need for regular insulator inspections[1]. Manual inspection remains the primary maintenance method, with power companies relying heavily on manual checks and maintenance of crucial transmission line components, including insulators. However, the continuous expansion of the power grid, the proliferation of long-distance transmission lines, and the placement of lines in remote mountainous areas have posed numerous challenges for manual inspections, marked by high costs and operational complexities.

In recent years, the proliferation of artificial intelligence technology has led to the widespread application of deep learning-based defect detection methods across various engineering domains [2,3]. Numerous researchers have proposed deep learning-based insulator defect detection methods. [4] introduced the bidirectional feature pyramid network (BiFPN) module into YOLOv5 and combined it with SimAM, achieving higher detection accuracy while maintaining a high detection speed. [5] merged YOLOv3 with SRCNN; experimental findings indicate that this approach boosts detection accuracy by 1% to 3% compared to Faster R-CNN and SSD, offers improved speed, and nearly achieves real-time performance, although challenges persist in detecting small targets. [6] proposed an improved YOLOv8n-based insulator defect detection model, introducing a triplet attention module; they put forward a lighter SC-Detect to replace the original detection head and reconstructed the neck structure using a GSConv-based Slim-neck, reducing the model's parameters and computational load to meet the requirements of high accuracy and real-time performance. [7] presented a multi-scale insulator defect detection approach built on the detection transformer (DETR); it includes a multi-scale backbone, a self-attention upsampling (SAU) module, and the insulator defect (IDIoU) loss function, resulting in exceptional performance in detecting small defects. [8] introduced a dense connection architecture incorporating multi-scale features, an adaptive weight transfer module operating at multiple scales, and a multi-branch detection unit; this architecture enables accurate identification and precise localization of insulator defects, outperforming the comparison algorithms in both accuracy and speed. [9] presented an insulator defect detection framework based on unsupervised image reconstruction; collecting and using the catenary insulator defect (CID) dataset, they achieved high accuracy without manual annotations. [10] presented an enhanced Faster R-CNN algorithm that employs the ResNeSt network as its backbone and integrates the region proposal network (RPN) with the ResNeSt network to boost the extraction of defect features, reaching a detection accuracy of 98.38% for insulator defects. [11] proposed a detection method based on a microwave technique and an automatic detection system to detect the internal defects of composite insulators, which performs efficiently while being labor-saving and robust. [12] proposed a coordinate attention mechanism (CAM) and feature channel shuffle operation (CSO) YOLO (CACS-YOLO); using synthetic weather algorithms for data enhancement and introducing the CAM and CSO into the YOLOv8m model, they improved the detection precision and reduced the model's parameters.

    Several challenges arise when detecting crucial components in transmission lines like insulators, including substantial scale variations in detection targets involving numerous small and medium-sized objects[13,14], intricate backgrounds, and occlusion occurrences[15,16]. To address these challenges while catering to practical engineering needs, it is crucial to strike a balance between detection precision and speed. Consequently, the current research focuses on creating a lightweight, precise, and robust defect detection model[17,18].

In this paper, we propose a defect detection model for insulator defect detection according to the requirements of intelligent inspection of power grids. Unlike other research that widely applies the YOLO model to insulator defect detection, we innovatively introduce the transformer-based MViTv2 model. Unlike other research that tends to add complex modules to improve performance, we avoid introducing additional modules or complex computational processes, focusing instead on fully integrating and utilizing the features learned by each layer of the model. In this way, we achieve higher detection accuracy without introducing additional parameters or significantly slowing the detection speed. Our model's metrics meet engineering application standards, and the model can be deployed on front-end acquisition equipment for power grid inspection and monitoring.

The insulator defect dataset in this paper mainly comes from the State Grid Corporation of China and Yunnan Limited Company of China Southern Power Grid. All insulator images are sourced from real insulators on China's power grid transmission lines and were collected through fixed or drone photography. Due to the difficulty of collection, the sample size is limited. The State Grid Corporation of China provides the normal type and defect type insulator samples, with 600 and 248 images, respectively; Yunnan Limited Company of China Southern Power Grid provides the broken type and defect type insulator samples, with 232 and 134 images, respectively. Therefore, there are 600 images of normal insulators, 382 of defect insulators, and 232 of broken insulators, totaling 1214 images.

    The insulator is composed of insulator units and metal connectors. Insulators with all intact insulator units are the normal type insulators; insulators with one or more broken insulator units are the broken type insulators, which can still perform their normal functions but need to be replaced; insulators with one or more missing insulator units are the defect type insulators, which have completely lost their normal functions and need to be replaced urgently. Figure 1 shows the three categories of labeled targets.

    Figure 1.  Three categories of labeled targets.

    This paper visualizes statistics on the dataset, as shown in Figure 2. Figure 2(a) displays the number of samples for each category, Figure 2(b) presents the width and height of detected targets relative to the image, and Figure 2(c) shows the statistics of all image sizes.

    Figure 2.  Dataset statistics visualization.

    From Figure 2(a), it can be seen that in the insulator defect dataset, the majority of detected targets are normal insulators, while the number of broken and defect insulator targets is relatively small, indicating the presence of a sample imbalance issue in the dataset.

    Observing Figure 2(b), it is evident that over half of the detection targets in the insulator defect dataset cover a small area of the whole image, while some targets occupy a larger area, highlighting noticeable variations in target sizes. This underscores the necessity for the model to possess robust multi-scale detection capabilities.

    Based on Figure 2(c), it is clear that more than half of the images in the insulator defect dataset have dimensions below 2000 x 2000. However, a notable fraction of images exhibit larger dimensions, indicating an uneven distribution of image sizes within the dataset. During model training, images are usually resized, which can result in the loss of features in oversized images.

    All images contain one or more detection targets, typically against backgrounds of hills, land, sky, forests, and grasslands, which are varied and complex. The dataset also includes cases of target occlusion, adding to the difficulty of detection.

    Considering the dataset characteristics and practical requirements, this paper utilizes Facebook's multiscale vision transformers (MViTv2) model as the original model and opts for its tiny version model with fewer parameters and quicker detection speed. Subsequently, the model will be improved to better align with the insulator defect dataset and enhance defect detection performance.

    In practical engineering applications, the detection model needs to be integrated into the cameras for real-time detection in crucial areas (substations and abnormal monitoring areas); in non-crucial areas, the cameras take scheduled photos and transmit them to the data center for detection. Thus, the detection model should have faster speed and higher accuracy under the same configuration. The MViTv2 model is a deep learning model derived from ViT (vision transformer). Its accuracy and speed are aligned with the demands of practical engineering applications. In this paper, we choose the MViTv2-T model as the original model. Based on the characteristics of our application, we replace the NMS algorithm and propose the stage query recollection method.

The MViTv1[19] model was proposed by Facebook in 2021 to address the large parameter count and computational complexity of ViTs by introducing pooling attention, which reduces the number of parameters and the computational burden. The MViTv2[20] model is an improved version released by Facebook in 2022. Figure 3 compares the self-attention mechanisms of the MViTv1 and MViTv2 models.

    Figure 3.  The self-attention mechanism of the MViTv1 model and the MViTv2 model.

The main improvements of MViTv2 are the relative position embedding and the residual pooling connection, shown in the red parts of Figure 3(b). In MViTv1, the interaction between two tokens depends only on their absolute positions in the image, even when their relative positions remain unchanged; this ignores the fundamental principle of shift-invariance in vision. To address this, MViTv2 uses relative position embedding, which depends only on the relative distance between tokens. MViTv1 shows that pooling attention significantly reduces computational complexity and memory requirements and enhances the network's feature extraction capability. To better train the pooling attention blocks, MViTv2 employs a residual pooling connection, which increases information flow.

    The formula for the self-attention mechanism calculation in the MViTv2 model is

$\mathrm{Attn}(Q,K,V)=\mathrm{softmax}\left(\frac{QK^{T}}{\sqrt{d_k}}\right)V+Q$  (1)

where Q, K, and V are the query, key, and value matrices obtained through the linear projections Linear_q, Linear_k, and Linear_v, and d_k is the dimension of the key matrix.
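As a minimal illustration of Eq. (1), the following PyTorch sketch computes scaled dot-product attention with the residual connection that adds the query back; the pooling operators and the decomposed relative position embedding of MViTv2 are omitted, and the tensor shapes are purely illustrative.

```python
import torch

def mvitv2_attention(q, k, v):
    # Eq. (1): scaled dot-product attention with the residual connection (+ Q).
    # q, k, v: (batch, num_tokens, dim); pooling and relative position
    # embedding of MViTv2 are omitted in this sketch.
    d_k = k.size(-1)
    attn = torch.softmax(q @ k.transpose(-2, -1) / d_k ** 0.5, dim=-1)
    return attn @ v + q

# Toy usage with illustrative shapes
q = torch.randn(2, 16, 96)
k = torch.randn(2, 16, 96)
v = torch.randn(2, 16, 96)
out = mvitv2_attention(q, k, v)   # shape (2, 16, 96)
```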

    The MViTv2-T model is divided into three parts: feature extraction module (Backbone with FPN), RPN, and ROI. The structure of the MViTv2-T model is shown in Figure 4.

    Figure 4.  The structure of the MViTv2-T model.

    The backbone of the MViTv2-T model comprises ten interconnected MViT blocks organized into 4 stages. The structure of the backbone is depicted in Figure 5.

    Figure 5.  The structure of the backbone.

    The MViTv2-T model is a two-stage detection model that uses the batched NMS algorithm implemented in Torchvision to suppress the candidate boxes obtained by RPN. The principle of the traditional NMS algorithm is to perform NMS on all bounding boxes (BBox) together. The difference of the batched NMS algorithm lies in performing NMS only on the BBoxes within each category, while the calculation principle is the same. Therefore, this algorithm suffers the same drawback as the traditional NMS algorithm: even if the number of candidate boxes before and after NMS is restricted by hyperparameters, there are still too many redundant candidate boxes, further affecting the subsequent detection performance. To avoid this situation and balance accuracy and speed, this paper considers using the SPM cluster NMS[21] algorithm to replace the batched NMS algorithm. The pseudocode for the SPM Cluster NMS algorithm is shown below:

Algorithm 1 SPM Cluster NMS Algorithm
Input:
       boxes (Tensor[N, 4]): bounding boxes
       scores (Tensor[N, 1]): score of each box
       NMS_threshold (float): IoU threshold for NMS
Output:
       boxes_kept (Tensor[M, 4]): filtered bounding boxes
       scores_kept (Tensor[M, 1]): filtered scores
1:  scores, idx ← sort(scores, descending = True)            // sort scores and get indices
2:  boxes ← boxes[idx]                                       // rearrange boxes based on sorted indices
3:  iou ← box_iou(boxes, boxes)                              // compute the IoU matrix
4:  C ← triu(iou, diagonal = 1)                              // keep the upper-triangular part
5:  i ← 0
6:  while i < 200 do
7:      A ← C                                                // store the current matrix
8:      maxA ← max(A, dim = 0)[0]                            // column-wise maximum values
9:      E ← (maxA < NMS_threshold)                           // exclusion mask of surviving boxes
10:     C ← iou ⊙ E                                          // element-wise multiplication with the IoU matrix
11:     if A.equal(C) then
12:         break                                            // exit the loop once stable
13:     end if
14:     i ← i + 1
15: end while
16: scores ← prod(exp(−C²/0.2), dim = 0) ⊙ scores.squeeze(1) // score penalty mechanism
17: keep ← scores > 0.01                                     // score thresholding
18: return boxes[keep], scores[keep]                         // return the filtered boxes and scores
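For reference, a PyTorch sketch of Algorithm 1 is given below, following the publicly available Cluster-NMS formulation; the constants (200 iterations, penalty denominator 0.2, score cutoff 0.01) mirror the pseudocode, and it assumes boxes in (x1, y1, x2, y2) format and scores as a 1-D tensor.

```python
import torch
from torchvision.ops import box_iou

def spm_cluster_nms(boxes, scores, nms_threshold=0.5, max_iter=200,
                    score_threshold=0.01, sigma=0.2):
    # boxes: Tensor[N, 4] in (x1, y1, x2, y2); scores: Tensor[N]
    scores, idx = scores.sort(descending=True)        # sort by confidence
    boxes = boxes[idx]
    iou = box_iou(boxes, boxes).triu(diagonal=1)      # upper-triangular IoU matrix
    C = iou
    for _ in range(max_iter):                         # iterate until the matrix is stable
        A = C
        max_a = A.max(dim=0).values                   # largest overlap with any higher-scoring box
        # boxes that are themselves suppressed may no longer suppress others
        E = (max_a < nms_threshold).float().unsqueeze(1).expand_as(A)
        C = iou * E
        if A.equal(C):
            break
    # Score penalty mechanism (Eq. 2): each box is penalised by its overlaps
    # with all higher-scoring boxes (column-wise product).
    scores = torch.exp(-C ** 2 / sigma).prod(dim=0) * scores
    keep = scores > score_threshold                   # drop near-zero scores
    return boxes[keep], scores[keep]
```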

    This NMS algorithm introduces the score penalty mechanism (SPM), and its formula is

$\mathrm{Scores}_j=\mathrm{Scores}_j\cdot\prod_{i}e^{-\frac{C_{ij}^{2}}{0.2}}$  (2)

where Scores is a tensor of size N representing the scores of the N candidate boxes in the preliminary detection of the RPN, and C is the intermediate matrix in the NMS calculation process.

The IoU matrix represents the relevance between boxes. In Algorithm 1, the IoU matrix is an upper-triangular matrix calculated after sorting Boxes and Scores in descending order of score. When i < j, IoU_ij is meaningful and represents the similarity between Box_i and Box_j, with Score_i ≥ Score_j. The larger IoU_ij is, the higher the similarity between the two boxes, and the greater the likelihood that only one should be retained. The iteratively derived C matrix also possesses this characteristic. The SPM penalizes the score of a box by quantifying this likelihood, and due to the upper-triangular structure, Score_j is only penalized by boxes with scores higher than Score_j. For the calculation process of the SPM, refer to the example in Figure 6.

    Figure 6.  The calculation process of the SPM.

In Figure 6, C_02 = 0.13 implies that the similarity between Box_0 and Box_2 is 0.13, which indicates a high probability of retaining both boxes simultaneously. Therefore, by calculation, the similarity of 0.13 is transformed into a penalty ratio of 0.92, signifying that Box_0 exerts a minimal penalty on Box_2. Similarly, the penalty ratio 0.93 indicates the penalty exerted by Box_1 on Box_2. When these values are multiplied column-wise, the penalty ratio 0.86 represents the combined penalty of Box_0 and Box_1 on Box_2, where both Score_0 and Score_1 are greater than Score_2.
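These figures can be reproduced directly from Eq. (2); the short snippet below assumes Figure 6 rounds intermediate values to two decimal places.

```python
import math

# Reproducing the worked example of Figure 6 with Eq. (2), assuming
# intermediate values are rounded to two decimals.
p_02 = round(math.exp(-0.13 ** 2 / 0.2), 2)   # penalty of Box_0 on Box_2 -> 0.92
p_12 = 0.93                                   # penalty of Box_1 on Box_2 (read from Figure 6)
combined = round(p_02 * p_12, 2)              # column-wise product -> 0.86
print(p_02, combined)                         # 0.92 0.86
```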

Because most insulators are located in the wild against complex and changing backgrounds, and shooting angles often lead to occlusion, some missed and false detections occur. To address the missed and false detections caused by complex backgrounds and occlusions, this paper proposes integrating the queries of high-level and low-level modules within each stage, fully utilizing contextual information to enhance the saliency of detection targets and helping the model better locate and extract features; we call this method stage query recollection. It fully integrates and utilizes contextual image information and semantic information, enhancing the saliency of detection targets in situations such as complex backgrounds and occlusions. The stage query recollection method is shown in Figure 7.

    Figure 7.  The stage query recollection method.

    The formula for the self-attention mechanism after the stage query recollection method is

$\mathrm{Attn}(Q,K,V)=\mathrm{softmax}\left(\frac{f_n(Q_{high},Q_{low})K^{T}}{\sqrt{d_k}}\right)V+Q_{low}$  (3)

where Qhigh is the query of the higher-level MViT block, Qlow is the query of the lower-level MViT block, and fn is the function that integrates Qhigh and Qlow.
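A minimal sketch of Eq. (3) is shown below: the lower-level block's attention is computed with a query fused by an integration function fn, while the residual term still adds the low-level query; pooling and relative position embedding are again omitted, so this is only an illustration of the idea, not the authors' implementation.

```python
import torch

def recollection_attention(q_high, q_low, k, v, fn):
    # Eq. (3): attention computed with the fused query fn(Q_high, Q_low),
    # with the residual connection adding the low-level query back.
    d_k = k.size(-1)
    q_fused = fn(q_high, q_low)
    attn = torch.softmax(q_fused @ k.transpose(-2, -1) / d_k ** 0.5, dim=-1)
    return attn @ v + q_low
```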

    Initially, we present three basic integration functions fn to integrate two queries, each with progressively increasing parameters from left to right, as depicted in Figure 8.

    Figure 8.  Three basic fn functions.

The Conv block consists of Conv2d, GELU, and BatchNorm, while the inner block consists of two Conv blocks and a Conv2d, as shown in Figure 9(a) and Figure 9(b), respectively.

    Figure 9.  The structures of Conv Block and Inner Block.

In Figure 9(b), the inner block imitates the CNN-transformer fusion module in the experimental part of TransXNet[22], where the number of channels inside the inner block equals max(16, C/Ratio) and Ratio is a manually set hyperparameter; Figure 9(a) represents the basic module simplified from the inner block.

    The three basic fn functions in Figure 8 are

$f_1(Q_{high},Q_{low})=Q_{high}+Q_{low}$
$f_2(Q_{high},Q_{low})=\mathrm{Conv}(\mathrm{Concat}(Q_{high},Q_{low}))$
$f_3(Q_{high},Q_{low})=\mathrm{Inner}(\mathrm{Concat}(Q_{high},Q_{low}),\mathrm{Ratio})$  (4)

    where Concat is the concatenation of matrices along the channel dimension, and Conv and Inner represent the Conv block and the inner block, respectively.
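The sketch below shows one possible PyTorch realization of the three basic integration functions and of the Conv/Inner blocks of Figure 9. The 1×1 kernel sizes and the assumption that the queries have already been reshaped to (B, C, H, W) are ours and may differ from the authors' implementation.

```python
import torch
import torch.nn as nn

class ConvBlock(nn.Module):
    # Conv block of Figure 9(a): Conv2d + GELU + BatchNorm (kernel size assumed 1x1).
    def __init__(self, in_ch, out_ch):
        super().__init__()
        self.block = nn.Sequential(
            nn.Conv2d(in_ch, out_ch, kernel_size=1),
            nn.GELU(),
            nn.BatchNorm2d(out_ch),
        )

    def forward(self, x):
        return self.block(x)


class InnerBlock(nn.Module):
    # Inner block of Figure 9(b): two Conv blocks followed by a Conv2d,
    # with an inner width of max(16, C / Ratio).
    def __init__(self, in_ch, out_ch, ratio=1):
        super().__init__()
        hidden = max(16, int(out_ch / ratio))
        self.block = nn.Sequential(
            ConvBlock(in_ch, hidden),
            ConvBlock(hidden, hidden),
            nn.Conv2d(hidden, out_ch, kernel_size=1),
        )

    def forward(self, x):
        return self.block(x)


# Eq. (4): the three basic integration functions; queries are assumed to be
# reshaped to (B, C, H, W) before fusion.
def f1(q_high, q_low):
    return q_high + q_low          # parameter-free element-wise sum

class F2(nn.Module):
    def __init__(self, channels):
        super().__init__()
        self.conv = ConvBlock(2 * channels, channels)

    def forward(self, q_high, q_low):
        return self.conv(torch.cat([q_high, q_low], dim=1))

class F3(nn.Module):
    def __init__(self, channels, ratio=1):
        super().__init__()
        self.inner = InnerBlock(2 * channels, channels, ratio)

    def forward(self, q_high, q_low):
        return self.inner(torch.cat([q_high, q_low], dim=1))
```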

The three basic fn functions proposed above assume that Qhigh and Qlow are equally important, i.e., they are combined in a 1:1 proportion. Furthermore, introducing additional blocks into the model increases the number of parameters. To reduce the number of additional parameters, we hypothesize that the high-level and low-level queries may not be equally important when integrated. Hence, this paper introduces additional hyperparameters representing the proportion between Qhigh and Qlow, as illustrated in Figure 10.

    Figure 10.  Three fn functions with hyperparameters.

    The three fn functions in Figure 10 are

$f_4(Q_{high},Q_{low})=W_0 Q_{high}+W_1 Q_{low}$
$f_5(Q_{high},Q_{low})=\mathrm{Conv}(\mathrm{Concat}(W_0 Q_{high},W_1 Q_{low}))$
$f_6(Q_{high},Q_{low})=\mathrm{Inner}(\mathrm{Concat}(W_0 Q_{high},W_1 Q_{low}),\mathrm{Ratio})$  (5)

where W0 is the weight of Qhigh and W1 is the weight of Qlow.
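Under the same assumptions as the previous sketch, f4 reduces to a fixed-weight sum that adds no parameters, while f5 and f6 apply the Conv block or Inner block to the weighted, concatenated queries.

```python
# Sketch of f4 from Eq. (5): a fixed-weight sum of the two queries that adds
# no parameters; W0 and W1 are manually set hyperparameters, not learned.
def f4(q_high, q_low, w0=0.6, w1=0.4):
    return w0 * q_high + w1 * q_low

# f5 and f6 follow the same pattern as F2 and F3 in the previous sketch, but
# applied to torch.cat([w0 * q_high, w1 * q_low], dim=1).
```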

Due to the limited number of sample images and the sample imbalance, data augmentation is necessary. The data augmentation methods used in this paper and their examples are shown in Figure 11.

    Figure 11.  Data augmentation examples.

After data augmentation, the training set, validation set, and test set contain 1800, 600, and 600 images, respectively, with an equal number of images for each of the three categories in each subset.

    This paper visualizes statistics on the dataset after data augmentation, as shown in Figure 12. Figure 12(a) displays the number of various detection targets after data augmentation, Figure 12(b) presents the width and height of detected targets relative to the image after data augmentation, and Figure 12(c) shows the statistics of all image sizes after data augmentation.

    Figure 12.  Dataset statistics visualization.

The operating system used in the experiments is Windows 11, and the GPU is an Nvidia GeForce RTX 4090. The deep learning framework is Detectron2. During training, multi-scale training is employed (scaling the shorter edge into the range [480, 800] while keeping the longer edge under 1333). Optimization is performed with the AdamW optimizer (β1 = 0.9, β2 = 0.999, initial learning rate of 1.6×10⁻⁴, batch size of 4, and weight decay of 0.1). The loss functions used during training are the smooth L1 loss and the cross-entropy loss. The total training duration is 150 epochs, and the drop path rate is set to 0.1. We employ automatic mixed precision in PyTorch for training, utilizing the Mask R-CNN MViTv2-T model pretrained on ImageNet-1k.
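A plain-PyTorch sketch of this training configuration is given below; the Detectron2 model, data loader, multi-scale resizing, and learning-rate schedule are assumed to be built elsewhere, so only the optimizer settings and the mixed-precision loop are shown.

```python
import torch

def train(model, train_loader, epochs=150):
    # Sketch of the training setup described above (AdamW + automatic mixed precision);
    # `model` and `train_loader` are placeholders for the Detectron2 MViTv2-T model
    # and the augmented insulator dataset loader.
    optimizer = torch.optim.AdamW(model.parameters(), lr=1.6e-4,
                                  betas=(0.9, 0.999), weight_decay=0.1)
    scaler = torch.cuda.amp.GradScaler()            # automatic mixed precision
    for _ in range(epochs):
        for batch in train_loader:                  # batch size 4, multi-scale resized images
            optimizer.zero_grad()
            with torch.cuda.amp.autocast():
                loss_dict = model(batch)            # smooth L1 + cross-entropy losses
                loss = sum(loss_dict.values())
            scaler.scale(loss).backward()
            scaler.step(optimizer)
            scaler.update()
```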

This paper utilizes some of the evaluation metrics from COCO 2017 for experimental assessment, including average precision (AP), AP50, AP75, AR50, and FPS. To align with commonly used metrics, this paper performs the following conversions. AP corresponds to mAP@0.5:0.95, which computes mAP across IoU thresholds from 0.5 to 0.95 in intervals of 0.05 and averages the results. AP50 and AP75 represent mAP@0.5 and mAP@0.75, respectively, indicating average precision at IoU thresholds of 0.5 and 0.75. AR50 is equivalent to mAR@0.5, denoting the ratio of correctly detected targets to all ground-truth targets at an IoU threshold of 0.5. FPS stands for frames per second, signifying the detection model's processing speed in images per second. In this research, uniform evaluation metrics are employed in the tables, namely mAP@0.5:0.95, mAP@0.5, mAP@0.75, mAR@0.5, and FPS.

The formulas for calculating precision (P) and recall (R) are

$P=\frac{TP}{TP+FP}, \quad R=\frac{TP}{TP+FN}$  (6)

    where TP indicates a positive sample correctly predicted as positive, FP indicates a negative sample incorrectly predicted as positive, and FN denotes a positive sample incorrectly predicted as negative, representing a missed detection.

The formulas for average precision (AP), mean average precision (mAP), average recall (AR), and mean average recall (mAR) are as follows:

$AP=\int_{0}^{1}p(r)\,dr, \quad mAP=\frac{1}{K}\sum_{i=1}^{K}AP_i, \quad AR=2\int_{0.5}^{1}\mathrm{recall}(o)\,do, \quad mAR=\frac{1}{K}\sum_{i=1}^{K}AR_i$  (7)

    where K represents the total number of categories, and APi and ARi represent the AP and AR of category i, respectively.
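The snippet below sketches how these quantities can be computed; the trapezoidal integration is a simplification of the 101-point interpolation actually used by the COCO evaluation toolkit.

```python
import numpy as np

def precision_recall(tp, fp, fn):
    # Eq. (6): precision and recall from true/false positives and false negatives.
    return tp / (tp + fp), tp / (tp + fn)

def average_precision(recalls, precisions):
    # Eq. (7): AP as the area under the precision-recall curve (trapezoidal
    # approximation; COCO uses 101-point interpolation instead).
    r = np.asarray(recalls)
    p = np.asarray(precisions)
    order = np.argsort(r)
    return np.trapz(p[order], r[order])

# mAP then averages the per-category AP values: mAP = sum(ap_list) / K
print(precision_recall(tp=80, fp=10, fn=20))   # (0.888..., 0.8)
```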

    To mitigate the problem of excessive redundant candidate boxes generated during the NMS stage by the detection model, this paper explores the substitution of the Batched NMS algorithm. The experimental investigation includes the evaluation of various improved NMS algorithms such as soft NMS[23], DIoU NMS[24], fast NMS[25], cluster NMS, and SPM cluster NMS.

The batched NMS algorithm is retained during model training, following the procedure outlined in the soft NMS paper, while the different improved NMS algorithms are employed at test time for comparative analysis of detection performance. Table 1 presents the comparative experiments for each improved NMS algorithm.

    Table 1.  Comparative experiments of each improved NMS algorithm.
Method | mAP@0.5:0.95/% | mAP@0.5/% | mAP@0.75/% | mAR@0.5/% | FPS
Original model | 74.3 | 95.3 | 87.3 | 95.8 | 22.2
Soft NMS | 75.3 | 95.4 | 87.6 | 96.1 | 0.5
DIoU NMS | 74.5 | 95.4 | 87.9 | 96.1 | 0.8
Fast NMS | 67.9 | 88.2 | 79.7 | 89.3 | 20.6
Cluster NMS | 74.7 | 95.4 | 87.1 | 96.2 | 17.2
SPM Cluster NMS | 75.4 | 95.4 | 87.4 | 96.2 | 17.1


Table 1 shows that the soft NMS algorithm achieves high detection accuracy; however, its sequential structure underutilizes the GPU's parallel computing capability, resulting in a severe drop in detection speed. The DIoU NMS algorithm is also slow and does not improve detection performance. In contrast, the fast NMS, cluster NMS, and SPM cluster NMS algorithms have similar detection speeds: the fast NMS algorithm achieves low detection accuracy, the cluster NMS algorithm only marginally improves accuracy, while the SPM cluster NMS algorithm matches the soft NMS algorithm's accuracy with only a moderate FPS reduction. This indicates that the SPM cluster NMS algorithm is well suited to the insulator defect dataset analyzed in this paper, so it is selected as the replacement for the batched NMS algorithm in this research.

    After replacing the traditional NMS algorithm with the SPM cluster NMS algorithm, this paper visualizes the detection boxes selected by RPN, as shown in Figure 13. The left image in Figure 13 is the RPN detection image using the batched NMS algorithm, while the right image is the RPN detection image using the SPM cluster NMS algorithm, with green boxes visualizing the candidate boxes.

    Figure 13.  Comparison of candidate boxes of two NMS algorithms.

To tackle the missed detections and false detections caused by complex backgrounds and occlusions, this paper implements the stage query recollection method and explores different integration functions.

This paper first conducts experimental comparisons of the three basic integration functions; the results are shown in Table 2.

    Table 2.  Comparative experiments of three basic integration functions.
Method | Hyperparameter | mAP@0.5:0.95/% | mAP@0.5/% | mAP@0.75/% | mAR@0.5/% | FPS | #Params
Original model | - | 74.3 | 95.3 | 87.3 | 95.8 | 22.2 | 41.0M
f1 | - | 75.2 | 95.7 | 87.3 | 95.3 | 22.6 | 41.0M
f2 | - | 74.0 | 95.3 | 86.9 | 96.0 | 15.7 | 41.1M
f3 | Ratio=0.5 | 75.3 | 96.0 | 87.7 | 95.3 | 21.4 | 41.6M
f3 | Ratio=1 | 75.6 | 96.4 | 87.0 | 97.0 | 21.7 | 41.3M
f3 | Ratio=4 | 75.0 | 95.6 | 86.9 | 96.4 | 21.9 | 41.1M


From Table 2, it can be seen that after introducing stage query recollection with f1 as the integration function, i.e., directly adding the high-level module query to the low-level module query, mAP@0.5:0.95 and mAP@0.5 improve by 0.9% and 0.4%, respectively, while the FPS remains close to that of the original model and no additional parameters are introduced. Using f2 as the integration function, i.e., a Conv block, introduces an additional parameter load of 0.1M, and almost all indicators decline to varying degrees. Using f3 as the integration function, i.e., the inner block with the hyperparameter Ratio set to 1, yields the best overall performance, with around a 1% improvement in mAP@0.5:0.95, mAP@0.5, and mAR@0.5; however, this integration function introduces an additional parameter load of 0.3M.

    This paper then introduces two hyperparameters based on the above three basic integration functions and conducts experiments.

    Table 3 shows comparative experiments of three integration functions after the inclusion of hyperparameters.

    Table 3.  Comparative experiments of three integrations with hyperparameters.
Method | Hyperparameter | mAP@0.5:0.95/% | mAP@0.5/% | mAP@0.75/% | mAR@0.5/% | FPS | #Params
Original model | - | 74.3 | 95.3 | 87.3 | 95.8 | 22.2 | 41.0M
f3 | Ratio=1 | 75.6 | 96.4 | 87.0 | 97.0 | 21.7 | 41.3M
f4 | W0=0.7, W1=0.3 | 75.2 | 95.9 | 86.7 | 96.7 | 22.8 | 41.0M
f4 | W0=0.6, W1=0.4 | 75.5 | 96.4 | 87.7 | 97.2 | 22.6 | 41.0M
f4 | W0=0.4, W1=0.6 | 75.1 | 95.3 | 87.8 | 96.0 | 22.7 | 41.0M
f4 | W0=0.3, W1=0.7 | 74.8 | 95.8 | 86.3 | 96.7 | 22.6 | 41.0M
f5 | W0=0.7, W1=0.3 | 74.3 | 95.3 | 86.6 | 96.1 | 15.8 | 41.1M
f5 | W0=0.6, W1=0.4 | 75.0 | 95.3 | 86.8 | 96.1 | 15.7 | 41.1M
f5 | W0=0.4, W1=0.6 | 74.5 | 95.7 | 88.1 | 96.5 | 15.6 | 41.1M
f5 | W0=0.3, W1=0.7 | 74.0 | 95.1 | 86.8 | 95.8 | 15.7 | 41.1M
f6 (Ratio=1) | W0=0.7, W1=0.3 | 74.9 | 95.4 | 87.7 | 96.5 | 21.5 | 41.3M
f6 (Ratio=1) | W0=0.6, W1=0.4 | 74.8 | 95.6 | 86.9 | 96.1 | 21.3 | 41.3M
f6 (Ratio=1) | W0=0.4, W1=0.6 | 75.1 | 95.7 | 86.4 | 96.4 | 21.4 | 41.3M
f6 (Ratio=1) | W0=0.3, W1=0.7 | 74.9 | 95.3 | 87.3 | 95.9 | 21.4 | 41.3M


From Table 3, it can be seen that after introducing stage query recollection with f4, W0=0.6, and W1=0.4, the mAP@0.5:0.95 and mAP@0.5 metrics are similar to those of f3 with Ratio=1, while f4 outperforms it in mAP@0.75 and introduces no additional parameters. After introducing the hyperparameters, f5 and f6 with Ratio=1 do not show a significant overall improvement; among them, f5 with W0=0.4 and W1=0.6 attains the highest mAP@0.75. In conclusion, this paper considers f3 with Ratio=1 and f4 with W0=0.6 and W1=0.4 to be the most suitable integration functions for the insulator defect dataset in this paper.

    Figure 14 shows the visual feature map comparison of the original model, f3 with Ratio=1, and f4 with W0=0.6 and W1=0.4. From left to right, the order is the original model, f3, and f4, with the detection targets highlighted in green boxes. In Figure 14, the color range from blue to green and then to red represents the attention level from low to high. Blue represents low attention, while red represents high attention.

    Figure 14.  Comparison of the visual feature map.

In Figure 14, the green boxes represent the annotated boxes. In the feature map generated by the original model on the far left, the model pays no attention, or only low attention, to the features at some annotated box positions; this cannot effectively support the subsequent classification stage and results in missed and false detections. A further conclusion can be drawn: after adding f3 with Ratio=1 or f4 with W0=0.6 and W1=0.4, the model pays more attention to detection targets with complex backgrounds, occlusions, and positions near the image edges, avoiding some missed detections and false positives and thereby improving the detection accuracy of the model.

    To further validate the effectiveness of all the proposed improvements in this paper, a detailed ablation experiment is conducted on the proposed improvements. Table 4 shows the results of the ablation experiment.

    Table 4.  Ablation experiment results.
SPM Cluster NMS | f3 (Ratio=1) | f4 (W0=0.6, W1=0.4) | mAP@0.5:0.95/% | mAP@0.5/% | mAP@0.75/%
 | | | 74.3 | 95.3 | 87.3
✓ | | | 75.4 | 95.4 | 87.4
 | ✓ | | 75.6 | 96.4 | 87.0
 | | ✓ | 75.5 | 96.4 | 87.7
✓ | ✓ | | 76.0 | 96.3 | 87.1
✓ | | ✓ | 76.1 | 96.2 | 87.9


    From Table 4, the following conclusions can be drawn. The original MViTv2-T model's mAP@0.5:0.95 and mAP@0.5 stabilize at 74.3% and 95.3% respectively; the improved MViTv2-T model outlined above surpasses the original model in terms of both mAP@0.5:0.95 and mAP@0.5 performance metrics. Specifically, with the inclusion of f3 with Ratio=1 and the SPM cluster NMS algorithm, the improved MViTv2-T model achieves mAP@0.5:0.95 and mAP@0.5 stabilizing at 76.0% and 96.3%, respectively; with the inclusion of f4 with W0=0.6 and W1=0.4, and the SPM Cluster NMS algorithm, the improved MViTv2-T model achieves mAP@0.5:0.95 and mAP@0.5 stabilizing at 76.1% and 96.2%, respectively. Hence, when contrasting the improved MViTv2-T model, which employs two distinct integration functions, with the original MViTv2-T model, there is an increase of approximately 1.8% in mAP@0.5:0.95 and about 1.0% in mAP@0.5.

    This paper compares the improved MViTv2-T model with the original MViTv2-T model, and the detection results are shown in Figure 15. The types are defined as follows: type 0 for broken, type 1 for defect, and type 2 for normal.

    Figure 15.  Contrast between our improved model and the original MViTv2-T model.

From Figure 15, the improved MViTv2-T model performs better than the original model on some previously missed and falsely detected targets. Additionally, for some detection targets with relatively low scores, the improved MViTv2-T model assigns higher scores.

    Figure 16 shows the contrast in mAP@0.5:0.95 and mAP@0.5 curves for our models. The red curve represents the original MViTv2-T model, the blue curve represents the MViTv2-T model using f3 with Ratio=1, and the green curve represents the MViTv2-T model using f4 with W0=0.6 and W1=0.4.

    Figure 16.  Contrast in mAP@0.5:0.95 and mAP@0.5 curve.

    Figure 17 shows the loss iteration process of the MViTv2-T model using f3 with Ratio=1 and f4 with W0=0.6 and W1=0.4. The entire training process used an NVIDIA GeForce RTX 4090 GPU with 24GB memory and lasted 14 hours.

    Figure 17.  Loss curve of f3 and f4.

    To objectively assess the advantages of the improvements in this paper, the improved MViTv2-T model is compared with commonly used defect detection models in industry. All defect detection models are trained on the same augmented dataset with the same software/hardware environment, using the same parameters during training. In the same insulator defect dataset, the comparative experiments with other models are shown in Table 5.

    Table 5.  Comparative experiments with other models in the same insulator defect dataset.
Models | mAP@0.5:0.95/% | mAP@0.5/% | mAP@0.75/% | mAR@0.5/% | FPS | GFLOPs
RetinaNet | 64.3 | 86.8 | 74.1 | 92.4 | 12.6 | 93.6
Faster-RCNN | 66.1 | 88.6 | 76.7 | 90.4 | 13.4 | 38.5
Mask-RCNN | 65.2 | 87.1 | 77.4 | 88.7 | 13.4 | 9.2
Cascade-RCNN | 69.0 | 87.9 | 78.6 | 90.2 | 8.5 | 9.2
FCOS | 53.2 | 81.6 | 60.7 | 90.6 | 13.3 | 80.1
YOLOv5n | 64.8 | 90.8 | 74.1 | 85.2 | 52.9 | 4.2
YOLOv5m | 73.5 | 94.5 | 85.3 | 91.6 | 31.1 | 48.2
YOLOv5l | 74.4 | 95.5 | 86.9 | 93.3 | 25.8 | 108.3
YOLOv7t | 66.2 | 90.8 | 74.9 | 86.1 | 44.4 | 12.9
YOLOv7 | 73.8 | 93.9 | 86.5 | 89.6 | 22.8 | 104.5
YOLOv8s | 72.3 | 91.8 | 81.3 | 88.4 | 41.3 | 28.3
YOLOv8m | 73.1 | 93.0 | 82.3 | 90.1 | 37.9 | 78.1
YOLOv8l | 73.2 | 92.7 | 81.7 | 88.1 | 29.2 | 163.9
YOLOv9c | 67.0 | 89.4 | 76.4 | 84.4 | 17.4 | 102.1
Ours-1 | 76.0 | 96.3 | 87.1 | 96.7 | 16.3 | 9.9
Ours-2 | 76.1 | 96.2 | 87.9 | 97.0 | 17.4 | 9.7


From Table 5, the following conclusions can be drawn: the improved MViTv2-T model presented in this paper demonstrates superior detection accuracy and recall compared to the other defect detection models. Furthermore, the improved MViTv2-T model demonstrates quicker detection compared to the other analyzed models with a similar number of parameters. However, it is worth noting that the improved MViTv2-T model lags behind the YOLO series models in FPS. Given the standard 5-minute sampling interval of monitoring equipment in industrial defect detection, the higher detection accuracy provided by the improved MViTv2-T model is particularly advantageous.

In this paper, we propose an improved MViTv2-T model by replacing the batched NMS algorithm with the SPM cluster NMS algorithm, introducing the stage query recollection method, and experimenting with various integration functions. As a result, the model's mAP@0.5:0.95 for detecting insulator defects in power transmission lines increases by about 1.7%, mAP@0.5 by about 1.0%, and mAR@0.5 by about 1.2%.

Among the pure CNN models, the YOLO series focuses on being lightweight and fast, while the R-CNN series focuses on higher accuracy. This paper introduces a transformer-based model that, despite a larger parameter count, achieves higher detection accuracy without a significant loss in detection speed. The model can be applied not only to insulator defect detection but also to other defect detection tasks.

Furthermore, the experiments in this paper still have shortcomings and limitations, such as: (1) not exploring broader integration functions; (2) not integrating the queries of more modules; (3) not fully utilizing the Query, Key, and Value matrices; (4) not using mathematical methods to select the optimal hyperparameters; (5) not emulating the attention mechanisms of deep convolutional networks to extend the hyperparameters to the length, width, and channel dimensions of the matrices. Future research will continue to build on the improved model. Our goal is to reduce the number of model layers and explore more effective integration functions that fully utilize the feature information generated by each layer. We aim to achieve higher detection performance with fewer parameters while maintaining or surpassing the current model's performance, making it more suitable for insulator defect detection and potentially applicable to other fields.

Fuhong Meng: Conceptualization, Software, Validation, Investigation, Resources, Methodology, Writing – review & editing; Guowu Yuan: Writing – original draft, Software, Validation, Supervision; Hao Zhou: Formal analysis; Hao Wu: Investigation, Project administration; Yi Ma: Data curation, Visualization. All authors have read and approved the final version of the manuscript for publication.

    This research was financially supported by the Key R & D Projects of Yunnan Province, China (Grant No. 202202AD080004) and the Yunnan Provincial Department of Science and Technology-Yunnan University Joint Special Project for Double-Class Construction, China (Grant No. 202201BF070001-005).

    The authors declare no conflicts of interest in this paper.



    [1] Zhao ZB, Jiang ZG, Li YX, Qi YC, Zhai YJ, Zhao WQ, et al. (2021) Overview of visual defect detection of transmission line components. Journal of Image and Graphics 26: 2545–2560. https://doi.org/10.11834/jig.200689 doi: 10.11834/jig.200689
    [2] Chen C, Yuan GW, Zhou H, Ma Y (2023) Improved YOLOv5s model for key components detection of power transmission lines. Math Biosci Eng 20: 7738–7761. https://doi.org/10.3934/mbe.2023334 doi: 10.3934/mbe.2023334
    [3] Liu HY, Yuan GW (2022) Cigarette appearance defect detection method based on improved YOLOv5s. Comput Technol Dev 32: 161–167. https://doi.org/10.3969/j.issn.1673-629X.2022.08.026 doi: 10.3969/j.issn.1673-629X.2022.08.026
    [4] Zhang Y, Dou Y, Yang K, Song X, Wang J, Zhao L (2024) Insulator defect detection based on BaS-YOLOv5. Multimed Syst 30: 212. https://doi.org/10.1007/s00530-024-01413-w doi: 10.1007/s00530-024-01413-w
    [5] Chen H, He Z, Shi B, Zhong T (2019) Research on recognition method of electrical components based on YOLO V3. IEEE Access 7: 157818–157829. https://doi.org/10.1109/ACCESS.2019.2950053 doi: 10.1109/ACCESS.2019.2950053
    [6] Su J, Yuan Y, Przystupa K, Kochan O (2024) Insulator defect detection algorithm based on improved YOLOv8 for electric power. Signal Image Video Process 18: 6197–6209. https://doi.org/10.1007/s11760-024-03307-w doi: 10.1007/s11760-024-03307-w
    [7] Li D, Yang P, Zou Y (2024) Optimizing Insulator Defect Detection with Improved DETR Models. Mathematics 12: 1507. https://doi.org/10.3390/math12101507 doi: 10.3390/math12101507
    [8] Yuan H, Wang J (2023) Power Insulator Defect Detection Based on Multi-scale Dense Adaptive Sensing. Mathematics 2661: 012001. https://doi.org/10.1088/1742-6596/2661/1/012001 doi: 10.1088/1742-6596/2661/1/012001
    [9] Zhang T, Zhong S, Xu W, Yan L, Zou X (2024) Catenary Insulator Defects Detection: A Dataset and an Unsupervised Baseline. IEEE T Instrum Meas 73: 1–15. https://doi.org/10.1109/TIM.2024.3390695 doi: 10.1109/TIM.2024.3390695
    [10] Wang S, Liu Y, Qing Y, Wang C, Lan T, Yao R (2020) Detection of insulator defects with improved ResNeSt and region proposal network. IEEE Access 8: 184841–184850. https://doi.org/10.1109/ACCESS.2020.3029857 doi: 10.1109/ACCESS.2020.3029857
    [11] Mei H, Jiang H, Chen J, Yin F, Wang L, Farzaneh M (2021) Detection of internal defects of full-size composite insulators based on microwave technique. IEEE T Instrum Meas 70: 1–10. https://doi.org/10.1109/TIM.2021.3085111 doi: 10.1109/TIM.2021.3085111
    [12] Cao Z, Chen K, Chen J, Chen Z, Zhang M (2024) CACS-YOLO: A Lightweight Model for Insulator Defect Detection based on Improved YOLOv8m. IEEE T Instrum Meas 73: 1–10. https://doi.org/10.1109/TIM.2024.3453332 doi: 10.1109/TIM.2024.3453332
    [13] Han G, Yuan Q, Zhao F, Wang R, Zhao L, Li S, et al. (2023) An improved algorithm for insulator and defect detection based on yolov4. Electronics 12: 933. https://doi.org/10.3390/ELECTRONICS12040933 doi: 10.3390/ELECTRONICS12040933
    [14] Zhao ZB, Li Y, Qi YC, Kong YH, Nie LQ (2020) Insulator defect detection method based on dynamic focus loss function and sample balance method. Electric Power Automation Equipment 40: 205–211. https://doi.org/10.16081/j.epae.202010008 doi: 10.16081/j.epae.202010008
    [15] Guo L, Liao Y, Yao H, Chen J, Wang M (2018) An electrical insulator defects detection method combined human receptive field model. J Control Sci Eng 2018: 2371825. https://doi.org/10.1155/2018/2371825 doi: 10.1155/2018/2371825
    [16] Li T, Hao T (2022) Damage detection of insulators in catenary based on deep learning and Zernike moment algorithms. Appl Sci 12: 5004. https://doi.org/10.3390/app12105004 doi: 10.3390/app12105004
    [17] Qi Y, Li Y, Du A (2023) Research on an insulator defect detection method based on improved yolov5. Appl Sci 13: 5741. https://doi.org/10.3390/app13095741 doi: 10.3390/app13095741
    [18] Zhang H, Huang G, Yang L (2023) Insulator defect detection algorithm based on multi-scale feature fusion optimization. International Conference on Algorithms, High Performance Computing, and Artificial Intelligence (AHPCAI 2023) 12941: 226–232. https://doi.org/10.1117/12.3011642 doi: 10.1117/12.3011642
    [19] Fan H, Xiong B, Mangalam K, Li Y, Yan Z, Malik J, et al. (2021) Multiscale vision transformers. Proceedings of the IEEE/CVF international conference on computer vision 2021: 6824–6835. https://doi.org/10.48550/arXiv.2104.11227 doi: 10.48550/arXiv.2104.11227
    [20] Li Y, Wu CY, Fan H, Mangalam K, Xiong B, Malik J, et al. (2022) Mvitv2: Improved multiscale vision transformers for classification and detection. Proceedings of the IEEE/CVF conference on computer vision and pattern recognition 2022: 4804–4814. https://doi.org/10.48550/arXiv.2112.01526 doi: 10.48550/arXiv.2112.01526
    [21] Zheng Z, Wang P, Ren D, Liu W, Ye R, Hu Q, et al. (2021) Enhancing geometric factors in model learning and inference for object detection and instance segmentation. IEEE Trans Cybern 52: 8574–8586. https://doi.org/10.1109/TCYB.2021.3095305 doi: 10.1109/TCYB.2021.3095305
    [22] Lou M, Zhou HY, Yang S, Yu Y (2023) TransXNet: learning both global and local dynamics with a dual dynamic token mixer for visual recognition. arXiv preprint arXiv: 2310.19380. https://doi.org/10.48550/arXiv.2310.19380
    [23] Bodla N, Singh B, Chellappa R, Davis LS (2017) Soft-NMS–improving object detection with one line of code. Proceedings of the IEEE international conference on computer vision 2017: 5561–5569. https://doi.org/10.48550/arXiv.1704.04503 doi: 10.48550/arXiv.1704.04503
    [24] Zheng Z, Wang P, Liu W, Li J, Ye R, Ren D (2020) Distance-IoU loss: Faster and better learning for bounding box regression. Proceedings of the AAAI conference on artificial intelligence 34: 12993–13000. https://doi.org/10.1609/aaai.v34i07.6999 doi: 10.1609/aaai.v34i07.6999
    [25] Bolya D, Zhou C, Xiao F, Lee YJ (2019) Yolact: Real-time instance segmentation. Proceedings of the IEEE/CVF international conference on computer vision 2019: 9157–9166. https://doi.org/10.48550/arXiv.1904.02689 doi: 10.48550/arXiv.1904.02689
© 2025 the Author(s), licensee AIMS Press. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0)
