Effect of dual-convolutional neural network model fusion for Aluminum profile surface defects classification and recognition

Xiaochen Liu; Weidong He; Yinghui Zhang; Shixuan Yao; Ze Cui; Xiaochen Liu; Weidong He; Yinghui Zhang; Shixuan Yao; Ze Cui

doi:10.3934/mbe.2022046

Mathematical Biosciences and Engineering

2022, Volume 19, Issue 1: 997-1025. doi: 10.3934/mbe.2022046

Previous Article Next Article

Research article

Effect of dual-convolutional neural network model fusion for Aluminum profile surface defects classification and recognition

1.
School of Mechanical Engineering, Dalian Jiaotong University, Dalian 116028, China
2.
School of Software Engineering, Dalian University of Foreign Languages, Dalian 116044, China
3.
School of Control Science and Engineering, Dalian University of Technology, Dalian 116024, China

Received: 15 August 2021 Accepted: 01 November 2021 Published: 25 November 2021

Classifying and identifying surface defects is essential during the production and use of aluminum profiles. Recently, the dual-convolutional neural network(CNN) model fusion framework has shown promising performance for defects classification and recognition. Spurred by this trend, this paper proposes an improved dual-CNN model fusion framework to classify and identify defects in aluminum profiles. Compared with traditional dual-CNN model fusion frameworks, the proposed architecture involves an improved fusion layer, fusion strategy, and classifier block. Specifically, the suggested method extracts the feature map of the aluminum profile RGB image from the pre-trained VGG16 model's pool5 layer and the feature map of the maximum pooling layer of the suggested A4 network, which is added after the Alexnet model. then, weighted bilinear interpolation unsamples the feature maps extracted from the maximum pooling layer of the A4 part. The network layer and upsampling schemes ensure equal feature map dimensions ensuring feature map merging utilizing an improved wavelet transform. Finally, global average pooling is employed in the classifier block instead of dense layers to reduce the model's parameters and avoid overfitting. The fused feature map is then input into the classifier block for classification. The experimental setup involves data augmentation and transfer learning to prevent overfitting due to the small-sized data sets exploited, while the K cross-validation method is employed to evaluate the model's performance during the training process. The experimental results demonstrate that the proposed dual-CNN model fusion framework attains a classification accuracy higher than current techniques, and specifically 4.3% higher than Alexnet, 2.5% for VGG16, 2.9% for Inception v3, 2.2% for VGG19, 3.6% for Resnet50, 3% for Resnet101, and 0.7% and 1.2% than the conventional dual-CNN fusion framework 1 and 2, respectively, proving the effectiveness of the proposed strategy.

Keywords:

Citation: Xiaochen Liu, Weidong He, Yinghui Zhang, Shixuan Yao, Ze Cui. Effect of dual-convolutional neural network model fusion for Aluminum profile surface defects classification and recognition[J]. Mathematical Biosciences and Engineering, 2022, 19(1): 997-1025. doi: 10.3934/mbe.2022046

Related Papers:

[1]	Yong Tian, Tian Zhang, Qingchao Zhang, Yong Li, Zhaodong Wang . Feature fusion–based preprocessing for steel plate surface defect recognition. Mathematical Biosciences and Engineering, 2020, 17(5): 5672-5685. doi: 10.3934/mbe.2020305
[2]	Zhigao Zeng, Cheng Huang, Wenqiu Zhu, Zhiqiang Wen, Xinpan Yuan . Flower image classification based on an improved lightweight neural network with multi-scale feature fusion and attention mechanism. Mathematical Biosciences and Engineering, 2023, 20(8): 13900-13920. doi: 10.3934/mbe.2023619
[3]	Guozhen Dong . A pixel-wise framework based on convolutional neural network for surface defect detection. Mathematical Biosciences and Engineering, 2022, 19(9): 8786-8803. doi: 10.3934/mbe.2022408
[4]	Keying Du, Liuyang Fang, Jie Chen, Dongdong Chen, Hua Lai . CTFusion: CNN-transformer-based self-supervised learning for infrared and visible image fusion. Mathematical Biosciences and Engineering, 2024, 21(7): 6710-6730. doi: 10.3934/mbe.2024294
[5]	Jiaming Ding, Peigang Jiao, Kangning Li, Weibo Du . Road surface crack detection based on improved YOLOv5s. Mathematical Biosciences and Engineering, 2024, 21(3): 4269-4285. doi: 10.3934/mbe.2024188
[6]	Hongxia Ni, Minzhen Wang, Liying Zhao . An improved Faster R-CNN for defect recognition of key components of transmission line. Mathematical Biosciences and Engineering, 2021, 18(4): 4679-4695. doi: 10.3934/mbe.2021237
[7]	Zhangjie Wu, Minming Gu . A novel attention-guided ECA-CNN architecture for sEMG-based gait classification. Mathematical Biosciences and Engineering, 2023, 20(4): 7140-7153. doi: 10.3934/mbe.2023308
[8]	Xiao Zou, Jintao Zhai, Shengyou Qian, Ang Li, Feng Tian, Xiaofei Cao, Runmin Wang . Improved breast ultrasound tumor classification using dual-input CNN with GAP-guided attention loss. Mathematical Biosciences and Engineering, 2023, 20(8): 15244-15264. doi: 10.3934/mbe.2023682
[9]	Yongmei Ren, Xiaohu Wang, Jie Yang . Maritime ship recognition based on convolutional neural network and linear weighted decision fusion for multimodal images. Mathematical Biosciences and Engineering, 2023, 20(10): 18545-18565. doi: 10.3934/mbe.2023823
[10]	Haifeng Song, Weiwei Yang, Songsong Dai, Lei Du, Yongchen Sun . Using dual-channel CNN to classify hyperspectral image based on spatial-spectral information. Mathematical Biosciences and Engineering, 2020, 17(4): 3450-3477. doi: 10.3934/mbe.2020195

Abstract

1. Introduction

The aluminum profile is a relatively common material in infrastructure construction and industrial manufacturing that is lightweight, has high strength, corrosion resistance, formability, and is recyclable ^[1]. It is extensively used in rail transit, construction facilities, automobile manufacturing, equipment manufacturing, medical equipment, and other industries ^[2]. In the production or use process of aluminum profiles, due to external factors, it may present the defects of inconsistency in various sizes and shapes, seriously affecting the safety and reliability of aluminum profiles. Therefore, detecting and ensuring the surface quality of aluminum profiles is significant to improve the product's service life ^[3].

In the past, object defects were commonly detected manually. It was a simple, highly repetitive, cost-wasting and labor work, where accuracy and stability could not be guaranteed. With the advancement of optical instruments, numerous scholars have used machine vision to realize defect recognition and improve the detection stability and recognition rate ^[4]. For example, Gao et al. ^[5] exploit thermal imaging technology to propose a low-rank tensor sparse mixture Gaussian (MoG) decomposition algorithm for natural crack detection. Their method reduces noise interference and extracts crack information to realize metal defect detection. Luo et al. ^[6] suggest a hybrid spatial and temporal deep learning architecture for automatic thermography defects detection that extracts internal defect information of composite materials with complex shapes and patterns. Accordingly, Hu et al. ^[7] developed a hybrid multi-dimensional feature fusion structure involving spatial and temporal segmentation appropriate for automated thermography defect detection of composite materials. Ahmed et al. ^[8] use the optical pulse thermal imaging diagnosis system and propose a joint sparse low-rank matrix decomposition algorithm to separate weak defect information from intense noise in composite materials and improve defect resolution. Sun et al. ^[9] investigate weld defect detection and classification based on machine vision. They categorize the weld defects and suggest a modified background subtraction method based on Gaussian mixture models to extract the feature areas of the weld defects, which are then employed to design classification algorithms. Zhang et al. ^[10] design an image acquisition system to simultaneously collect weld images and propose a new CNN classification model with 11 layers to identify weld penetration defects based on weld imagery. Bao et al. ^[11] propose a Triplet-Graph Reasoning Network (TGRNet), which combines surface defect triples (including a triple encoder and triple loss) to segment the background and defect areas, and separates them into metal and non-metal classes (leather and tile). For this method, the data is centralized to verify the network's effectiveness. Shervan et al. ^[12] focus on the surface defect detection problem, considering a new noise-resistant and multi-resolution version of LBP to extract surface features. Additionally, the authors propose a surface defect detection algorithm that is invariant to the texture descriptor. The effectiveness of this technique is verified in architectonic stone and Fabric Textile. Jong et al. ^[13] suggest a new convolutional variational autoencoder (CVAE) to generate sufficient defect data. Defect classification algorithm based on deep CNN for metal surface defect detection has also been proposed. Ihor et al. ^[14] design an automated method for detecting and classifying three types of surface defects in rolled metal, and use Resnet50 for feature extraction and defect classification. Guan et al. ^[15] utilize VGG19 to extract steel surface defects and suggest different feature layers originating from the defect weight model. Then the authors employ SSIM and a decision tree to evaluate the image quality and adjust the network's structure and classify steel surface defects.

The latter method uses image processing and deep learning methods to extract the defect features of various objects effectively, and to a certain extent, provides insights for the method developed in this paper. Since this article mainly considers identifying and classifying defects on the aluminum profiles surface, the following works introduce the related literature. Defect recognition exploiting conventional machine vision mainly includes image capturing, feature extraction and definition, image preprocessing, and defect-recognition ^[16]. In this regard, the defect recognition accuracy is seriously affected by the accuracy of the feature extraction process and the method defining the features. Liu et al. ^[17] employ the gray-level co-occurrence matrix algorithm and the Gabor wavelet transform method to extract the surface texture features of aluminum profiles. They classify the features based on the radial basis function kernel SVM (Support Vector Machines) classification algorithm. Chondronasios et al. ^[18] propose a new technology based on the gradient-only co-occurrence matrix (GOCM) and the Sobel operator to extract and define the surface features of aluminum profiles.The authors use two-layer ANNs to classify the surface defects of aluminum profiles. Although traditional machine vision-based methods utilize image processing for surface defect feature extraction and defects classification, the extraction and defects definition requires manual processing and empirical judgment by engineers ^[19], which lacks robustness and is not conducive to operation.

Recently, deep learning has been extensively used in various application, including feature extraction and classification of aluminum profiles surface defects, due to its ability to learn image features automatically. In the context of aluminum profiles surface defects, Li et al. ^[20] rely on the adaptive threshold method to binarize the surface image of the aluminum plate, extract image features, and implement surface defect classification through a three-layer BP neural network. Wei et al. ^[21] utilize Resnet101 as the primary network and propose a multi-scale defect detection network based on deep learning to identify and classify surface defects of aluminum profiles. Neuhauser et al. ^[22] propose a VGG16 based architecture suitable for actual industrialization exploiting transfer learning. and data augmentation to increase the data set, avoiding model overfitting. Zhang et al. ^[23] design an attention mechanism to detect surface defects of aluminum profiles. This method initially exploits the category representation network to extract the common category feature map (CCM). Then, the attention module generates the proposed feature map (PM), and a rare category feature map (RCM) is formed through CCM and PM. After that, the score of the defect category is obtained through CCM and RCM spatial pooling for defects identification. Chen et al. ^[24] propose an aluminum profiles surface defect detection method relying on a deep self-attention mechanism (DSAM) under hybrid noise conditions. This technique employs the residual learning strategy to obtain the defect feature map from the image, adds the corresponding weight matrix to the defect feature map to achieve fine feature extraction, and finally adds a softmax classification layer for defect recognition. Liu et al. ^[25] develop a semi-supervised anomaly detection method, entitled Dual Prototype Auto-Encoder (DPAE). During the training phase, a dual prototype loss and reconstruction loss are introduced to encourage the latent vector generated by the encoder to be closer to its own prototype. Finally, the distance between the image's latent vectors is used to detect and identify the surface defects of the aluminum profile.

The above works exploit deep learning to identify and classify the surface defects of aluminum profiles and achieve good experimental results. Additionally, compared with traditional machine vision, deep learning-based feature extraction is more robust. However, there are still some issues that need to be resolved. For example, current deep learning methods utilize an input source, a neural network model, and the characteristic information of a single information source extracted through the neural network, which cannot fully reflect the characteristics of the object examined ^[26].

To solve these problems, the defect classification accuracy can be enhanced through a dual-convolutional neural network(CNN) model fusion framework that extracts the input source features separately, which are then fused. A dual-CNN model fusion framework may have two forms, either employing two different input sources or the same input source. In the former case, the same neural network model extracts features of different input sources and then merges them for classification ^{[26,27,28,29,30,31]}. This case involves neural network models with specific structure differences, e.g., CNN convolution kernel size and number, and several operation differences in the model learning process. In the same input source case, the classification performance varies depending on the extracted features ^[32]. For this fusion scheme, two different convolutional neural networks separately extract features and then merge them, aiming during the design, the extracted features to complement each other ^[32,33].

Duan et al. ^[26]propose a dual-CNN model fusion framework based on gradient images to identify and classify the surface defects of aluminum profiles. The original and gradient images are used as two different input sources, while both neural network models use Alexnet and realize feature fusion through wavelet transform fusion. Then the fused features are input into the SVM classifier block for defect classification, Akilan et al. ^[33] use the VGG16 and Alexnet networks to extract features from two identical input sources, and employ PCA (Principal Component Analysis) and energy normalization to form a feature space. This work also utilizes algorithm rules (Sum, Average, Max, Min) to fuze the features, with several rules being evaluated to select the optimal fusion strategy. The fuzed features are then input into an SVM classifier block for classification. Experimental results employing this method demonstrate that the Sum strategy is effective in most data sets. The first fusion framework mentioned above combines the output features of the first dense layer of the two Alexnet models, while the second fusion framework combines the output features of the first dense layer of VGG16 and the second dense layer of Alexnet. Both model fusion frameworks have a common attribute: the fused features are first input to the first dense layer of the classifier block, and then classification is achieved through multiple network layers. (the fusion framework will be introduced in the next part of this article). It should be noted that given the lack of research on recognizing and classifying aluminum profile surface defects utilizing a dual-CNN fusion framework, this article mainly refers to methods applied in other fields aiming to suggest the necessary improvements to facilitate a solution appropriate for aluminum profiles.

This article proposes an improved dual-CNN model fusion framework that uses the same input source and different convolutional neural networks (VGG16 and Alexnet). We add multiple network layers before the feature fusion process and after the Alexnet network (including convolution, pooling, and activation). The RGB image feature map is extracted from the last maximum pooling layer in the pre-trained VGG16 and the last maximum pooling layer in the network layer added after the Alexnet network. Then, we use weighted bilinear interpolation to upsample the the maximum pooling layer feature maps of the network layer added after Alexnet to ensure that the feature maps output by the two models have the same dimensions. Feature map fusion relies on the improved wavelet transform fusion method. Finally, our method develops a classifier block (see Section 2) utilizing a global average pooling layer instead of a dense layer.

Compared with traditional dual-CNN model fusion frameworks ^[26,33], we extract the feature maps of the largest pooling layer at the end of the proposed CNN model, fuse these feature maps, and use global average pooling for classification rather than dense layers. This strategy preserves more local feature information extracted from the image and reduces the model's dimensionality, making the network easier to train, avoiding too many weight parameters when the feature map enters the dense layer, which leads to overfitting during the model training process ^[34,35,36]. Regarding the feature fusion strategy, the improved dual-CNN model fusion framework uses an improved wavelet transform that combines the Canny operator and the area energy method (see Section 2).

The remainder of this article is organized as follows. Section 2 introduces the dual-CNN model fusion framework, network layer function, up-sampling, feature fusion methods, and model training methods proposed in this paper. Section 3 describes the experimental setup and the evaluation metrics, while Section 4 presents the experimental results and analysis. Finally, Section 5 concludes this work.

To improve readability, some of the abbreviations presented throughout the text are defined as follows. Support Vector Machines (SVM) is a class of generalized linear classifiers that classifies binary data in a supervised learning manner. Its decision boundary is the maximum hyperplane margin solved for the learning sample. Principal Component Analysis (PCA) is a standard data analysis method, often used for dimensionality reduction of high-dimensional data that can be utilized to extract data's main feature components. Local Response Normalization (LRN) is a local normalization method that primarily prevents the neural network model from overfitting during the training process.

2. Methods

This section mainly introduces the related methods utilized in the experiments of Section 3, including the dual-CNN model fusion framework, the definition of the relevant network layers, the feature map upsampling method, the feature fusion strategy, transfer learning, data augmentation, and model performance evaluation methods.

2.1. Dual-convolution neural network model fusion frameworks

A traditional convolutional neural network framework includes a neural network model and a single classifier to extract feature information from the input source. This framework is called a single convolutional neural network. In contrast, the multi-convolutional neural network model fusion framework involves multiple convolutional neural network models that extract several features from given training data and inputs the fused features into a single classifier for classification ^[28]. The dual-CNN model fusion framework includes two network models. The input source features are extracted from the two models, are fused ^[29,30,32], and are then input into a single classifier for classification. Figures 1 and 2 illustrate the two different dual-CNN model fusion strategies ^[26,33].

Figure 1. Fusion framework 1.

Category	Number of original images collected		Number of images enhanced by data
Category	train	test	train	test
Intact	828	93	2484	276
Bruise	522	59	1566	174
Dirty spots(DS)	980	108	2944	328
Scratch	882	98	2646	294

Fold	Training		Validation
Fold	Accuracy	Loss	Accuracy	Loss
1	0.973	0.121	0.965	0.129
2	0.980	0.109	0.968	0.128
3	0.975	0.118	0.966	0.133
4	0.983	0.097	0.971	0.116
5	0.976	0.104	0.981	0.112
Average	0.977±0.006	0.109±0.012	0.970±0.011	0.124±0.012

Combinations	Accuracy		Loss
Combinations	Training	Validation	Training	Validation
BI+SF	0.911	0.910	0.382	0.326
BI+MF	0.917	0.927	0.278	0.266
BI+WTF	0.943	0.938	0.147	0.161
BI+IWTF	0.961	0.952	0.121	0.119
WBI+SF	0.939	0.935	0.174	0.181
WBI+MF	0.920	0.905	0.325	0.297
WBI+WTF	0.956	0.951	0.129	0.132
WBI+ IWTF	0.983	0.971	0.097	0.116
BCI+SF	0.935	0.928	0.212	0.262
BCI+MF	0.916	0.909	0.316	0.299
BCI+WTF	0.953	0.948	0.136	0.144
BCI+IWTF	0.968	0.957	0.117	0.123

method	Acc(%)	AP(%)	AT(%)	Af(%)
Fusion frameworks1	94.4	93.8	94.3	94.1
Fusion frameworks2	93.9	93.5	93.3	93.6
Fusion frameworks(proposed)	95.1	94.9	95.0	94.9
(Acc:Accuracy AP: Average PPV AT: Average TPR Af:Average F-score)

[1]	Z. W. Liu, L. X. Li, J. Yi, S. K. Li, Z. H. Wang, G. Wang, Influence of heat treatment conditions on bending characteristics of 6063 aluminum alloy sheets, T. Nonferr. Metal. Soc., 27 (2017), 1498–1506. doi: 10.1016/s1003-6326(17)60170-5. doi: 10.1016/s1003-6326(17)60170-5
[2]	S. Bingol, A. Bozaci, Experimental and Numerical Study on the Strength of Aluminum Extrusion Welding, Materials (Basel), 8 (2015), 4389-4399. doi: 10.3390/ma8074389. doi: 10.3390/ma8074389
[3]	L. Donati, L. Tomesani, The effect of die design on the production and seam weld quality of extruded aluminum profiles, J. Mater. Process. Technol., 164-165 (2005), 1025–1031. doi: 10.1016/j.jmatprotec.2005.02.156. doi: 10.1016/j.jmatprotec.2005.02.156
[4]	C. T. Mgonja, A review on effects of hazards in foundries to workers and environment, IJISET: Int. J. Innov. Sci. Eng. Technol., 4 (2017), 326–334.
[5]	J. Ahmed, B. Gao, W. l. Woo, Sparse low-rank tensor decomposition for metal defect detection using thermographic imaging diagnostics, IEEE T. Ind. Inform., 17 (2020), 1810–1820. doi: 10.1109/TⅡ.2020.2994227. doi: 10.1109/TⅡ.2020.2994227
[6]	Q. Luo, B. Gao, W. l. Woo, Y. Yang, Temporal and spatial deep learning network for infrared thermal defect detection, NDT & E. Int., 108 (2019), 102164. doi: 10.1016/j.ndteint.2019.102164. doi: 10.1016/j.ndteint.2019.102164
[7]	B. Z. Hu, B. Gao, W. l. Woo, L. F. Ruan, J. K. Jin, A Lightweight Spatial and Temporal Multi-Feature Fusion Network for Defect Detection, IEEE T. Image Process., 30 (2020), 472–486. doi: 10.1109/TIP.2020.3036770. doi: 10.1109/TIP.2020.3036770
[8]	J. Ahmed, B. Gao, W. l. Woo, Y. Zhu, Ensemble Joint Sparse Low-Rank Matrix Decomposition for Thermography Diagnosis System, IEEE T. Ind. Electronics, 68 (2020), 2648–2658. doi: 10.1109/TIE.2020.2975484. doi: 10.1109/TIE.2020.2975484
[9]	J. Sun, C. Li, X. J. Wu, V. Palade, W. Fang, An effective method of weld defect detection and classification based on machine vision, IEEE T. Ind. Inform., 15 (2019), 6322–6333. doi: 10.1109/TⅡ.2019.2896357. doi: 10.1109/TⅡ.2019.2896357
[10]	Z. F. Zhang, G. R. Wen, S. B. Chen, Weld image deep learning-based on-line defects detection using convolutional neural networks for Al alloy in robotic arc welding, J. Manuf. Process., 45 (2019), 208–216. Doi: 10.1016/j.jmapro.2019.06.023. doi: 10.1016/j.jmapro.2019.06.023
[11]	Y. Q. Bao, K. C. Song, J. Liu, Y. Y. Wang, Y. H. Yan, H. Yu, et al., Triplet-Graph Reasoning Network for Few-shot Metal Generic Surface Defect Segmentation, IEEE Trans. Instrum. Meas., 70 (2021). doi: 10.1109/TIM.2021.3083561. doi: 10.1109/TIM.2021.3083561
[12]	S. Fekri-Ershad, F. Tajeripour, Multi-resolution and noise-resistant surface defect detection approach using new version of local binary patterns, Appl. Artif. Intell., 31 (2017), 395–410. doi: 10.1080/08839514.2017.1378012. doi: 10.1080/08839514.2017.1378012
[13]	P. Y. Jong, C. S. Woosang, K. Gyogwon, S. K. Min, L. Chungki, J. L. Sang, Automated defect inspection system for metal surfaces based on deep learning and data augmentation, J. Manuf. Syst., 55 (2020), 317–324. doi: 10.1016/j.jmsy.2020.03.009. doi: 10.1016/j.jmsy.2020.03.009
[14]	K. Ihor, M. Pavlo, B. Janette, B. Jakub, Steel surface defect classification using deep residual neural network, Metals, 10 (2020), 846. doi: 10.3390/met10060846. doi: 10.3390/met10060846
[15]	S. H. Guan, M. Lei, H. Lu, A steel surface defect recognition algorithm based on improved deep learning network model using feature visualization and quality evaluation, IEEE Access, 8 (2020), 49885–49895. doi: 10.1109/ACCESS.2020.2979755. doi: 10.1109/ACCESS.2020.2979755
[16]	B. Zhang, M. M. Liu, Y. Z. Tian, G. Wu, X. H. Yang, S. Y. Shi, et al., Defect inspection system of nuclear fuel pellet end faces based on machine vision, J. Nucl. Sci. Technol., 57 (2020), 617–623. doi: 10.1080/00223131.2019.1708827. doi: 10.1080/00223131.2019.1708827
[17]	Z. H. Liu, H. B. Shi, X. F. Zhou, Aluminum Profile Type Recognition Based on Texture Features, Appl. Mech. Mater., 556–562 (2014), 2846–2851. doi: 10.4028/www.scientific.net/AMM.556-562.2846. doi: 10.4028/www.scientific.net/AMM.556-562.2846
[18]	A. Chondronasios, I. Popov, I, Jordanov., Feature selection for surface defect classification of extruded aluminum profiles, Int. J. Adv. Manuf. Technol., 83 (2015), 33–41. doi: 10.1007/s00170-015-7514-3. doi: 10.1007/s00170-015-7514-3
[19]	A. Krizhevsky, I. Sutskever, G. E. Hinton, ImageNet classification with deep convolutional neural networks, Commun. ACM, 60 (2017), 84–90.
[20]	Q. H. Li, D. Liu, Aluminum Plate Surface Defects Classification Based on the BP Neural Network, Appl. Mech. Mater., 734 (2015), 543–547. doi: 10.4028/www.scientific.net/AMM.734.543. doi: 10.4028/www.scientific.net/AMM.734.543
[21]	R. F. Wei, Y. B. Bi, Research on Recognition Technology of Aluminum Profile Surface Defects Based on Deep Learning, Materials (Basel), 12 (2019), 1681. doi: 10.3390/ma12101681. doi: 10.3390/ma12101681
[22]	F. M. Neuhauser, G. Bachmann, P. Hora, Surface defect classification and detection on extruded aluminum profiles using convolutional neural networks, Int. J. Mater. Form., 13 (2019), 591–603. doi: 10.1007/s12289-019-01496-1. doi: 10.1007/s12289-019-01496-1
[23]	D. F. Zhang, K. C. Song, J. Xu, Y. He, Y. H. Yan, Unified detection method of aluminium profile surface defects: Common and rare defect categories, Opt. Lasers Eng., 126 (2020), 105936. doi: 10.1016/j.optlaseng.2019.105936. doi: 10.1016/j.optlaseng.2019.105936
[24]	R. X. Chen, D. Y. Cai, X. L. Hu, Z. Zhan, S. Wang, Defect Detection Method of Aluminum Profile Surface Using Deep Self-Attention Mechanism under Hybrid Noise Conditions, IEEE Trans. Instrum. Meas., (2021). doi: 10.1109/TIM.2021.3109723. doi: 10.1109/TIM.2021.3109723
[25]	J. Liu, K. C. Song, M. Z. Feng, Y. H. Yan, Z. B. Tu, L. Liu, Semi-supervised anomaly detection with dual prototypes autoencoder for industrial surface inspection, Opt. Lasers Eng., 136 (2021), 106324. doi: 10.1016/j.optlaseng.2020.106324. doi: 10.1016/j.optlaseng.2020.106324
[26]	C. M. Duan, T. C. Zhang, Two-Stream Convolutional Neural Network Based on Gradient Image for Aluminum Profile Surface Defects Classification and Recognition, IEEE Access, 8 (2020), 172152-172165. doi: 10.1109/ACCESS.2020.3025165. doi: 10.1109/ACCESS.2020.3025165
[27]	Y. L. Yu, F. X. Liu, A Two-Stream Deep Fusion Framework for High-Resolution Aerial Scene Classification, Comput. Intell. Neurosci., 2018 (2018), 8639367. doi: 10.1155/2018/8639367. doi: 10.1155/2018/8639367
[28]	C. Khraief, F. Benzarti, H. Amiri, Elderly fall detection based on multi-stream deep convolutional networks, Multimed. Tools Appl., 79 (2020), 19537–19560. doi: 10.1007/s11042-020-08812-x. doi: 10.1007/s11042-020-08812-x
[29]	W. Ye, J. Cheng, F. Yang, Y. Xu, Two-Stream Convolutional Network for Improving Activity Recognition Using Convolutional Long Short-Term Memory Networks, IEEE Access, 7 (2019), 67772–67780. doi: 10.1109/ACCESS.2019.2918808. doi: 10.1109/ACCESS.2019.2918808
[30]	Q. S. Yan, D. Gong, Y. N. Zhang, Two-Stream Convolutional Networks for Blind Image Quality Assessment, IEEE Trans. Image Process., 28 (2019), 2200–2211. doi: 10.1109/TIP.2018.2883741. doi: 10.1109/TIP.2018.2883741
[31]	T. Zhang, H. Zhang, R. Wang, Y. D. Wu, A new JPEG image steganalysis technique combining rich model features and convolutional neural networks, Math. Biosci. Eng., 16 (2019), 4069–4081. doi: 10.3934/mbe.2019201. doi: 10.3934/mbe.2019201
[32]	M. Uno, X. H. Han, Y. W. Chen, Comprehensive Study of Multiple CNNs Fusion for Fine-Grained Dog Breed Categorization, 2018 IEEE Int. Sym. Multim. (ISM), (2018), 198–203. doi: 10.1109/ISM.2018.000-7. doi: 10.1109/ISM.2018.000-7
[33]	T. Akilan, Q. J. Wu, H. Zhang, Effect of fusing features from multiple DCNN architectures in image classification, IET Image Process., 12 (2018), 1102–1110.
[34]	D. J. Li, H. T. Guo, B. M. Zhang, C. Zhao, D. H. Yu, Double vision full convolution network for object extraction in remote sensing imagery, J. Image Graph., 25 (2020), 0535–0545.
[35]	M. Lin, Q. Chen, S. Yan, Network In Network, arXiv preprint arXiv: 1312. 4400(2013).
[36]	K. M. He, X. Zhang, S. Q. Ren, J. Sun, Deep residual learning for image recognition, Proc. IEEE confer. Computer vis. Pattern recognit., (2016), 770–778.
[37]	C. Szegedy, W. Liu, Y. Q. Jia, P. Sermanet, S. Reed, D. Anguelov, et al., Going deeper with convolutions, Proc. IEEE confer. Computer vis. Pattern recognit., (2015), 1–9.
[38]	K. Simonyan, A. Zisserman, Very Deep Convolutional Networks for Large-Scale Image Recognition, arXiv preprint arXiv: 1409. 1556 (2014).
[39]	Y. Lecun, Y. Bengio, Convolutional Networks for Images, Speech, and Time-Series, The Handbook of Brain Theory & Neural Networks, 3361 (10), 1995.
[40]	V. Suarez-Paniagua, I. Segura-Bedmar, Evaluation of pooling operations in convolutional architectures for drug-drug interaction extraction, BMC Bioinformatics, 19 (2018), 209. doi: 10.1186/s12859-018-2195-1. doi: 10.1186/s12859-018-2195-1
[41]	X. L. Zhang, J. F. Xu, J. Yang, L. Chen, H. B. Zhou, X. J. Liu, et al., Understanding the learning mechanism of convolutional neural networks in spectral analysis, Anal Chim Acta, 1119 (2020), 41–51. doi: 10.1016/j.aca.2020.03.055. doi: 10.1016/j.aca.2020.03.055
[42]	S. W. Kwon, I. J. Choi, J. Y. Kang, W. I. Jang, G. H. Lee, M. C. Lee, Ultrasonographic Thyroid Nodule Classification Using a Deep Convolutional Neural Network with Surgical Pathology, J. Digit. Imaging, 33 (2020), 1202–1208. doi: 10.1007/s10278-020-00362-w. doi: 10.1007/s10278-020-00362-w
[43]	G. E. Dahl, T. N. Sainath, G. E. Hinton, Improving deep neural networks for LVCSR using rectified linear units and dropout, 2013 IEEE Int. Conf. Acoustics, IEEE, 2013. doi: 10.1109/ICASSP.2013.6639346. doi: 10.1109/ICASSP.2013.6639346
[44]	N. Srivastava, G. Hinton, A. Krizhevsky, I. Sutskever, R. Salakhutdinov, Dropout: A Simple Way to Prevent Neural Networks from Overfitting, J. Mach. Learn. Res., 15 (2014), 1929–1958.
[45]	S. Ioffe, C. Szegedy, Batch normalization: Accelerating deep network training by reducing internal covariate shift, Int. Conf. Mach. Learn., PMLR, (2015), pp. 448–456.
[46]	V. Nair, G. E. Hinton, Rectified linear units improve restricted boltzmann machines, lcml, 2010.
[47]	P. Li, X. Liu, Bilinear interpolation method for quantum images based on quantum Fourier transform, Int. J. Quantum Inf., 16 (2018), 1850031. doi: 10.1142/S0219749918500314. doi: 10.1142/S0219749918500314
[48]	D. Y. Han, Comparison of commonly used image interpolation methods, Proc. 2nd Int. Conf. Comput. Sci. Electron. Eng. (ICCSEE 2013), 10 (2013).
[49]	X. Wang, X. Jia, W. Zhou, et al., Correction for color artifacts using the RGB intersection and the weighted bilinear interpolation, Appl. Opt., 58 (2019), 8083–8091. doi: 10.1364/AO.58.008083. doi: 10.1364/AO.58.008083
[50]	J. F. Dou, Q. Qin, Z. M. Tu, Image fusion based on wavelet transform with genetic algorithms and human visual system, Multimed. Tools Appl., 78 (2018), 12491–12517. doi: 10.1007/s11042-018-6756-0. doi: 10.1007/s11042-018-6756-0
[51]	H. M. Lu, L. F. Zhang, S. Serikawa, Maximum local energy: An effective approach for multisensor image fusion in beyond wavelet transform domain, Comput. Math. Appl. 64 (2012), 996–1003. doi: 10.1016/j.camwa.2012.03.017. doi: 10.1016/j.camwa.2012.03.017
[52]	B. Zhang, Study on image fusion based on different fusion rules of wavelet transform, 2010 3rd Int. Conf. Adv. Comput. Theo. Eng. (ICACTE), Vol. 3. IEEE, 2010. doi: 10.1109/ICACTE.2010.5579586.
[53]	S. L. Liu, Z. J. Song, M. N. Wang, WaveFuse: A Unified Deep Framework for Image Fusion with Discrete Wavelet Transform, arXiv preprint arXiv: 2007. 14110(2020).
[54]	D. Kusumoto, M. Lachmann, T. Kunihiro, S. Yuasa, Y. Kishino, M. Kimura, et al., Automated Deep Learning-Based System to Identify Endothelial Cells Derived from Induced Pluripotent Stem Cells, Stem Cell Rep., 10 (2018), 1687–1695. doi: 10.1016/j.stemcr.2018.04.007. doi: 10.1016/j.stemcr.2018.04.007
[55]	Su. P, Guo. S, Roys. S, F. Maier, H. Bhat, J. Zhuo, et al., Transcranial MR Imaging-Guided Focused Ultrasound Interventions Using Deep Learning Synthesized CT, AJNR Am. J. Neuroradiol., 41 (2020), 1841–1848. doi: 10.3174/ajnr.A6758. doi: 10.3174/ajnr.A6758
[56]	S. J. Pan, Q. Yang, A Survey on Transfer Learning, IEEE Trans. Knowl. Data Eng., 22 (2010), 1345–1359. doi: 10.1109/TKDE.2009.191. doi: 10.1109/TKDE.2009.191
[57]	S. Medghalchi, C. F. Kusche, E. Karimi, U. Kerzel, S. K. Kerzel, et al., Damage Analysis in Dual-Phase Steel Using Deep Learning: Transfer from Uniaxial to Biaxial Straining Conditions by Image Data Augmentation, JOM, 72 (2020), 4420–4430. doi: 10.1007/s11837-020-04404-0. doi: 10.1007/s11837-020-04404-0
[58]	X. R. Yu, X. M. Wu, C. B. Luo, P. Ren, Deep learning in remote sensing scene classification: a data augmentation enhanced convolutional neural network framework, GISci. Remote Sens., 54 (2017), 741–758. doi: 10.1080/15481603.2017.1323377. doi: 10.1080/15481603.2017.1323377
[59]	A. Taheri-Garavand, H. Ahmadi, M. Omid, S. S. Mohtasebi, K. Mollazade, G. M. Carlomagno, et al., An intelligent approach for cooling radiator fault diagnosis based on infrared thermal image processing technique, Appl. Therm. Eng., 87 (2015), 434–443. doi: 10.1016/j.applthermaleng.2015.05.038. doi: 10.1016/j.applthermaleng.2015.05.038
[60]	M. Drozdzal, E. Vorontsov, G. Chartrand, S. Kadoury, C. Pal, The Importance of Skip Connections in Biomedical Image Segmentation, Deep learning and data labeling for medical applications, Springer, Cham, 2016. 179–187. doi: 10.1007/978-3-319-46976-8_19.
[61]	Y-Lan. Boureau, Bach. F, Y. LeCun, Ponce. J, Learning mid-level features for recognition, 2010 IEEE Computer Society Conf. Comput. Vis. Pattern Recognit., IEEE, (2010), 2559–2566. doi: 10.1109/CVPR.2010.5539963. doi: 10.1109/CVPR.2010.5539963

1.	Jing Shi, Muhammad Babar, Deep Learning for College English Education Evaluation, 2022, 2022, 1875-905X, 1, 10.1155/2022/3558558
2.	Bin Li, Fuji Ren, Hongjun Ni, Xin Kang, Shuaishuai Lv, Zhuangzhuang Hao, 2022, Classification Method of Surface Defects of Aluminum Profile Based on Transfer Learning, 978-1-6654-9246-1, 1, 10.1109/MLISE57402.2022.00008
3.	Zhiyang Li, Bin Li, Hongjun Ni, Fuji Ren, Shuaishuai Lv, Xin Kang, An Effective Surface Defect Classification Method Based on RepVGG with CBAM Attention Mechanism (RepVGG-CBAM) for Aluminum Profiles, 2022, 12, 2075-4701, 1809, 10.3390/met12111809
4.	Guohua Liu, Wei Zhao, Weight-guided feature fusion and non-local balance model for aluminum surface defect detection, 2023, 34, 0957-0233, 125409, 10.1088/1361-6501/acf952
5.	Zhuxi Ma, Yibo Li, Minghui Huang, Nanzhou Deng, Online visual end-to-end detection monitoring on surface defect of aluminum strip under the industrial few-shot condition, 2023, 70, 02786125, 31, 10.1016/j.jmsy.2023.06.016
6.	Liang Hao, Pei Shen, Zhiwei Pan, Yong Xu, Multi-level semantic information guided image generation for few-shot steel surface defect classification, 2023, 11, 2296-424X, 10.3389/fphy.2023.1208781
7.	Guangxia Wu, Lin Fei, Limiao Deng, Haoyan Yang, Meng Han, Zhongzhi Han, Longgang Zhao, Identification of Soybean Mutant Lines Based on Dual-Branch CNN Model Fusion Framework Utilizing Images from Different Organs, 2023, 12, 2223-7747, 2315, 10.3390/plants12122315
8.	Zhihong Chen, Xuhong Huang, Ronghao Kang, Jianjun Huang, Junhan Peng, Aluminum Product Surface Defect Detection Method Based on Improved CenterNet, 2024, 1931-4973, 10.1002/tee.24218
9.	Yuqian Wang, Junyan Wang, 2024, Surface defect detection of aluminum profiles based on deep learning, 979-8-3315-0623-0, 1541, 10.1109/ICEMCE64157.2024.10862601

Mathematical Biosciences and Engineering

Effect of dual-convolutional neural network model fusion for Aluminum profile surface defects classification and recognition

Related Papers:

Abstract

1. Introduction

2. Methods

2.1. Dual-convolution neural network model fusion frameworks

2.2. A brief overview of convolutional neural networks

2.3. Upsampling and Feature fusion

2.4. K-fold cross-validation

2.5. Transfer learning and data augmentation

3. Experimental setup

3.1. Machine vision setup

3.2. Experimental development environment and dataset

3.3. Quantitative evaluation metrics

4. Experiments results and analysis

4.1. K-fold cross-validation and performance evaluation

4.2. Comparison of different interpolation methods under different feature fusion methods

4.3. Comparison of the proposed fusion frameworks with other fusion frameworks

4.4. Insufficient research and future work

5. Conclusions

Acknowledgments

Conflict of interest

References

This article has been cited by:

Reader Comments

通讯作者: 陈斌, bchen63@163.com

Metrics

Figures and Tables

Other Articles By Authors

Related pages

Tools

Export File

Citation

Format

Content

Catalog