Research article

Enhancing autonomous vehicle safety in rain: a data centric approach for clear vision


  • Received: 23 November 2024; Revised: 25 December 2024; Accepted: 26 December 2024; Published: 30 December 2024
  • Autonomous vehicles (AV) face significant challenges in navigating adverse weather, particularly rain, due to the visual impairment of camera-based systems. In this study, we leveraged contemporary deep learning techniques to mitigate these challenges, aiming to develop a vision model that processes live vehicle camera feeds to eliminate rain-induced visual hindrances, yielding visuals closely resembling clear, rain-free scenes. Using the Car Learning to Act (CARLA) simulation environment, we generated a comprehensive dataset of clear and rainy images for model training and testing. In our model, we employed a classic encoder-decoder architecture with skip connections and concatenation operations. It was trained using novel batching schemes designed to effectively distinguish high-frequency rain patterns from low-frequency scene features across successive image frames. To evaluate the model's performance, we integrated it with a steering module that processes front-view images as input. The results demonstrated notable improvements in steering accuracy, underscoring the model's potential to enhance navigation safety and reliability in rainy weather conditions.

    Citation: Mark A. Seferian, Jidong J. Yang. Enhancing autonomous vehicle safety in rain: a data centric approach for clear vision[J]. Applied Computing and Intelligence, 2024, 4(2): 282-299. doi: 10.3934/aci.2024017




    Rain poses significant challenges not only to human visual perception, but also to autonomous vehicles (AV) navigating roadways. Rain streaks can severely hinder the camera-based object and feature detection systems employed by AV. Consequently, automotive manufacturers often deactivate autonomous driving features during inclement weather. In response, research on deep learning models for image deraining has surged in recent years. However, due to the intricate nature of dynamic rain streaks against slowly changing background scenes, deraining remains a challenging task.

    We aim to address the deraining challenge for AV by: (1) Developing a deep learning based vision model capable of removing rain streaks, yielding results that resemble a clear, rain-free image; (2) using a data-centric approach to devise and analyze different batching schemes to enhance model training and inference performance; and (3) utilizing an established steering angle prediction model to validate the benefits of deraining in improving AV steering performance.

    The datasets for this study are generated from the Car Learning to Act (CARLA) simulator. Three datasets are prepared for model training, validation, and testing purposes. Each dataset comprises rainy images as the input and corresponding clear images as the label. The training dataset includes diverse maps and environments to improve the model's generalization, while the validation and testing datasets are derived from maps not used in training. Deraining, akin to denoising, is a common machine learning task, exemplified by methods like the denoising autoencoder [1].

    However, the challenge of removing rain streaks from an ego vehicle's camera view differs significantly from denoising static images due to the dynamic nature of sequential scenes captured by these cameras. Here, rain streaks act as high-frequency signals (similar to noise), contrasting with the low-frequency signals of camera scenes, which evolve slowly as the vehicle navigates roads. Removing rain streaks while maintaining the integrity of scenes is analogous to filtering out high-frequency signals. To harness the distinct dynamics of these signal types, two novel batching schemes are employed and compared to conventional batching to assess their impact on model training and the resultant deraining performance. The first batching scheme uses paired images that are sequential in time and sequential in batch (STSB), while the second uses paired images that are sequential in time and random in batch (STRB). In contrast, the traditional batching scheme relies on image pairs that are random in time and random in batch (RTRB), where the temporal cue is lost within each batch.

    The model architecture devised in this study draws inspiration from two influential architectures: The Deep Convolutional Generative Adversarial Network (DCGAN) [2] and the U-Net [3]. DCGAN, renowned for its applications in computer vision such as image generation [4,5,6], style transfer [7,8], and data augmentation [9,10,11], serves as a foundational pillar in our approach. Moreover, U-Net, initially developed for biomedical image segmentation, is important in modern diffusion models for iterative image denoising [12,13,14]. To tackle the challenge of image deraining, we propose an encoder-decoder architecture that extends the DCGAN to accommodate higher image resolutions while integrating the skip-concatenation mechanism from U-Net to leverage multiscale perceptual views, fostering context-aware denoising.

    To illustrate the effectiveness of our model, we compare the derained images against both rainy and ground-truth (clear) images. Additionally, we benchmark our model against PReNet [15], a seminal work in the field. To quantitatively assess the performance of our deraining model, we employ PilotNet [16], a steering angle predictor. Steering performance is evaluated under rainy, clear, and derained conditions to demonstrate the advantages of our model in improving vehicle steering under adverse weather conditions. In summary, the key contributions of this study are as follows:

    • We introduce sequential batching schemes that facilitate cost-free learning of structured scenic features against noisy rain streaks. This data-centric approach enhances both training stability and inference performance.

    • Inspired by DCGAN and U-Net, our proposed simple yet effective architecture surpasses the prior work in removing rain streaks from images.

    • The efficacy of our deraining model is validated through steering performance evaluation using PilotNet, where steering angles predicted from derained images closely match those from clear images.

    A number of deep learning models have been proposed to address the deraining challenge. DID-MDN [17] utilized a multi-stream dense network with dilated convolutions to capture rain streaks of varying sizes and long-range dependencies in rain streak patterns. JORDER [18] jointly addressed rain detection and removal within a unified framework by extracting rain-discriminative features. PReNet [15] used a modified progressive residual network (PRN) to remove rain streaks by progressively refining the deraining results through multiple stages. DerainNet [19] employed a deep convolutional neural network (CNN) [20] to directly learn the mapping between rainy and clear images. Restormer [21] used a multiscale hierarchical design incorporating efficient transformer blocks, such as multi-Dconv head transposed attention and a gated-Dconv feed-forward network, to derain images. KBNet [22] argued against transformer models as they lack the desirable inductive bias of convolutions; instead, it incorporated a kernel basis attention module to adaptively aggregate spatial information and a multi-axis feature fusion to encode and fuse diverse features for image restoration.

    Other researchers in the field have adopted GAN [23] based architectures for image deraining. DerainCycleGAN [24] used an unsupervised attention-guided rain streak extractor, two generators, and two discriminators to derain images. ID-CGAN [25] integrated skip connections and dense blocks, using per-pixel and perceptual losses to improve deraining performance. PAN [26] employed a perceptual adversarial loss computed on hidden trainable layers. FS-GAN [27] incorporated feature supervision on generator layers to contribute gradient information for optimization and improve image deraining. IGAN [28] followed a divide-and-conquer strategy, splitting image deraining into rain locating, rain removal, and detail refinement sub-tasks.

    In contrast to research that focuses on model architecture, we emphasize a data-centric approach using cost-free batching schemes to improve image deraining performance. For proof of concept, we devise a simple encoder-decoder architecture with end-to-end training for direct image deraining and style transfer.

    Data collection and curation is pivotal in our data-centric approach, profoundly shaping the training of our model. We aim to achieve three objectives: (1) Capturing both rainy images and their corresponding clear counterparts as an ego vehicle navigates roads, facilitating direct end-to-end training with sequential images; (2) ensuring diversity in the driving environments captured within the datasets; and (3) acquiring steering wheel angle data alongside image data to enable quantitative evaluation of deraining on steering performance.

    To collect the necessary data for model training and testing, Car Learning to Act (CARLA) [29], a simulator for autonomous driving research, is utilized. CARLA is a powerful open-source simulator that contains various digital assets, such as vehicles, sensors for capturing data, and pre-made maps covering a diverse selection of environments. CARLA also has an extensive API, offering flexibility in setting the time of day, controlling weather conditions, and gathering necessary vehicle data. However, one glaring issue with CARLA is its simulated rain effects. The original rain effects in the CARLA 0.9.14 release were unrealistic when compared to real rain. Consequently, it was necessary to modify the rain effects in CARLA to closely reflect real-world rainy conditions. To do so, a custom build of CARLA was created using the Unreal Editor, in which the rain asset file was edited to reflect real-world rain effects. Figure 1 shows a comparison of a real-world heavy rain image, the original CARLA heavy rain image, and the modified CARLA heavy rain image.

    Figure 1.  Visualization of heavy rain effect when compared to original CARLA rain effect and modified CARLA rain effect.

    The rain streaks in the real-world image are thin with a light grey color, while the original CARLA rain effect is drastic in comparison: the streaks are thick, dark grey, rectangular blocks of pixels. After modifying the rain asset file, the rain streaks are thinner and lighter in color, closely resembling the real-world rain effect.

    The datasets for model training required a rainy input image and a corresponding clear label image. It was essential that the simulation runs were synchronized so that the clear and rainy images were captured at exactly the same frames. To achieve this, a Python script was developed to ensure that the ego vehicle followed a predefined path in CARLA. The script ran the simulation in synchronous mode, manually advancing it one time step at a time, which ensured that frames across different simulation runs were identical. The forward-view images were captured through the CARLA spectator view attached to the hood of the ego vehicle. In the simulation runs, no other moving vehicles were present, and all traffic lights were set to green as the ego vehicle approached to prevent repetitive, standstill images for extended periods of time. The time of day was set to noon for generating both the clear and rainy road scene images.
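    The following is a minimal sketch of such a synchronized capture script using the CARLA Python API. The map name, camera mounting transform, fixed time step, frame count, and output paths are illustrative assumptions, and autopilot stands in for the predefined-path controller used in this study; a second run with a rain weather preset produces the matching rainy frames.

```python
import carla

client = carla.Client("localhost", 2000)
client.set_timeout(10.0)
world = client.load_world("Town01")                     # assumed map

# Synchronous mode: the server only advances when world.tick() is called,
# so frame numbers are reproducible across the clear and rainy runs.
settings = world.get_settings()
settings.synchronous_mode = True
settings.fixed_delta_seconds = 0.05                     # assumed step size
world.apply_settings(settings)
client.get_trafficmanager().set_synchronous_mode(True)

world.set_weather(carla.WeatherParameters.ClearNoon)    # rainy run: a rain preset

blueprints = world.get_blueprint_library()
vehicle = world.spawn_actor(blueprints.filter("vehicle.*")[0],
                            world.get_map().get_spawn_points()[0])
vehicle.set_autopilot(True)                             # stand-in for the fixed path

camera_bp = blueprints.find("sensor.camera.rgb")
camera_tf = carla.Transform(carla.Location(x=1.5, z=1.4))   # assumed hood mount
camera = world.spawn_actor(camera_bp, camera_tf, attach_to=vehicle)
camera.listen(lambda img: img.save_to_disk(f"clear/{img.frame:06d}.png"))

for _ in range(4000):                                   # assumed frames per run
    world.tick()                                        # advance exactly one step
```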

    CARLA offers various pre-made maps that can be used to create image datasets containing diverse road scenes. The aerial views of the maps used are shown in Figure 2.

    Figure 2.  Aerial views of the CARLA maps used. Town01, 03, 04, 07, and 10 for training; Town02 for validation; and Town05 for testing.

    For the training datasets, the following five distinct maps were included: Town01, Town03, Town04, Town07, and Town10. Town01 featured a small river surrounded by a mix of commercial and residential buildings in a forest terrain. Town03 was an urban landscape with metro tracks, a blend of commercial and residential buildings, and a roundabout. Town04 offered highway roads winding through a mountainous terrain with an exit to a small town.

    Town07 presented a rural countryside setting with narrow winding roads, farming structures, and cornfields. Town10 provided a downtown environment with skyscrapers, residential complexes, and an ocean view. For the validation dataset, Town02 was selected. For the testing dataset, Town05 was chosen due to its larger size and comprehensive environmental characteristics, such as highways, residential and commercial zones, skyscrapers, tree-lined streets, and metro tracks. In total, the training dataset comprised 32,000 images (16,000 clear images and 16,000 rainy images). The validation dataset contained 3,200 images, while the testing dataset consisted of 4,000 images.

    To capture the steering angle data for each map in CARLA, we recorded the steering angle of the vehicle at each frame. It is worth noting that the CARLA API allowed capturing only the drive wheel angles, whereas the PilotNet model outputs steering wheel angles. Thus, the drive wheel angles (P) were converted to steering wheel angles (S) by multiplying by the steering wheel ratio (R), i.e., S = P × R. The resultant steering wheel angles for each map were saved along with the corresponding frame numbers. This data was used for assessing the effect of deraining on steering performance.
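    A minimal sketch of this conversion and logging step is shown below; the steering ratio value and variable names are assumed placeholders rather than figures reported in this study.

```python
def to_steering_wheel_angle(drive_wheel_deg, steer_ratio=15.0):
    """Convert a drive wheel angle (P) to a steering wheel angle (S = P * R).
    The ratio of 15.0 is an assumed placeholder value."""
    return drive_wheel_deg * steer_ratio

# Hypothetical logging: one (frame number, steering wheel angle) record per step.
drive_wheel_log = [0.0, 1.2, 2.5, 1.8]                  # example drive wheel angles
steering_log = [(frame, to_steering_wheel_angle(p))
                for frame, p in enumerate(drive_wheel_log)]
```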

    This study emphasizes the data-centric aspect rather than the model architecture design, highlighting the critical role of data batching in learning distinct signals at different frequencies to enhance autonomous vehicle vision.

    Data preparation is vital, as it significantly influences model training. Initially, all images were center cropped and resized to 256x256. Normalization was omitted due to its adverse effect on the quality of the derained images. The training process required correctly pairing rainy input images with their corresponding clear ground-truth images. This was accomplished by storing the two sets in separate folders and matching images by frame number.
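    A minimal dataset sketch implementing this pairing is given below, assuming PyTorch and torchvision; the crop size and file naming convention are assumptions, and pixel values are simply scaled to [0, 1] since no further normalization is applied.

```python
import os
from PIL import Image
from torch.utils.data import Dataset
from torchvision import transforms

class RainyClearPairs(Dataset):
    """Pairs rainy and clear frames that share the same frame number."""

    def __init__(self, rainy_dir, clear_dir, size=256):
        # Filenames are assumed to encode the frame number, e.g. "000123.png",
        # identically in both folders.
        self.frames = sorted(os.listdir(rainy_dir))
        self.rainy_dir, self.clear_dir = rainy_dir, clear_dir
        self.tf = transforms.Compose([
            transforms.CenterCrop(512),        # assumed crop size
            transforms.Resize((size, size)),   # 256x256 model input
            transforms.ToTensor(),             # [0, 1]; no further normalization
        ])

    def __len__(self):
        return len(self.frames)

    def __getitem__(self, idx):
        name = self.frames[idx]
        rainy = self.tf(Image.open(os.path.join(self.rainy_dir, name)).convert("RGB"))
        clear = self.tf(Image.open(os.path.join(self.clear_dir, name)).convert("RGB"))
        return rainy, clear
```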

    Three distinct batching schemes were implemented using a customized dataloader. The first scheme, termed Sequential in Time and Sequential in Batch (STSB), paired two sequential frames of rainy and clear images from each of the five training maps within a batch. This scheme operated sequentially both in time and batch, as illustrated in Figure 3, where Frames 1 and 2 of rainy and clear images from each map formed the first batch, followed by Frames 3 and 4 in the second batch. This process was repeated until all images were utilized. However, since not all maps contained an equal number of images, once the images from one map were depleted, another remaining map was randomly chosen to fill the batch.

    Figure 3.  Batching Scheme 1: Sequential in time and sequential in batch (STSB).

    The second novel batching scheme, Sequential in Time and Random in Batch (STRB), maintained the pairing of two sequential frames but introduced randomness in batch loading. This meant that within each batch, the frame pairs were shuffled randomly rather than following a strict sequential order. This is demonstrated in Figure 4, where sequential frames from each map are paired, but their order within each batch is randomized.

    Figure 4.  Batching Scheme 2: Sequential in time and random in batch (STRB).

    The third batching scheme, Random in Time and Random in Batch (RTRB), represented conventional batching, as shown in Figure 5. In this scheme, both the frame pairs and their order within each batch were randomly selected.

    Figure 5.  Batching Scheme 3: Random in time and random in batch (RTRB).
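    The three schemes can be realized with a custom batch sampler passed to the dataloader. The sketch below implements the STRB scheme under the assumption that the underlying dataset concatenates the maps in temporal order; the per-map frame counts and the non-overlapping pairing are illustrative choices.

```python
import random
from torch.utils.data import Sampler

class STRBSampler(Sampler):
    """STRB sketch: frame pairs stay sequential in time, but the pairs placed
    into each batch are drawn in random order across maps."""

    def __init__(self, frames_per_map, batch_size=10):
        # frames_per_map, e.g. {"Town01": 4000, "Town03": 3200, ...} (assumed counts)
        self.batch_size = batch_size
        self.pairs, offset = [], 0
        for n in frames_per_map.values():
            # consecutive (t, t+1) index pairs within one map, never across maps
            self.pairs += [(offset + t, offset + t + 1) for t in range(0, n - 1, 2)]
            offset += n

    def __iter__(self):
        random.shuffle(self.pairs)                # random in batch
        batch = []
        for a, b in self.pairs:
            batch += [a, b]                       # sequential in time
            if len(batch) == self.batch_size:
                yield batch
                batch = []

    def __len__(self):
        return (2 * len(self.pairs)) // self.batch_size
```

    Keeping the pairs in their original order instead of shuffling them would approximate STSB, while sampling both frames of a pair independently at random would reduce to the conventional RTRB scheme.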

    Our model architecture followed a classic encoder-decoder design, where the encoder and decoder were adapted from the discriminator and generator of DCGAN to handle higher-resolution images. Specifically, the original DCGAN architecture handled 64x64 images; additional convolution blocks were added to accommodate 256x256 images. Skip-concatenation [3] was adopted between convolutional blocks of the same spatial resolution in the encoder and the decoder. This design allowed hierarchical, multiscale features to flow directly from the encoder to the decoder, enabling context-aware image generation, which is crucial in tasks like deraining, where understanding the underlying structure of the road scene is essential for removing rain streaks and enhancing image clarity.

    Figure 6 shows the proposed deraining model architecture with distinct blocks denoted by different colors. Figure 7 further elaborates on the computational details of each colored block. Batch Norm [30] was applied to all layers except the decoder output and encoder input. In line with the principle of design simplicity [31], the model exclusively used convolutional layers, where down-sampling was achieved by increasing the stride. ReLU was predominantly used as the nonlinearity across convolution layers, while Tanh was used for the decoder output and sigmoid was employed for the encoder output.

    Figure 6.  Model Architecture. The decoder transposed convolutions (in purple) are modified by concatenating with corresponding convolution block from the encoder, followed by 1x1 convolution to resize the channel dimension.
    Figure 7.  Computational modules of the colored blocks in Figure 6 (k: kernel size; S: stride; P: padding).
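    As a rough illustration of the blocks described above, the sketch below shows one encoder (strided convolution) block and one decoder block that applies a transposed convolution, concatenates the corresponding encoder feature map, and resizes the channel dimension with a 1x1 convolution. The kernel sizes, strides, and channel handling are assumptions and do not reproduce the exact settings of Figure 7.

```python
import torch
import torch.nn as nn

class Down(nn.Module):
    """Encoder block: a strided convolution halves the spatial resolution."""
    def __init__(self, c_in, c_out):
        super().__init__()
        self.block = nn.Sequential(
            nn.Conv2d(c_in, c_out, kernel_size=4, stride=2, padding=1),  # assumed sizes
            nn.BatchNorm2d(c_out),
            nn.ReLU(inplace=True),
        )

    def forward(self, x):
        return self.block(x)

class Up(nn.Module):
    """Decoder block: transposed convolution, skip-concatenation with the
    encoder feature map of the same resolution, then a 1x1 convolution."""
    def __init__(self, c_in, c_out):
        super().__init__()
        self.up = nn.Sequential(
            nn.ConvTranspose2d(c_in, c_out, kernel_size=4, stride=2, padding=1),
            nn.BatchNorm2d(c_out),
            nn.ReLU(inplace=True),
        )
        self.fuse = nn.Conv2d(2 * c_out, c_out, kernel_size=1)  # resize channels

    def forward(self, x, skip):
        x = self.up(x)
        return self.fuse(torch.cat([x, skip], dim=1))
```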

    For each batching scheme, the deraining model was trained for 100 epochs with a batch size of 10 and a learning rate of 0.0002. We used MSE loss and the Adam optimizer [32] with parameters β1 = 0.5 and β2 = 0.999. All experiments were conducted on a workstation with an AMD Ryzen 9 7950X CPU, 32 GB of RAM, and an Nvidia GeForce RTX 4090 (24 GB).
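    A minimal training-loop sketch using these settings is shown below; the model and dataloader are assumed to be instances of the hypothetical components sketched earlier.

```python
import torch
import torch.nn as nn

def train_derainer(model, loader, epochs=100, lr=2e-4, device="cuda"):
    """Train with the reported settings: MSE loss, Adam with betas (0.5, 0.999),
    learning rate 0.0002, 100 epochs; the batch size of 10 is set by the loader."""
    model = model.to(device)
    criterion = nn.MSELoss()
    optimizer = torch.optim.Adam(model.parameters(), lr=lr, betas=(0.5, 0.999))
    for _ in range(epochs):
        for rainy, clear in loader:                     # (rainy, clear) image pairs
            rainy, clear = rainy.to(device), clear.to(device)
            loss = criterion(model(rainy), clear)       # derained vs. ground truth
            optimizer.zero_grad()
            loss.backward()
            optimizer.step()
    return model
```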

    To evaluate the performance of the different batching schemes, the training, validation, and test losses are summarized in Table 1.

    Table 1.  MSE loss comparison of different batching schemes.

    Batching scheme     Train loss    Validation loss    Test loss
    STSB                0.0122        0.0821             0.0132
    STRB                0.0012        0.0130             0.0106
    RTRB (reference)    0.0014        0.0164             0.0115

    Note: STRB achieves the lowest (best) loss in every column.


    As shown in Table 1, the STRB batching scheme performs the best, followed by RTRB and STSB. For visual comparison, Figure 8 shows the derained images from the three batching schemes.

    Figure 8.  Visualization of deraining results of a single frame.

    Notably, STSB leaves lingering grey and white spots in the sky and pavement areas, where a solid color or smooth color gradient is expected. RTRB shows improvement over STSB, with the grey spots absent; however, some artifacts (e.g., a white spot) remain in the sky area and structural information (e.g., the light post) is lost. STRB, on the other hand, performs extremely well in comparison, preserving both pixel-level information and structural features. For further comparison, three consecutive derained images for each of the three batching schemes are shown in Figure 9.

    Figure 9.  Visualization of deraining results of three consecutive frames.

    It becomes apparent that RTRB struggles with slight environmental movements as the car navigates down the road; the overall image quality also exhibits a watery appearance with significant loss of detail, especially in object structures such as traffic lights and trees. For STSB, various grey spots are present in the image, which likely arise from the less diverse backgrounds caused by sequential batching. In contrast, STRB harnesses the advantages of both sequential frames and random batching, resulting in improved images with pixel-level and structural integrity.

    In summary, STRB is a novel batching scheme that utilizes random batching of sequential frames to derain images. This strategy enables the model to effectively capture the distinct dynamics of raindrops against slowly changing roadway scenes, resulting in superior deraining performance when compared to the traditional RTRB approach. By using sequential frames, STRB can better understand the rain dynamics between the consecutive frames to adaptively remove rain streaks while preserving the scene details and integrity. On the other hand, the randomness in the batch increases diversity in scenes within each batch, mitigating overfitting and bias toward any particular scenes. As such, the STRB batching scheme effectively preserves both structural and pixel-level details when deraining images.

    With the STRB batching scheme demonstrating the highest effectiveness for deraining, we compare our model's deraining performance to PReNet, the baseline model used in this study. For a qualitative assessment, three distinct roadway scenes are used for comparison. For each scene, four images are presented: the original rainy image, the ground-truth image, the PReNet derained image, and the derained image from our work. As shown in Figures 10–12, our model with the STRB batching scheme consistently outperforms PReNet.

    Figure 10.  Scene 1: AV approaching traffic lights.
    Figure 11.  Scene 2: AV driving through residential area.
    Figure 12.  Scene 3: AV approaching 4-way stop.

    While PReNet successfully removes the rain streaks from the rainy image, the resulting image quality drastically decreases. Additionally, the environment retains the original grey, colorless appearance of the rainy image. Our model, on the other hand, not only removes the rain streaks from the image but also retains the environmental details to mimic a clear, sunny day. As a result, the derained views from our model show much greater visibility than those from PReNet. It is important to note that our model was trained to learn a direct mapping between rainy images and corresponding clear, sunny images. Implicitly, our model learns to tackle two tasks simultaneously: (1) Removing rain streaks and (2) style-transferring from rainy weather conditions to clear and sunny weather conditions.

    When comparing the derained image to the ground truth, there is a slight loss of detail, most notably in the leaves of trees in the second scene. However, the overall image quality remains sufficient for driving-relevant feature and object detection: lane markings, the "STOP" text on the road, and roadside structures remain visible. One area where the model struggles is with skyscrapers (as seen in Scene 3 of Figure 12), where some segments of the buildings are missing or distorted. Despite these limitations, the model performs remarkably well at deraining images, representing a leap forward compared to the prior work. It is important to emphasize that the primary objective of this study is to mitigate the adverse effects of rain while ensuring that essential features remain visible for real-time driving tasks, rather than achieving a perfect high-resolution reconstruction of all scene details. While the latter could potentially be addressed by scaling up the network with architectural enhancements, such improvements would increase computational costs and fall outside the scope of this study.

    To quantify the benefits of image deraining achieved by our model, PilotNet was employed to predict steering angles for clear, rainy, and derained images. Since PilotNet was originally designed for lane-following tasks, scenarios involving intersections and sharp turns were excluded from the analysis. The evaluation was conducted on a multi-lane highway comprising straight segments and gradual turns. As previously noted, the ground-truth steering angles were directly recorded from CARLA.

    It is important to acknowledge that even for clear images, PilotNet exhibits an inherent deviation from the ground-truth steering angles recorded in CARLA. For steering performance evaluation, the mean absolute error (MAE) was computed to measure the deviation under four conditions: Clear weather, heavy rain, light rain, and derained images. Table 2 presents the results, indicating an inherent error of 0.356 degrees for clear weather and a slightly higher error of 0.508 degrees under derained conditions. In comparison, both heavy and light rain conditions result in larger steering errors, with heavy rain showing a significantly worse performance. Notably, the steering error under heavy rain is more than twice that observed for light rain.

    Table 2.  Mean absolute steering angle error.

    Condition     Error (degrees)
    Clear         0.356 (inherent error)
    Heavy Rain    1.204
    Light Rain    0.561
    Derained      0.508
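    The mean absolute error in Table 2 compares PilotNet's predicted steering wheel angles with the CARLA-recorded ground truth. A minimal sketch of the metric is shown below; the variable names are illustrative.

```python
import numpy as np

def steering_mae(predicted_deg, ground_truth_deg):
    """Mean absolute steering angle error in degrees."""
    predicted_deg = np.asarray(predicted_deg, dtype=float)
    ground_truth_deg = np.asarray(ground_truth_deg, dtype=float)
    return float(np.mean(np.abs(predicted_deg - ground_truth_deg)))

# Hypothetical usage: one MAE per condition against the recorded ground truth.
# mae_clear    = steering_mae(angles_clear,    angles_ground_truth)
# mae_derained = steering_mae(angles_derained, angles_ground_truth)
```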


    Figures 13 and 14 illustrate the live steering angle error relative to the ground truth over a simulation run for three scenarios: Clear, heavy rain, and derained. As shown in Figure 13, the heavy rain scenario exhibits three segments of significant deviation, corresponding to the three gradual turns in the simulation. During these turns, the heavy rain condition performs markedly worse, with errors reaching approximately 10 degrees. In contrast, the steering angles for the derained and clear scenarios remain closely aligned, demonstrating the positive impact of deraining on steering performance. Similarly, Figure 14 highlights that steering performance under light rain conditions is significantly better than under heavy rain, with errors reduced to within 5 degrees. This improvement further emphasizes the detrimental effects of heavy rain on steering accuracy and the potential of deraining to mitigate these challenges.

    Figure 13.  Steering angle comparison error (heavy rain).
    Figure 14.  Steering angle comparison error (light rain).

    To further illustrate the effects of deraining, Figures 15 and 16 present regression plots of predicted steering angles under derained conditions compared to those under corresponding clear conditions, for heavy rain and light rain scenarios, respectively. The vertical axis represents steering angles in clear conditions, while the horizontal axis represents steering angles under derained or rainy conditions. Each plot includes a regression line along with the corresponding R² value. As shown in Figure 15, the regression for Clear vs. Derained conditions achieves an R² value of 0.956, indicating a strong correlation. In contrast, the Clear vs. Heavy Rain regression shows nearly no correlation, as evidenced by the majority of points clustering along the vertical axis. This highlights that in heavy rain, the predicted steering angles fail to respond to curvy road segments. In essence, the vehicle "misses" visual cues in heavy rain, resulting in it continuing straight instead of turning as needed. Figure 16 illustrates a similar comparison for light rain conditions, where the R² value is 0.895, lower than that of the derained conditions. The steeper regression line for light rain indicates a tendency for under-predicted steering angles, meaning the vehicle turns less than necessary in these conditions. These results underscore the effectiveness of our deraining model in improving steering performance, thereby enhancing the safety and reliability of autonomous vehicles in rainy weather.

    Figure 15.  Steering performance: Derained vs heavy rain regression plot.
    Figure 16.  Steering performance: Derained vs light rain regression plot.
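    The regression lines and R² values in Figures 15 and 16 correspond to an ordinary least-squares fit of the clear-condition angles against the derained or rainy angles; a brief sketch of this analysis is given below, with illustrative variable names.

```python
import numpy as np

def fit_steering_regression(clear_deg, other_deg):
    """Fit clear-condition angles (y) against derained or rainy angles (x)
    and return the slope, intercept, and R^2 of the least-squares line."""
    x = np.asarray(other_deg, dtype=float)
    y = np.asarray(clear_deg, dtype=float)
    slope, intercept = np.polyfit(x, y, 1)
    y_hat = slope * x + intercept
    ss_res = np.sum((y - y_hat) ** 2)
    ss_tot = np.sum((y - y.mean()) ** 2)
    return slope, intercept, 1.0 - ss_res / ss_tot
```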

    Rain presents a formidable challenge for AV navigating roadways, as rain streaks can severely impair the camera-based object and feature detection systems employed by AV. Addressing this issue is essential for enhancing AV performance and safety.

    We adopted a data-centric approach and introduced two novel batching schemes, STSB and STRB, to improve deraining performance, comparing them to the conventional RTRB batching scheme. STSB paired sequential images both in time and batch, while STRB paired sequential images in time but randomized them across batches. Our results demonstrated that STRB outperformed STSB, primarily due to its ability to incorporate diverse scenes across batches, reducing overfitting and bias compared to sequential scene batching. Additionally, STRB's use of sequential image pairs in time enabled it to better capture dynamic rain features over relatively static road scenes in successive frames. These advantages were evident in reduced training, validation, and testing losses, as well as in superior visual quality of derained images produced by STRB.

    The encoder-decoder model developed in this work extended the DCGAN architecture to handle higher-resolution images and incorporated skip-concatenation operations inspired by U-Net. This enabled context-aware image generation, effectively removing rain streaks and achieving weather style transfer. Visual comparisons of rainy, ground-truth (clear), and derained images confirmed the model's ability to remove rain streaks while transforming rainy scenes into clear, sunny conditions. Compared to other methods, the proposed model demonstrated significantly superior performance. The practical benefits of the deraining model were quantitatively validated using PilotNet to predict AV steering angles on a highway section. Under heavy rain conditions, the AV lost steering control, deviating significantly from the ground-truth steering angles recorded under clear conditions. While light rain improved steering performance relative to heavy rain, it was under derained conditions that the steering angles most closely matched those of clear weather, achieving an R² value of 0.956. These results provide robust evidence of the model's effectiveness in enhancing visibility and improving AV control in rainy conditions.

    In conclusion, this study highlights the potential of a data-centric approach combined with deep learning models for joint image deraining and weather style transfer. While the model demonstrates strong performance, certain limitations remain. It addresses the removal of rain streaks but does not account for other weather-related challenges, such as raindrops on the windshield or splashes from preceding vehicles, which may further complicate visibility. Additionally, the simplicity of its architecture limits its ability to preserve finer image details. The use of CARLA-generated datasets, while effective, may not fully capture the diversity of real-world conditions.

    Future research could expand the scope to address a wider range of weather conditions, explore architectural enhancements, integrate sequential image frame modeling, and incorporate real-world driving datasets to improve robustness and adaptability. Furthermore, diffusion models [13,33] hold potential for real-time applications as their computational efficiency continues to improve, warranting further investigation. It is also important to note that we primarily focus on evaluating the efficacy of deraining in enhancing vehicle steering performance. However, the model holds potential to benefit other critical self-driving tasks, such as object detection, which should be explored in future work.

    The authors declare they have not used Artificial Intelligence (AI) tools for this article.

    The authors declare no conflict of interest.



    [1] P. Vincent, H. Larochelle, Y. Bengio, P. A. Manzagol, Extracting and composing robust features with denoising autoencoders, Proceedings of the 25th International Conference on Machine Learning, 2008, 1096–1103. https://doi.org/10.1145/1390156.1390294
    [2] A. Radford, L. Metz, S. Chintala, Unsupervised representation learning with deep convolutional generative adversarial networks, arXiv: 1511.06434. https://doi.org/10.48550/arXiv.1511.06434
    [3] O. Ronneberger, P. Fischer, T. Brox, U-net: convolutional networks for biomedical image segmentation, In: Medical image computing and computer-assisted intervention–MICCAI 2015, Cham: Springer, 2015, 234–241. https://doi.org/10.1007/978-3-319-24574-4_28
    [4] T. Karras, T. Aila, S. Laine, J. Lehtinen, Progressive growing of gans for improved quality, stability, and variation, arXiv: 1710.10196. https://doi.org/10.48550/arXiv.1710.10196
    [5] A. Brock, J. Donahue, K. Simonyan, Large scale GAN training for high fidelity natural image synthesis, arXiv: 1809.11096. https://doi.org/10.48550/arXiv.1809.11096
    [6] H. Zhang, T. Xu, H. Li, S. Zhang, X. Wang, X. Huang, et al., Stackgan: text to photo-realistic image synthesis with stacked generative adversarial networks, Proceedings of the IEEE International Conference on Computer Vision (ICCV), 2017, 5907–5915.
    [7] T. Karras, S. Laine, T. Aila, A style-based generator architecture for generative adversarial networks, Proceedings of IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2019, 4396–4405. https://doi.org/10.1109/CVPR.2019.00453
    [8] W. Xu, C. Long, R. Wang, G. Wang, Drb-gan: a dynamic resblock generative adversarial network for artistic style transfer, Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2021, 6383–6392.
    [9] H. Zhen, Y. Shi, J. Yang, J. Vehni, Co-supervised learning paradigm with conditional generative adversarial networks for sample-efficient classification, Appl. Comput. Intell., 3 (2023), 13–26. https://doi.org/10.3934/aci.2023002 doi: 10.3934/aci.2023002
    [10] S. Motamed, P. Rogalla, F. Khalvati, Data augmentation using generative adversarial networks (GANs) for GAN-based detection of Pneumonia and COVID-19 in chest X-ray images, Informatics in Medicine Unlocked, 27 (2021), 100779. https://doi.org/10.1016/j.imu.2021.100779 doi: 10.1016/j.imu.2021.100779
    [11] A. Jadli, M. Hain, A. Chergui, A. Jaize, DCGAN-based data augmentation for document classification, Proceedings of IEEE 2nd International Conference on Electronics, Control, Optimization and Computer Science (ICECOCS), 2020, 1–5. https://doi.org/10.1109/icecocs50124.2020.9314379
    [12] A. Ramesh, M. Pavlov, G. Goh, S. Gray, C. Voss, A. Radford, et al., Zero-shot text-to-image generation, Proceedings of the 38th International Conference on Machine Learning, 2021, 8821–8831.
    [13] R. Rombach, A. Blattmann, D. Lorenz, P. Esser, B. Ommer, High-resolution image synthesis with latent diffusion models, Proceedings of IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2022, 10684–10695. https://doi.org/10.1109/CVPR52688.2022.01042
    [14] B. Xia, Y. Zhang, S. Wang, Y. Wang, X. Wu, Y. Tian, et al., Diffir: efficient diffusion model for image restoration, Proceedings of IEEE/CVF International Conference on Computer Vision (ICCV), 2023, 13049–13059. https://doi.org/10.1109/ICCV51070.2023.01204
    [15] D. Ren, W. Zuo, Q. Hu, P. Zhu, D. Meng, Progressive image deraining networks: a better and simpler baseline, Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2019, 3937–3946. https://doi.org/10.1109/CVPR.2019.00406
    [16] M. Bojarski, D. Testa, D. Dworakowski, B. Firner, B. Flepp, P. Goyal, et al., End to end learning for self-driving cars, arXiv: 1604.07316. https://doi.org/10.48550/arXiv.1604.07316
    [17] H. Zhang, V. M. Patel, Density-aware single image de-raining using a multi-stream dense network, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2018, 695–704. https://doi.org/10.1109/CVPR.2018.00079
    [18] W. Yang, R. T. Tan, J. Feng, J. Liu, Z. Guo, S. Yan, Deep joint rain detection and removal from a single image, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2017, 1357–1366. https://doi.org/10.1109/CVPR.2017.183
    [19] X. Fu, J. Huang, X. Ding, Y. Liao, J. Paisley, Clearing the skies: a deep network architecture for single-image rain removal, IEEE Trans. Image Process., 26 (2017), 2944–2956. https://doi.org/10.1109/tip.2017.2691802 doi: 10.1109/tip.2017.2691802
    [20] Y. LeCun, B. Boser, J. Denker, D. Henderson, R. Howard, W. Hubbard, et al., Backpropagation applied to handwritten zip code recognition, Neural Comput., 1 (1989), 541–551. https://doi.org/10.1162/neco.1989.1.4.541 doi: 10.1162/neco.1989.1.4.541
    [21] S. Zamir, A. Arora, S. Khan, M. Hayat, F. Khan, M. Yang, Restormer: efficient transformer for high-resolution image restoration, Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2022, 5728–5739. https://doi.org/10.1109/cvpr52688.2022.00564
    [22] Y. Zhang, D. Li, X. Shi, D. He, K. Song, X. Wang, et al., Kbnet: kernel basis network for image restoration, arXiv: 2303.02881. https://doi.org/10.48550/arXiv.2303.02881
    [23] I. Goodfellow, J. Pouget-Abadie, M. Mirza, B. Xu, D. Warde-Farley, S. Ozair, et al., Generative adversarial networks, Commun. ACM, 63 (2020), 139–144. https://doi.org/10.1145/3422622 doi: 10.1145/3422622
    [24] Y. Wei, Z. Zhang, Y. Wang, M. Xu, Y. Yang, S. Yan, et al., Deraincyclegan: rain attentive cyclegan for single image deraining and rainmaking, IEEE Trans. Image Process., 30 (2021), 4788–4801. https://doi.org/10.1109/TIP.2021.3074804 doi: 10.1109/TIP.2021.3074804
    [25] H. Zhang, V. Sindagi, V. M. Patel, Image de-raining using a conditional generative adversarial network, IEEE Trans. Circ. Syst. Vid., 30 (2020), 3943–3956. https://doi.org/10.1109/tcsvt.2019.2920407 doi: 10.1109/tcsvt.2019.2920407
    [26] C. Wang, C. Xu, C. Wang, D. Tao, Perceptual adversarial networks for image-to-image transformation, IEEE Trans. Image Process., 27 (2018), 4066–4079. https://doi.org/10.1109/TIP.2018.2836316 doi: 10.1109/TIP.2018.2836316
    [27] P. Xiang, L. Wang, F. Wu, J. Cheng, M. Zhou, Single-image de-raining with feature-supervised generative adversarial network, IEEE Signal Proc. Let., 26 (2019), 650–654. https://doi.org/10.1109/LSP.2019.2903874 doi: 10.1109/LSP.2019.2903874
    [28] Y. Ren, M. Nie, S. Li, C. Li, Single image de-raining via improved generative adversarial nets, Sensors, 20 (2020), 1591. https://doi.org/10.3390/s20061591 doi: 10.3390/s20061591
    [29] A. Dosovitskiy, G. Ros, F. Codevilla, A. Lopez, V. Koltun, CARLA: an open urban driving simulator, arXiv: 1711.03938. https://doi.org/10.48550/arXiv.1711.03938
    [30] S. Ioffe, C. Szegedy, Batch normalization: accelerating deep network training by reducing internal covariate shift, Proceedings of the 32nd International Conference on International Conference on Machine Learning, 2015, 448–456.
    [31] J. Springenberg, A. Dosovitskiy, T. Brox, M. A. Riedmiller, Striving for simplicity: the all convolutional net, arXiv: 1412.6806. https://doi.org/10.48550/arXiv.1412.6806
    [32] D. Kingma, J. Ba, Adam: a method for stochastic optimization, arXiv: 1412.6980. https://doi.org/10.48550/arXiv.1412.6980
    [33] J. Ho, A. Jain, P. Abbeel, Denoising diffusion probabilistic models, Proceedings of the 34th International Conference on Neural Information Processing Systems, 2020, 6840–6851.
  • © 2024 the Author(s), licensee AIMS Press. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0)