Remote Sensing Scene Classification using ConvNeXt-Tiny Model with Attention Mechanism and Label Smoothing

  • Rachmawan Atmaji Perdana Badan Riset dan Inovasi Nasional
  • Aniati Murni Arimurthy Universitas Indonesia
  • Risnandar Badan Riset dan Inovasi Nasional
Keywords: attention mechanism, deep learning, CNN, ConvNeXt-Tiny, label smooth regularization

Abstract

Remote Sensing Scene Classification (RSSC) is the discrete categorization of remote sensing images into various classes of scene categories based on their image content. RSSC plays an important role in many fields, such as agriculture, land mapping, and identification of disaster-prone areas. Therefore, a reliable and accurate RSSC algorithm is required to ensure the accuracy of land identification. Many existing studies in recent years have used deep learning methods, especially CNN combined with attention modules to solve this problem. This study focuses on solving the RSSC problem by proposing a deep learning-based method (CNN) with the ConvNeXt-Tiny model integrated with Efficient Channel Attention Module (ECANet) and label smoothing regularization (LSR). The ConvNeXt-Tiny model shows that a persistent superior outperforms the ‘large’ model in convinced metrics. The ConvNeXt-Tiny model also has a huge advantage in high-precision positioning and higher classification accuracy and localization precision in a variety of complicated scenarios of remote sensing scene recognition. The experiments in this study also aim to prove that the integration of the attention module and LSR into the basic CNN network can improve precision, because the attention module can strengthen important features and weaken features that are less useful for classification. The experimental results proved that the integration of ECANet and LSR in the ConvNeXt-Tiny base network obtained a higher precision of 0.38% in the UC-Merced dataset, 0.7% in the AID, and 0.4% in the WHU-RS19 dataset than the ConvNeXt-Tiny model without ECANet and LSR. The ConvNeXt-Tiny model with ECANet integration and LSR obtained an accuracy of 99.00±0.41% in the UC-Merced dataset, 95.08±0.20% in AID, and 99.50±0.31% in the WHU-RS19 dataset.

Downloads

Download data is not yet available.

References

G. Cheng, X. Xie, J. Han, L. Guo, and G.-S. Xia, “Remote Sensing Image Scene Classification Meets Deep Learning: Challenges, Methods, Benchmarks, and Opportunities,” IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, vol. 13, pp. 3735–3756, 2020, doi: 10.1109/JSTARS.2020.3005403.

C. Patel, D. Labana, S. Pandya, K. Modi, H. Ghayvat, and M. Awais, “Histogram of Oriented Gradient-Based Fusion of Features for Human Action Recognition in Action Video Sequences,” Sensors, vol. 20, p. 7299, Dec. 2020, doi: 10.3390/s20247299.

Y. Li, H. Tang, W. Xie, and W. Luo, “Multidimensional Local Binary Pattern for Hyperspectral Image Classification,” IEEE Transactions on Geoscience and Remote Sensing, vol. 60, pp. 1–13, 2022, doi: 10.1109/TGRS.2021.3069505.

W. Zhang, T. Zhou, C. xu, and M. Liu, “A SIFT-Like Feature Detector and Descriptor for Multibeam Sonar Imaging,” Journal of Sensors, vol. 2021, Jul. 2021, doi: 10.1155/2021/8845814.

N. Iqbal, R. Mumtaz, U. Shafi, and S. M. H. Zaidi, “Gray level co-occurrence matrix (GLCM) texture based crop classification using low altitude remote sensing platforms,” PeerJ Computer Science, vol. 7, 2021, [Online]. Available: https://api.semanticscholar.org/CorpusID:235453113

R. Pires de Lima and K. Marfurt, “Convolutional Neural Network for Remote-Sensing Scene Classification: Transfer Learning Analysis,” Remote Sensing, vol. 12, no. 1, 2020, doi: 10.3390/rs12010086.

X. Zhao, J. Zhang, J. Tian, L. Zhuo, and J. Zhang, “Residual Dense Network Based on Channel-Spatial Attention for the Scene Classification of a High-Resolution Remote Sensing Image,” Remote Sensing, vol. 12, no. 11, 2020, doi: 10.3390/rs12111887.

R. Cao, L. Fang, T. Lu, and N. He, “Self-Attention-Based Deep Feature Fusion for Remote Sensing Scene Classification,” IEEE Geoscience and Remote Sensing Letters, vol. 18, no. 1, pp. 43–47, 2021, doi: 10.1109/LGRS.2020.2968550.

J. Ji, T. Zhang, L. Jiang, W. Zhong, and H. Xiong, “Combining Multilevel Features for Remote Sensing Image Scene Classification With Attention Model,” IEEE Geoscience and Remote Sensing Letters, vol. 17, no. 9, pp. 1647–1651, 2020, doi: 10.1109/LGRS.2019.2949253.

Y. He, S. Zhou, and X. Quan, “Remote Sensing Image Scene Classification Based on ECA Attention Mechanism Convolutional Neural Network,” in 2022 IEEE 4th International Conference on Civil Aviation Safety and Information Technology (ICCASIT), 2022, pp. 1265–1269. doi: 10.1109/ICCASIT55263.2022.9987089.

Z. Zhao, J. Li, Z. Luo, J. Li, and C. Chen, “Remote Sensing Image Scene Classification Based on an Enhanced Attention Module,” IEEE Geoscience and Remote Sensing Letters, vol. 18, no. 11, pp. 1926–1930, 2021, doi: 10.1109/LGRS.2020.3011405.

R. Tombe and S. Viriri, “Remote Sensing Image Scene Classification: Advances and Open Challenges,” Geomatics, vol. 3, no. 1, pp. 137–155, 2023, doi: 10.3390/geomatics3010007.

Z. Liu, H. Mao, C.-Y. Wu, C. Feichtenhofer, T. Darrell, and S. Xie, “A ConvNet for the 2020s,” in 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2022, pp. 11966–11976. doi: 10.1109/CVPR52688.2022.01167.

Q. Wang, B. Wu, P. Zhu, P. Li, W. Zuo, and Q. Hu, “ECA-Net: Efficient Channel Attention for Deep Convolutional Neural Networks,” in 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Los Alamitos, CA, USA: IEEE Computer Society, Jun. 2020, pp. 11531–11539. doi: 10.1109/CVPR42600.2020.01155.

L. Alzubaidi et al., “Review of deep learning: concepts, CNN architectures, challenges, applications, future directions.,” J Big Data, vol. 8, no. 1, p. 53, 2021, doi: 10.1186/s40537-021-00444-8.

X. Zhao, L. Wang, Y. Zhang, X. Han, M. Deveci, and M. Parmar, “A review of convolutional neural networks in computer vision,” Artificial Intelligence Review, vol. 57, no. 4, p. 99, Mar. 2024, doi: 10.1007/s10462-024-10721-6.

Z. Chen, J. Yang, Z. Feng, and L. Chen, “RSCNet: An Efficient Remote Sensing Scene Classification Model Based on Lightweight Convolution Neural Networks,” Electronics, vol. 11, p. 3727, Nov. 2022, doi: 10.3390/electronics11223727.

Z. Li, T. Gu, B. Li, W. Xu, X. He, and X. Hui, “ConvNeXt-Based Fine-Grained Image Classification and Bilinear Attention Mechanism Model,” Applied Sciences, vol. 12, no. 18, 2022, doi: 10.3390/app12189016.

X. Wang, “Improving Bag-of-Deep-Visual-Words Model via Combining Deep Features With Feature Difference Vectors,” IEEE Access, vol. 10, pp. 35824–35834, 2022, doi: 10.1109/ACCESS.2022.3163256.

Y. Long et al., “On Creating Benchmark Dataset for Aerial Image Interpretation: Reviews, Guidances and Million-AID,” IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, vol. PP, pp. 1–1, Apr. 2021, doi: 10.1109/JSTARS.2021.3070368.

I. Kwong, F. Wong, T. Fung, E. Liu, R. Lee, and T. Ng, “A Multi-Stage Approach Combining Very High-Resolution Satellite Image, GIS Database and Post-Classification Modification Rules for Habitat Mapping in Hong Kong,” Remote Sensing, vol. 14, p. 67, Dec. 2021, doi: 10.3390/rs14010067.

S.-B. Chen, Q.-S. Wei, W.-Z. Wang, J. Tang, B. Luo, and Z.-Y. Wang, “Remote Sensing Scene Classification via Multi-Branch Local Attention Network,” IEEE Transactions on Image Processing, vol. 31, pp. 99–109, 2022, doi: 10.1109/TIP.2021.3127851.

M. Ismail, “A very high-resolution scene classification model using transfer deepCNNs based on saliency features,” Signal Image and Video Processing, vol. 15, Jun. 2021, doi: 10.1007/s11760-020-01801-5.

R. M. Anwer, F. S. Khan, and J. Laaksonen, “Compact Deep Color Features for Remote Sensing Scene Classification,” Neural Processing Letters, vol. 53, no. 2, pp. 1523–1544, Apr. 2021, doi: 10.1007/s11063-021-10463-4.

A. A. Aljabri, A. Alshanqiti, A. B. Alkhodre, A. Alzahem, and A. Hagag, “A Remote Sensing Scene Classification Model Based on EfficientNet-V2L Deep Neural Networks,” International Journal of Computer Science and Network Security, vol. 22, no. 10, pp. 406–412, Oct. 2022.

K. Qi, C. Yang, C. Hu, H. Zhai, Q. Guan, and S. Shen, “A multi-level improved circle pooling for scene classification of high-resolution remote sensing imagery,” Neurocomputing, vol. 462, pp. 506–522, 2021, doi: https://doi.org/10.1016/j.neucom.2021.08.022.

W. Zhang, P. Tang, and L. Zhao, “Remote Sensing Image Scene Classification Using CNN-CapsNet,” Remote Sensing, vol. 11, no. 5, 2019, doi: 10.3390/rs11050494.

H. Alhichri, A. S. Alswayed, Y. Bazi, N. Ammour, and N. A. Alajlan, “Classification of Remote Sensing Images Using EfficientNet-B3 CNN Model With Attention,” IEEE Access, vol. 9, pp. 14078–14094, 2021, doi: 10.1109/ACCESS.2021.3051085.

Published
2024-06-21
How to Cite
Rachmawan Atmaji Perdana, Aniati Murni Arimurthy, & Risnandar. (2024). Remote Sensing Scene Classification using ConvNeXt-Tiny Model with Attention Mechanism and Label Smoothing. Jurnal RESTI (Rekayasa Sistem Dan Teknologi Informasi), 8(3), 389 - 400. https://doi.org/10.29207/resti.v8i3.5731
Section
Information Systems Engineering Articles