Yue, H, Qing, L ORCID: https://orcid.org/0000-0003-3555-0005, Zhang, Z ORCID: https://orcid.org/0000-0002-3933-5401, Wang, Z, Guo, L ORCID: https://orcid.org/0000-0003-1272-8480 and Peng, Y ORCID: https://orcid.org/0000-0002-5508-1819 (2024) MSE-Net: A novel master–slave encoding network for remote sensing scene classification. Engineering Applications of Artificial Intelligence, 132. 107909. ISSN 0952-1976
Accepted Version
File will be available on: 25 January 2025. Available under License Creative Commons Attribution Non-commercial No Derivatives. Download (733kB) |
Abstract
Remote sensing scene (RSS) image classification plays a vital role in various fields such as urban planning and environmental protection. However, due to higher inter-class similarity and intra-class variability, achieving accurate classification for RSS images poses a considerable challenge for current convolutional neural networks (CNNs)-based and visual transformer (ViT)-based methods. To address these issues, this paper proposes a novel dual-encoding method named master–slave encoding network (MSE-Net) from two perspectives of feature extraction and fusion. The master encoder, based on ViT, extracts higher-level semantic features, while the slave encoder, based on CNN, captures relative lower-level spatial structure information. Secondly, to integrate feature information from the two encoders effectively, this paper further develop two fusion strategies. The first strategy involves the auxiliary enhancement units (AEUs), which eliminates semantic divergence between the two encoders, enhances spatial context awareness of the slave encoder and promotes effective feature learning. The interactive perception unit (IPU), as the second strategy, facilitates interaction and integration of the two encoders’ representations to extract more discriminative feature information. In addition, we conducted comparative experiments on four widely-used RSS datasets, including RSSCN7, SIRI-WHU, the aerial image dataset (AID) and NWPU-RESISC45 (NWPU45), to verify the effectiveness of MSE-Net. The experimental results demonstrate that MSE-Net achieved state-of-the-art (SOTA) performance across all the datasets.
Impact and Reach
Statistics
Additional statistics for this dataset are available via IRStats2.