SPECIAL: Zero-shot Hyperspectral Image Classification With CLIP
Journal:
arXiv
Published Date:
Jan 27, 2025
Abstract
Hyperspectral image (HSI) classification aims at categorizing each pixel in
an HSI into a specific land cover class, which is crucial for applications like
remote sensing, environmental monitoring, and agriculture. Although deep
learning-based HSI classification methods have achieved significant
advancements, existing methods still rely on manually labeled data for
training, which is both time-consuming and labor-intensive. To address this
limitation, we introduce a novel zero-shot hyperspectral image classification
framework based on CLIP (SPECIAL), aiming to eliminate the need for manual
annotations. The SPECIAL framework consists of two main stages: (1) CLIP-based
pseudo-label generation, and (2) noisy label learning. In the first stage, HSI
is spectrally interpolated to produce RGB bands. These bands are subsequently
classified using CLIP, resulting in noisy pseudo-labels that are accompanied by
confidence scores. To improve the quality of these labels, we propose a scaling
strategy that fuses predictions from multiple spatial scales. In the second
stage, spectral information and a label refinement technique are incorporated
to mitigate label noise and further enhance classification accuracy.
Experimental results on three benchmark datasets demonstrate that our SPECIAL
outperforms existing methods in zero-shot HSI classification, showing its
potential for more practical applications. The code is available at
https://github.com/LiPang/SPECIAL.