Semi-supervised learning for marine anomaly detection on board satellites
Journal:
arXiv
Published Date:
Mar 26, 2025
Abstract
Aquatic bodies face numerous environmental threats caused by several marine
anomalies. Marine debris can devastate habitats and endanger marine life
through entanglement, while harmful algal blooms can produce toxins that
negatively affect marine ecosystems. Additionally, ships may discharge oil or
engage in illegal and overfishing activities, causing further harm. These
marine anomalies can be identified by applying trained deep learning (DL)
models on multispectral satellite imagery. Furthermore, the detection of other
anomalies, such as clouds, could be beneficial in filtering out irrelevant
images. However, DL models often require a large volume of labeled data for
training, which can be both costly and time-consuming, particularly for marine
anomaly detection where expert annotation is needed. A potential solution is
the use of semi-supervised learning methods, which can also utilize unlabeled
data. In this project, we implement and study the performance of FixMatch for
Semantic Segmentation, a semi-supervised algorithm for semantic segmentation.
Firstly, we found that semi-supervised models perform best with a high
confidence threshold of 0.9 when there is a limited amount of labeled data.
Secondly, we compare the performance of semi-supervised models with
fully-supervised models under varying amounts of labeled data. Our findings
suggest that semi-supervised models outperform fully-supervised models with
limited labeled data, while fully-supervised models have a slightly better
performance with larger volumes of labeled data. We propose two hypotheses to
explain why fully-supervised models surpass semi-supervised ones when a high
volume of labeled data is used. All of our experiments were conducted using a
U-Net model architecture with a limited number of parameters to ensure
compatibility with space-rated hardware.