SemiOccam: A Robust Semi-Supervised Image Recognition Network Using Sparse Labels
Journal:
arXiv
Published Date:
Jun 4, 2025
Abstract
We present SemiOccam, an image recognition network that leverages
semi-supervised learning in a highly efficient manner. Existing works often
rely on complex training techniques and architectures, requiring hundreds of
GPU hours for training, while their generalization ability when dealing with
extremely limited labeled data remains to be improved. To address these
limitations, we construct a hierarchical mixture density classification
decision mechanism by optimizing mutual information between feature
representations and target classes, compressing redundant information while
retaining crucial discriminative components. Experimental results demonstrate
that our method achieves state-of-the-art performance on various datasets when
using negligible labeled samples, and its simple architecture keeps training
time to minute-level. Notably, this paper reveals a long-overlooked data
leakage issue in the STL-10 dataset for semi-supervised learning tasks and
removes duplicates to ensure the reliability of experimental results. We also
release the deduplicated CleanSTL-10 dataset to facilitate fair and reliable
research in future semi-supervised learning. Code available at
https://github.com/Shu1L0n9/SemiOccam.