AMLRIS: Alignment-aware Masked Learning for Referring Image Segmentation

Journal: arXiv

Published Date: Feb 26, 2026

Abstract

Referring Image Segmentation (RIS) aims to segment an object in an image identified by a natural language expression. The paper introduces Alignment-Aware Masked Learning (AML), a training strategy to enhance RIS by explicitly estimating pixel-level vision-language alignment, filtering out poorly aligned regions during optimization, and focusing on trustworthy cues. This approach results in state-of-the-art performance on RefCOCO datasets and also enhances robustness to diverse descriptions and scenarios

Authors

Tongfei Chen; Shuo Yang; Yuguang Yang; Linlin Yang; Runtang Guo; Changbai Li; He Long; Chunyu Xie; Dawei Leng; Baochang Zhang

External Resources

View on arXiv arXiv (2602.22740v1)

AMLRIS: Alignment-aware Masked Learning for Referring Image Segmentation

Abstract

Authors

Categories

External Resources

Popular Topics

Recent Journals

AMLRIS: Alignment-aware Masked Learning for Referring Image Segmentation

Abstract

Authors

Categories

External Resources

Don't Miss the Future of Medicine

Popular Topics

Recent Journals