With the advancement of Artificial Intelligence (AI) towards multiple
modalities (language, vision, speech, etc.), multi-modal models have
increasingly been used across various applications (e.g., visual question
answering or image generation/capti... read more
In the semantic segmentation of remote sensing images, acquiring complete
ground objects is critical for achieving precise analysis. However, this task
is severely hindered by two major challenges: high intra-class variance and
high inter-class sim... read more
It is difficult for individuals and organizations to protect personal
information without a fundamental understanding of relative privacy risks. By
analyzing over 5,000 empirical identity theft and fraud cases, this research
identifies which types ... read more
Nowadays, many fields, such as healthcare, farming, factories, transportation, cities, and homes are connected via network devices. These systems are configured in open environments and are prone to malicious attacks. It is important to protect these... read more
We demonstrate that Principal Component Analysis (PCA), when applied in a
structured manner, either to polar-transformed images or segment-wise to token
sequences, enables extreme compression of neural models without sacrificing
performance. Across... read more
Multimodal Recommender Systems aim to improve recommendation accuracy by
integrating heterogeneous content, such as images and textual metadata. While
effective, it remains unclear whether their gains stem from true multimodal
understanding or incr... read more
Indian journal of gastroenterology : official journal of the Indian Society of Gastroenterology
Aug 6, 2025
BACKGROUND: Hyperspectral imaging (HSI), a high-throughput, three-dimensional technology initially developed for remote sensing, has recently garnered increasing interest in applications in gastroenterology and hepatology. We focus on the aspects of ... read more
Depression is a widespread mental disorder that affects millions worldwide.
While automated depression assessment shows promise, most studies rely on
limited or non-clinically validated data, and often prioritize complex model
design over real-worl... read more
BACKGROUND: Seasonal influenza is a major global public health concern, leading to escalated morbidity and mortality rates. Traditional early warning models rely on binary (0/1) classification methods, which issue alerts only when predefined threshol... read more
Interactive multimodal agents must convert raw visual observations into
coherent sequences of language-conditioned actions -- a capability that current
vision-language models (VLMs) still lack. Earlier reinforcement-learning (RL)
efforts could, in ... read more
Join thousands of healthcare professionals staying informed about the latest AI breakthroughs in medicine. Get curated insights delivered to your inbox.