IEEE transactions on pattern analysis and machine intelligence
May 7, 2025
Understanding human emotions is crucial for a myriad of applications, from psychological research to advancements in Natural Language Processing (NLP). Traditionally, emotions are categorized into distinct basic groups, which has led to the developme...
IEEE transactions on pattern analysis and machine intelligence
Apr 18, 2025
Knowledge-based visual question answering (VQA) requires external knowledge beyond the image to answer the question. Early studies retrieve required knowledge from explicit knowledge bases (KBs), which often introduces irrelevant information to the q...
IEEE transactions on pattern analysis and machine intelligence
Apr 8, 2025
Recently, numerous benchmarks have been developed to evaluate the logical reasoning abilities of large language models (LLMs). However, assessing the equally important creative capabilities of LLMs is challenging due to the subjective, diverse, and d...
IEEE transactions on pattern analysis and machine intelligence
Apr 8, 2025
Given radiology images, automatic radiology report generation aims to produce informative text that reports diseases. It can benefit current clinical practice in diagnostic radiology. Existing methods typically rely on large-scale medical datasets an...
IEEE transactions on pattern analysis and machine intelligence
Apr 8, 2025
Despite the impressive advances in text-to-image models, they often struggle to effectively compose complex scenes with multiple objects, displaying various attributes and relationships. To address this challenge, we present T2I-CompBench++, an enhan...
IEEE transactions on pattern analysis and machine intelligence
Mar 6, 2025
Recovering whole-body mesh by inferring the abstract pose and shape parameters from visual content can obtain 3D bodies with realistic structures. However, the inferring process is highly non-linear and suffers from image-mesh misalignment, resulting...
IEEE transactions on pattern analysis and machine intelligence
Jan 9, 2025
Visual Speech Recognition (VSR) aims to infer speech into text depending on lip movements alone. As it focuses on visual information to model the speech, its performance is inherently sensitive to personal lip appearances and movements, and this make...
IEEE transactions on pattern analysis and machine intelligence
Jan 9, 2025
We study the domain adaptation task for action recognition, namely domain adaptive action recognition, which aims to effectively transfer action recognition power from a label-sufficient source domain to a label-free target domain. Since actions are ...
IEEE transactions on pattern analysis and machine intelligence
Dec 4, 2024
Longitudinal data with incomplete entries pose a significant challenge for clinical score regression over multiple time points. Although many methods primarily estimate longitudinal scores with complete baseline features (i.e., features collected at ...
IEEE transactions on pattern analysis and machine intelligence
Dec 4, 2024
Although data-driven methods usually have noticeable performance on disease diagnosis and treatment, they are suspected of leakage of privacy due to collecting data for model training. Recently, federated learning provides a secure and trustable alte...