Latest AI and machine learning research in palliative care and care of the terminally ill, for healthcare professionals.
Reliable recognition of standard cine cardiac MRI views is essential because each view determines wh...
Referring multi-object tracking (RMOT) is the task of associating all the objects in a video that sema...
Earth Observation (EO) systems are essentially designed to support domain experts who often express ...
Video depth estimation is essential for providing 3D scene structure in applications ranging from au...
Developing optical systems for free-space applications requires simulation tools that accurately cap...
Decreasing sequence length is a common way to accelerate transformers, but prior token reduction wor...
World action models (WAMs) have emerged as a promising direction for robot policy learning, as they ...
Deep learning models utilizing longitudinal healthcare data have significantly advanced epidemiologi...
End-to-end autonomous driving models based on Vision-Language-Action (VLA) architectures have shown ...
Most of the recent generative image super-resolution (SR) methods rely on adapting large text-to-ima...
This paper proposes an end-to-end shared attention estimation method via group detection. Most previ...
Optical character recognition remains critical infrastructure for document digitization, yet state-o...
Low left ventricular ejection fraction (LEF) frequently remains undetected until progression to symp...
Automatically extracting chemical structures from documents is essential for the large-scale analysi...
Medical Visual Question Answering (MedVQA) models often exhibit limited generalization due to relian...
We present Feature-Align CNN (FA-CNN), a prototype CNN architecture with intrinsic class attribution...
Low-light image enhancement (LLIE) has traditionally been formulated as a deterministic mapping. How...
Document parsing has recently advanced with multimodal large language models (MLLMs) that directly m...
End-to-end text-image machine translation (TIMT), which directly translates textual content in image...
Machine vision, including object recognition and image reconstruction, is a central technology in ma...
We introduce Latent-WAM, an efficient end-to-end autonomous driving framework that achieves strong t...