Regression Mean (RegMean), an approach that formulates model merging as a
linear regression problem, aims to find the optimal weights for each linear
layer in the merge model by minimizing the discrepancy in predictions between
the merge and candid... read more
We introduce Skywork UniPic, a 1.5 billion-parameter autoregressive model
that unifies image understanding, text-to-image generation, and image editing
within a single architecture-eliminating the need for task-specific adapters or
inter-module con... read more
This study introduces a novel framework, "Comprehensive Optimization and
Refinement through Ensemble Fusion in Domain Adaptation for Person
Re-identification (CORE-ReID)", to address an Unsupervised Domain Adaptation
(UDA) for Person Re-identificat... read more
Terrain elevation modeling for off-road navigation aims to accurately
estimate changes in terrain geometry in real-time and quantify the
corresponding uncertainties. Having precise estimations and uncertainties plays
a crucial role in planning and ... read more
Personalized generation in T2I diffusion models aims to naturally incorporate
individual user preferences into the generation process with minimal user
intervention. However, existing studies primarily rely on prompt-level modeling
with large-scale... read more
Open Vocabulary Human-Object Interaction (HOI) detection aims to detect
interactions between humans and objects while generalizing to novel interaction
classes beyond the training set. Current methods often rely on Vision and
Language Models (VLMs)... read more
Antiretroviral therapy (ART) has transformed HIV from a rapidly progressive and fatal disease to a chronic disease with limited impact on life expectancy. However, people living with HIV(PLWHs) faced high critical illness risk due to the increased pr... read more
Accurate preoperative prediction of erectile dysfunction (ED) is important
for counseling patients undergoing radical prostatectomy. While clinical
features are established predictors, the added value of preoperative MRI
remains underexplored. We i... read more
Cross-domain Few-shot Medical Image Segmentation (CD-FSMIS) is a potential
solution for segmenting medical images with limited annotation using knowledge
from other domains. The significant performance of current CD-FSMIS models
relies on the heavi... read more
Join thousands of healthcare professionals staying informed about the latest AI breakthroughs in medicine. Get curated insights delivered to your inbox.