Latest AI and machine learning research in covid-19 for healthcare professionals.
Face editing modifies the appearance of face, which plays a key role in customization and enhancem...
Segment Anything Models (SAMs), as vision foundation models, have demonstrated remarkable performa...
Referring Video Object Segmentation (RVOS) aims to segment target objects throughout a video based...
MOTIVATION: Predicting the structure of antibody-antigen complexes is a challenging task with signif...
With the advance of high-throughput genotyping and sequencing technologies, it becomes feasible to...
Zero-shot depth estimation (DE) models exhibit strong generalization performance as they are train...
We introduce LOCORE, Long-Context Re-ranker, a model that takes as input local descriptors corresp...
In this paper, we introduce zero-shot audio-video editing, a novel task that requires transforming...
Accurate segmentation of nodules in both 2D breast ultrasound (BUS) and 3D automated breast ultras...
The level set estimation problem seeks to identify regions within a set of candidate points where ...
Sora has unveiled the immense potential of the Diffusion Transformer (DiT) architecture in single-...
Segmentation is a fundamental task in computer vision, with prompt-driven methods gaining prominen...
The acquisition of annotated datasets with paired images and segmentation masks is a critical chal...
We present a target-aware video diffusion model that generates videos from an input image in which...
High-quality test datasets are crucial for assessing the reliability of Deep Neural Networks (DNNs...
Learning-based point cloud compression methods have made significant progress in terms of performa...
Recent advances in diffusion models bring new vitality to visual content creation. However, curren...
High-resolution semantic segmentation is essential for applications such as image editing, bokeh i...
Image geolocalization is a fundamental yet challenging task, aiming at inferring the geolocation o...
Point clouds, which directly record the geometry and attributes of scenes or objects by a large nu...
Recent 3D face editing methods using masks have produced high-quality edited images by leveraging ...