AIMC Topic: Benchmarking

Clear Filters Showing 51 to 60 of 490 articles

Multi-institutional Knowledge-Based (KB) plan prediction benchmark models for whole breast irradiation.

Physica medica : PM : an international journal devoted to the applications of physics to medicine and biology : official journal of the Italian Association of Biomedical Physics (AIFB)
PURPOSE: To train and validate KB prediction models by merging a large multi-institutional cohort of whole breast irradiation (WBI) plans using tangential fields.

A benchmark of deep learning approaches to predict lung cancer risk using national lung screening trial cohort.

Scientific reports
Deep learning (DL) methods have demonstrated remarkable effectiveness in assisting with lung cancer risk prediction tasks using computed tomography (CT) scans. However, the lack of comprehensive comparison and validation of state-of-the-art (SOTA) mo...

Success History Adaptive Competitive Swarm Optimizer with Linear Population Reduction: Performance benchmarking and application in eye disease detection.

Computers in biology and medicine
Eye disease detection has achieved significant advancements thanks to artificial intelligence (AI) techniques. However, the construction of high-accuracy predictive models still faces challenges, and one reason is the deficiency of the optimizer. Thi...

A New Benchmark: Clinical Uncertainty and Severity Aware Labeled Chest X-Ray Images With Multi-Relationship Graph Learning.

IEEE transactions on medical imaging
Chest radiography, commonly known as CXR, is frequently utilized in clinical settings to detect cardiopulmonary conditions. However, even seasoned radiologists might offer different evaluations regarding the seriousness and uncertainty associated wit...

Systematic benchmarking of deep-learning methods for tertiary RNA structure prediction.

PLoS computational biology
The 3D structure of RNA critically influences its functionality, and understanding this structure is vital for deciphering RNA biology. Experimental methods for determining RNA structures are labour-intensive, expensive, and time-consuming. Computati...

Benchmarking the performance of large language models in uveitis: a comparative analysis of ChatGPT-3.5, ChatGPT-4.0, Google Gemini, and Anthropic Claude3.

Eye (London, England)
BACKGROUND/OBJECTIVE: This study aimed to evaluate the accuracy, comprehensiveness, and readability of responses generated by various Large Language Models (LLMs) (ChatGPT-3.5, Gemini, Claude 3, and GPT-4.0) in the clinical context of uveitis, utiliz...

Unmasking the chameleons: A benchmark for out-of-distribution detection in medical tabular data.

International journal of medical informatics
BACKGROUND: Machine Learning (ML) models often struggle to generalize effectively to data that deviates from the training distribution. This raises significant concerns about the reliability of real-world healthcare systems encountering such inputs k...

MedSegBench: A comprehensive benchmark for medical image segmentation in diverse data modalities.

Scientific data
MedSegBench is a comprehensive benchmark designed to evaluate deep learning models for medical image segmentation across a wide range of modalities. It covers a wide range of modalities, including 35 datasets with over 60,000 images from ultrasound, ...

MultiADE: A Multi-domain benchmark for Adverse Drug Event extraction.

Journal of biomedical informatics
OBJECTIVE: Active adverse event surveillance monitors Adverse Drug Events (ADE) from different data sources, such as electronic health records, medical literature, social media and search engine logs. Over the years, many datasets have been created, ...

A multi-species benchmark for training and validating mass spectrometry proteomics machine learning models.

Scientific data
Training machine learning models for tasks such as de novo sequencing or spectral clustering requires large collections of confidently identified spectra. Here we describe a dataset of 2.8 million high-confidence peptide-spectrum matches derived from...