Benchmarking - AI Medical Compendium

Multi-institutional Knowledge-Based (KB) plan prediction benchmark models for whole breast irradiation.

Physica medica : PM : an international journal devoted to the applications of physics to medicine and biology : official journal of the Italian Association of Biomedical Physics (AIFB) Jan 16, 2025

PURPOSE: To train and validate KB prediction models by merging a large multi-institutional cohort of whole breast irradiation (WBI) plans using tangential fields.

Radiotherapy Dosage; Knowledge Bases; Radiotherapy Planning, Computer-Assisted; Humans; Female; Breast; Benchmarking; Breast Neoplasms

A benchmark of deep learning approaches to predict lung cancer risk using national lung screening trial cohort.

Scientific reports Jan 11, 2025

Deep learning (DL) methods have demonstrated remarkable effectiveness in assisting with lung cancer risk prediction tasks using computed tomography (CT) scans. However, the lack of comprehensive comparison and validation of state-of-the-art (SOTA) mo...

Humans; Tomography, X-Ray Computed; Aged; Male; Middle Aged; Female; Risk Factors; Benchmarking; Cohort Studies; Lung Neoplasms; Early Detection of Cancer; Risk Assessment; Deep Learning

Success History Adaptive Competitive Swarm Optimizer with Linear Population Reduction: Performance benchmarking and application in eye disease detection.

Computers in biology and medicine Jan 2, 2025

Eye disease detection has achieved significant advancements thanks to artificial intelligence (AI) techniques. However, the construction of high-accuracy predictive models still faces challenges, and one reason is the deficiency of the optimizer. Thi...

Benchmarking; Eye Diseases; Humans; Machine Learning; Algorithms; Artificial Intelligence

A New Benchmark: Clinical Uncertainty and Severity Aware Labeled Chest X-Ray Images With Multi-Relationship Graph Learning.

IEEE transactions on medical imaging Jan 2, 2025

Chest radiography, commonly known as CXR, is frequently utilized in clinical settings to detect cardiopulmonary conditions. However, even seasoned radiologists might offer different evaluations regarding the seriousness and uncertainty associated wit...

Humans; Databases, Factual; Lung; Benchmarking; Radiography, Thoracic; Uncertainty; Deep Learning; Radiographic Image Interpretation, Computer-Assisted

Systematic benchmarking of deep-learning methods for tertiary RNA structure prediction.

PLoS computational biology Dec 30, 2024

The 3D structure of RNA critically influences its functionality, and understanding this structure is vital for deciphering RNA biology. Experimental methods for determining RNA structures are labour-intensive, expensive, and time-consuming. Computati...

Sequence Alignment; Computational Biology; Deep Learning; RNA; Sequence Analysis, RNA; Benchmarking; Nucleic Acid Conformation

Benchmarking the performance of large language models in uveitis: a comparative analysis of ChatGPT-3.5, ChatGPT-4.0, Google Gemini, and Anthropic Claude3.

Eye (London, England) Dec 17, 2024

BACKGROUND/OBJECTIVE: This study aimed to evaluate the accuracy, comprehensiveness, and readability of responses generated by various Large Language Models (LLMs) (ChatGPT-3.5, Gemini, Claude 3, and GPT-4.0) in the clinical context of uveitis, utiliz...

Benchmarking; Comprehension; Large Language Models; Generative Artificial Intelligence; Uveitis; Humans; Language

Unmasking the chameleons: A benchmark for out-of-distribution detection in medical tabular data.

International journal of medical informatics Dec 17, 2024

BACKGROUND: Machine Learning (ML) models often struggle to generalize effectively to data that deviates from the training distribution. This raises significant concerns about the reliability of real-world healthcare systems encountering such inputs k...

Machine Learning; Algorithms; Benchmarking; Humans

MedSegBench: A comprehensive benchmark for medical image segmentation in diverse data modalities.

Scientific data Nov 25, 2024

MedSegBench is a comprehensive benchmark designed to evaluate deep learning models for medical image segmentation across a wide range of modalities. It includes 35 datasets with over 60,000 images from ultrasound, ...

Image Processing, Computer-Assisted; Magnetic Resonance Imaging; Diagnostic Imaging; Humans; Benchmarking; Algorithms; Deep Learning; Ultrasonography

MultiADE: A Multi-domain benchmark for Adverse Drug Event extraction.

Journal of biomedical informatics Nov 12, 2024

OBJECTIVE: Active adverse event surveillance monitors Adverse Drug Events (ADE) from different data sources, such as electronic health records, medical literature, social media and search engine logs. Over the years, many datasets have been created, ...

Data Mining; Adverse Drug Reaction Reporting Systems; Natural Language Processing; Benchmarking; Drug-Related Side Effects and Adverse Reactions; Electronic Health Records; Social Media; Machine Learning; Algorithms; Databases, Factual; Humans

A multi-species benchmark for training and validating mass spectrometry proteomics machine learning models.

Scientific data Nov 8, 2024

Training machine learning models for tasks such as de novo sequencing or spectral clustering requires large collections of confidently identified spectra. Here we describe a dataset of 2.8 million high-confidence peptide-spectrum matches derived from...

Machine Learning; Animals; Humans; Mass Spectrometry; Benchmarking; Peptides; Proteomics
