Benchmarking - AI Medical Compendium

SynEL: A synthetic benchmark for entity linking.

PloS one Jan 8, 2026

Large language models (LLMs) offer significant potential for constructing commonsense knowledge graphs from text, demonstrating adaptability across diverse domains. However, their effectiveness varies significantly with domain-specific language, high...

Humans Natural Language Processing Benchmarking Data Mining Algorithms Language

View on PubMed DOI

A Standardized Benchmark for Machine-Learned Molecular Dynamics Using Weighted Ensemble Sampling.

The journal of physical chemistry. B Dec 4, 2025

The rapid evolution of molecular dynamics (MD) methods, including machine-learned dynamics, has outpaced the development of standardized tools for method validation. Objective comparison between simulation approaches is often hindered by inconsistent...

Protein Conformation Benchmarking Molecular Dynamics Simulation Machine Learning Proteins

View on PubMed DOI

Benchmarking retrieval-augmented large language models in biomedical NLP: Application, robustness, and self-awareness.

Science advances Nov 21, 2025

To reduce hallucinations in large language models (LLMs), retrieval-augmented LLMs (RALs) retrieve supporting knowledge from external databases. However, their performance on biomedical natural language processing (NLP) tasks remains underexplored. W...

Information Storage and Retrieval Large Language Models Natural Language Processing Benchmarking Humans Algorithms

View on PubMed DOI

Benchmarking Sequence-Based Compound-Protein Interaction Prediction through Constructing a Debiased Data Set CDPN.

Journal of chemical information and modeling Nov 20, 2025

Accurate prediction of compound-protein interactions (CPIs) is critical for drug discovery, but existing data sets often suffer from biases that hinder model generalization. Here, we first highlighted that over-represented molecular scaffolds and imb...

Drug Discovery Proteins Benchmarking Deep Learning Protein Binding

View on PubMed DOI

Benchmarking deep learning methods for biologically conserved single-cell integration.

Genome biology Nov 20, 2025

BACKGROUND: Advancements in single-cell RNA sequencing have enabled the analysis of millions of cells, but integrating such data across samples and methods while mitigating batch effects remains challenging. Deep learning approaches address this by l...

Benchmarking Single-Cell Analysis Sequence Analysis, RNA Humans Deep Learning

View on PubMed DOI

Meta simulation approach for evaluating machine learning method selection in data limited settings.

Scientific reports Nov 19, 2025

Selecting appropriate machine learning (ML) methods for domain-specific tasks remains a persistent challenge, particularly in medicine where datasets are often small, heterogeneous, and incomplete. Traditional benchmarking strategies rely on limited ...

Computer Simulation Benchmarking Machine Learning Algorithms Humans

View on PubMed DOI

DNALONGBENCH: a benchmark suite for long-range DNA prediction tasks.

Nature communications Nov 18, 2025

Modeling long-range DNA dependencies is crucial for understanding genome structure and function across diverse biological contexts. However, effectively capturing these dependencies, which may span millions of base pairs in tasks such as three-dimens...

Chromatin Humans Benchmarking Computational Biology Deep Learning Neural Networks, Computer DNA Quantitative Trait Loci Genomics Software Sequence Analysis, DNA

View on PubMed DOI

Benchmarking YOLOv8 to YOLOv13 for robust hand gesture recognition in human-robot interaction.

Scientific reports Nov 14, 2025

Real-time and accurate hand gesture detection is essential for safe and intuitive Human-Robot Interaction (HRI), enabling robots to interpret non-verbal cues and respond appropriately in dynamic environments. This research evaluates the effectiveness...

Hand Benchmarking Pattern Recognition, Automated Gestures Robotics Humans Algorithms

View on PubMed DOI

Benchmarking diffusion models against state-of-the-art architectures for OCT fluid biomarker segmentation.

PloS one Oct 29, 2025

OBJECTIVES: Retinal diseases, major causes of vision impairment and blindness, are assessed using optical coherence tomography (OCT) scans. Automated report generation for retinal OCT scans, powered by deep learning, can help standardize interpretati...

Humans Subretinal Fluid Tomography, Optical Coherence Retina Biomarkers Retinal Detachment Benchmarking Retinal Diseases Deep Learning

View on PubMed DOI

EasyGeSe - a resource for benchmarking genomic prediction methods.

BMC genomics Oct 24, 2025

BACKGROUND: Genomic prediction is a widely used method to predict phenotypes from genotypic data. Advances in both biological and computer science have enabled the generation of vast amounts of data and the development of new algorithms, specifically...

Phenotype Software Genomics Benchmarking Animals Algorithms Databases, Genetic

View on PubMed DOI

AIMC Topic: Benchmarking

SynEL: A synthetic benchmark for entity linking.

A Standardized Benchmark for Machine-Learned Molecular Dynamics Using Weighted Ensemble Sampling.

Benchmarking retrieval-augmented large language models in biomedical NLP: Application, robustness, and self-awareness.

Benchmarking Sequence-Based Compound-Protein Interaction Prediction through Constructing a Debiased Data Set CDPN.

Benchmarking deep learning methods for biologically conserved single-cell integration.

Meta simulation approach for evaluating machine learning method selection in data limited settings.

DNALONGBENCH: a benchmark suite for long-range DNA prediction tasks.

Benchmarking YOLOv8 to YOLOv13 for robust hand gesture recognition in human-robot interaction.

Benchmarking diffusion models against state-of-the-art architectures for OCT fluid biomarker segmentation.

EasyGeSe - a resource for benchmarking genomic prediction methods.

Popular Topics

Recent Journals

AIMC Topic: Benchmarking

Stay Ahead of Medical AI

Popular Topics

Recent Journals