Benchmarking - AI Medical Compendium

Germline-aware deep learning models and benchmarks for predicting antibody VH-VL pairing.

mAbs Oct 17, 2025

Variable heavy (VH) and variable light (VL) chain pairing is a critical determinant of antibody diversity, stability, and antigen-binding specificity. Identifying productive VH - VL combinations experimentally is labor-intensive and costly, motivatin...

Humans Deep Learning Immunoglobulin Light Chains Immunoglobulin Variable Region Immunoglobulin Heavy Chains Computational Biology Benchmarking

View on PubMed DOI

A comprehensive benchmark of single-cell Hi-C embedding tools.

Nature communications Oct 14, 2025

Embedding is the key step in single-cell Hi-C (scHi-C) analysis which relies on capturing biological meaningful heterogeneity at various levels of genome architecture. To understand the strength and limitations of existing tools in various applicatio...

Benchmarking Neural Networks, Computer Software Mice Deep Learning Computational Biology Single-Cell Analysis Animals Humans

View on PubMed DOI

LLM ethics benchmark: a three-dimensional assessment system for evaluating moral reasoning in large language models.

Scientific reports Oct 5, 2025

This study establishes a novel framework for systematically evaluating the moral reasoning capabilities of large language models (LLMs) as they increasingly integrate into critical societal domains. Current assessment methodologies lack the precision...

Humans Artificial Intelligence Language Morals Large Language Models Decision Making Benchmarking

View on PubMed DOI

Benchmarking Machine Learning Models for HIV-1 Protease Inhibitor Resistance Prediction: Impact of Data Set Construction and Feature Representation.

Journal of chemical information and modeling Sep 25, 2025

The rapid emergence of drug resistance in viral infections represents a significant global health challenge, threatening the efficacy of treatments for multiple diseases. Machine learning models have emerged as valuable tools for predicting antiviral...

Benchmarking HIV-1 Drug Resistance, Viral Neural Networks, Computer Machine Learning HIV Protease HIV Protease Inhibitors Humans

View on PubMed DOI

A comprehensive benchmarking of adaptive sampling tools for nanopore sequencing.

Genome biology Sep 17, 2025

BACKGROUND: Adaptive sampling is an emerging technology to enrich target reads while depleting unwanted reads during real-time nanopore sequencing. The application of different algorithms has spawned various tools for the determination of read reject...

Sequence Analysis, DNA Benchmarking Nanopore Sequencing Saccharomyces cerevisiae Algorithms Deep Learning Humans Software

View on PubMed DOI

Improved American College of Surgeons NSQIP Hospital Benchmarking with Risk Adjustment for Many CPT Codes Rather Than Just the Principal Code.

Journal of the American College of Surgeons Sep 16, 2025

BACKGROUND: Because of technical limitations inherent to logistic regression, NSQIP benchmarking has historically risk adjusted for procedure using only 1 principal CPT code among other predictors. This has the potential to create bias (favorable or ...

Humans Benchmarking Quality Improvement Hospitals Female Logistic Models Surgical Procedures, Operative Risk Adjustment Male United States Current Procedural Terminology

View on PubMed DOI

Precision in Predicting Protein-Nucleic Acid Complexes: Establishing a Benchmark Data Set and Comparative Metrics.

Journal of chemical information and modeling Sep 11, 2025

Protein-nucleic acid interactions are fundamental to biological processes and biotechnology, yet their computational prediction lags behind protein structure or protein-protein interaction modeling. This study introduces ProNASet, a benchmark data se...

Protein Binding Proteins Nucleic Acids Benchmarking Molecular Docking Simulation Deep Learning

View on PubMed DOI

The imitation game: large language models versus multidisciplinary tumor boards: benchmarking AI against 21 sarcoma centers from the ring trial.

Journal of cancer research and clinical oncology Sep 10, 2025

PURPOSE: The study aims to compare the treatment recommendations generated by four leading large language models (LLMs) with those from 21 sarcoma centers' multidisciplinary tumor boards (MTBs) of the sarcoma ring trial in managing complex soft tissu...

Large Language Models Cancer Care Facilities Language Humans Benchmarking Sarcoma

View on PubMed DOI

Benchmarking feature projection methods in radiomics.

Scientific reports Sep 5, 2025

In radiomics, feature selection methods are primarily used to eliminate redundant features and identify relevant ones. Feature projection methods, such as principal component analysis (PCA), are often avoided due to concerns that recombining features...

Image Processing, Computer-Assisted Benchmarking Algorithms Radiomics Humans Tomography, X-Ray Computed Magnetic Resonance Imaging ROC Curve Principal Component Analysis

View on PubMed DOI

GATmath and GATLc: Comprehensive benchmarks for evaluating Arabic large language models.

PloS one Sep 2, 2025

The evolution of Large Language Models (LLMs) has significantly advanced artificial intelligence, driving innovation across various applications. Their continued development relies on a deep understanding of their capabilities and limitations. This i...

Semantics Language Humans Large Language Models Benchmarking Artificial Intelligence

View on PubMed DOI

AIMC Topic: Benchmarking

Germline-aware deep learning models and benchmarks for predicting antibody VH-VL pairing.

A comprehensive benchmark of single-cell Hi-C embedding tools.

LLM ethics benchmark: a three-dimensional assessment system for evaluating moral reasoning in large language models.

Benchmarking Machine Learning Models for HIV-1 Protease Inhibitor Resistance Prediction: Impact of Data Set Construction and Feature Representation.

A comprehensive benchmarking of adaptive sampling tools for nanopore sequencing.

Improved American College of Surgeons NSQIP Hospital Benchmarking with Risk Adjustment for Many CPT Codes Rather Than Just the Principal Code.

Precision in Predicting Protein-Nucleic Acid Complexes: Establishing a Benchmark Data Set and Comparative Metrics.

The imitation game: large language models versus multidisciplinary tumor boards: benchmarking AI against 21 sarcoma centers from the ring trial.

Benchmarking feature projection methods in radiomics.

GATmath and GATLc: Comprehensive benchmarks for evaluating Arabic large language models.

Popular Topics

Recent Journals

AIMC Topic: Benchmarking

Don't Miss the Future of Medicine

Popular Topics

Recent Journals