AIMC Topic: Sample Size

Clear Filters Showing 1 to 10 of 89 articles

Effect of training sample size, image resolution and epochs on filamentous and floc-forming bacteria classification using machine learning.

Journal of environmental management
Computer vision techniques can expedite the detection of bacterial growth in wastewater treatment plants and alleviate some of the shortcomings associated with traditional detection methods. In recent years, researchers capitalized on this potential ...

Larger sample sizes are needed when developing a clinical prediction model using machine learning in oncology: methodological systematic review.

Journal of clinical epidemiology
BACKGROUND AND OBJECTIVES: Having a sufficient sample size is crucial when developing a clinical prediction model. We reviewed details of sample size in studies developing prediction models for binary outcomes using machine learning (ML) methods with...

Area under the ROC Curve has the most consistent evaluation for binary classification.

PloS one
The proper use of model evaluation metrics is important for model evaluation and model selection in binary classification tasks. This study investigates how consistent different metrics are at evaluating models across data of different prevalence whi...

Addressing statistical challenges in the analysis of proteomics data with extremely small sample size: a simulation study.

BMC genomics
BACKGROUND: One of the most promising approaches for early and more precise disease prediction and diagnosis is through the inclusion of proteomics data augmented with clinical data. Clinical proteomics data is often characterized by its high dimensi...

Applying support vector machines to a diagnostic classification model for polytomous attributes in small-sample contexts.

The British journal of mathematical and statistical psychology
Over several years, the evaluation of polytomous attributes in small-sample settings has posed a challenge to the application of cognitive diagnosis models. To enhance classification precision, the support vector machine (SVM) was introduced for esti...

Detecting differentially expressed genes from RNA-seq data using fuzzy clustering.

The international journal of biostatistics
A two-group comparison test is generally performed on RNA sequencing data to detect differentially expressed genes (DEGs). However, the accuracy of this method is low due to the small sample size. To address this, we propose a method using fuzzy clus...

Biobjective gradient descent for feature selection on high dimension, low sample size data.

PloS one
Even though deep learning shows impressive results in several applications, its use on problems with High Dimensions and Low Sample Size, such as diagnosing rare diseases, leads to overfitting. One solution often proposed is feature selection. In dee...

Automatic categorization of self-acknowledged limitations in randomized controlled trial publications.

Journal of biomedical informatics
OBJECTIVE: Acknowledging study limitations in a scientific publication is a crucial element in scientific transparency and progress. However, limitation reporting is often inadequate. Natural language processing (NLP) methods could support automated ...

Enhancing the fairness of AI prediction models by Quasi-Pareto improvement among heterogeneous thyroid nodule population.

Nature communications
Artificial Intelligence (AI) models for medical diagnosis often face challenges of generalizability and fairness. We highlighted the algorithmic unfairness in a large thyroid ultrasound dataset with significant diagnostic performance disparities acro...

Toward Generalizable Machine Learning Models in Speech, Language, and Hearing Sciences: Estimating Sample Size and Reducing Overfitting.

Journal of speech, language, and hearing research : JSLHR
PURPOSE: Many studies using machine learning (ML) in speech, language, and hearing sciences rely upon cross-validations with single data splitting. This study's first purpose is to provide quantitative evidence that would incentivize researchers to i...