AIMC Topic: Data Mining

Clear Filters Showing 571 to 580 of 1567 articles

A Lightweight API-Based Approach for Building Flexible Clinical NLP Systems.

Journal of healthcare engineering
Natural language processing (NLP) has become essential for secondary use of clinical data. Over the last two decades, many clinical NLP systems were developed in both academia and industry. However, nearly all existing systems are restricted to speci...

Clustering-based undersampling with random over sampling examples and support vector machine for imbalanced classification of breast cancer diagnosis.

Computer assisted surgery (Abingdon, England)
To overcome the two-class imbalanced classification problem existing in the diagnosis of breast cancer, a hybrid of Random Over Sampling Example, K-means and Support vector machine (RK-SVM) model is proposed which is based on sample selection. Random...

Machine learning and big data: Implications for disease modeling and therapeutic discovery in psychiatry.

Artificial intelligence in medicine
INTRODUCTION: Machine learning capability holds promise to inform disease models, the discovery and development of novel disease modifying therapeutics and prevention strategies in psychiatry. Herein, we provide an introduction on how machine learnin...

Combining a gravitational search algorithm, particle swarm optimization, and fuzzy rules to improve the classification performance of a feed-forward neural network.

Computer methods and programs in biomedicine
BACKGROUND AND OBJECTIVE: A feed-forward neural network (FNN) is a type of artificial neural network that has been widely used in medical diagnosis, data mining, stock market analysis, and other fields. Many studies have used FNN to develop medical d...

Cross-registry neural domain adaptation to extract mutational test results from pathology reports.

Journal of biomedical informatics
OBJECTIVE: We study the performance of machine learning (ML) methods, including neural networks (NNs), to extract mutational test results from pathology reports collected by cancer registries. Given the lack of hand-labeled datasets for mutational te...

Hierarchical Hidden Markov models enable accurate and diverse detection of antimicrobial resistance sequences.

Communications biology
The characterization of antimicrobial resistance genes from high-throughput sequencing data has become foundational in public health research and regulation. This requires mapping sequence reads to databases of known antimicrobial resistance genes to...

Adaptive robust principal component analysis.

Neural networks : the official journal of the International Neural Network Society
Robust Principal Component Analysis (RPCA) is a powerful tool in machine learning and data mining problems. However, in many real-world applications, RPCA is unable to well encode the intrinsic geometric structure of data, thereby failing to obtain t...

Machine learning and data mining frameworks for predicting drug response in cancer: An overview and a novel in silico screening process based on association rule mining.

Pharmacology & therapeutics
A major challenge in cancer treatment is predicting the clinical response to anti-cancer drugs on a personalized basis. The success of such a task largely depends on the ability to develop computational resources that integrate big "omic" data into e...

Using distant supervision to augment manually annotated data for relation extraction.

PloS one
Significant progress has been made in applying deep learning on natural language processing tasks recently. However, deep learning models typically require a large amount of annotated training data while often only small labeled datasets are availabl...