AIMC Topic: Data Mining

Clear Filters Showing 1151 to 1160 of 1626 articles

Incorporating linguistic knowledge for learning distributed word representations.

PloS one
Combined with neural language models, distributed word representations achieve significant advantages in computational linguistics and text mining. Most existing models estimate distributed word vectors from large-scale data in an unsupervised fashio...

Multi-focus cluster labeling.

Journal of biomedical informatics
Document collections resulting from searches in the biomedical literature, for instance, in PubMed, are often so large that some organization of the returned information is necessary. Clustering is an efficient tool for organizing search results. To ...

Subgraph augmented non-negative tensor factorization (SANTF) for modeling clinical narrative text.

Journal of the American Medical Informatics Association : JAMIA
OBJECTIVE: Extracting medical knowledge from electronic medical records requires automated approaches to combat scalability limitations and selection biases. However, existing machine learning approaches are often regarded by clinicians as black boxe...

Using Ontology Fingerprints to disambiguate gene name entities in the biomedical literature.

Database : the journal of biological databases and curation
Ambiguous gene names in the biomedical literature are a barrier to accurate information extraction. To overcome this hurdle, we generated Ontology Fingerprints for selected genes that are relevant for personalized cancer therapy. These Ontology Finge...

Generating a focused view of disease ontology cancer terms for pan-cancer data integration and analysis.

Database : the journal of biological databases and curation
Bio-ontologies provide terminologies for the scientific community to describe biomedical entities in a standardized manner. There are multiple initiatives that are developing biomedical terminologies for the purpose of providing better annotation, da...

A semi-supervised learning framework for biomedical event extraction based on hidden topics.

Artificial intelligence in medicine
OBJECTIVES: Scientists have devoted decades of efforts to understanding the interaction between proteins or RNA production. The information might empower the current knowledge on drug reactions or the development of certain diseases. Nevertheless, du...

Automatic negation detection in narrative pathology reports.

Artificial intelligence in medicine
OBJECTIVE: To detect negations of medical entities in free-text pathology reports with different approaches, and evaluate their performances.

A linear-RBF multikernel SVM to classify big text corpora.

BioMed research international
Support vector machine (SVM) is a powerful technique for classification. However, SVM is not suitable for classification of large datasets or text corpora, because the training complexity of SVMs is highly dependent on the input size. Recent developm...

The application of data mining techniques to oral cancer prognosis.

Journal of medical systems
This study adopted an integrated procedure that combines the clustering and classification features of data mining technology to determine the differences between the symptoms shown in past cases where patients died from or survived oral cancer. Two ...