AIMC Topic: Data Mining

Clear Filters Showing 1091 to 1100 of 1626 articles

Budget constrained non-monotonic feature selection.

Neural networks : the official journal of the International Neural Network Society
Feature selection is an important problem in machine learning and data mining. We consider the problem of selecting features under the budget constraint on the feature subset size. Traditional feature selection methods suffer from the "monotonic" pro...

Coronary artery disease risk assessment from unstructured electronic health records using text mining.

Journal of biomedical informatics
Coronary artery disease (CAD) often leads to myocardial infarction, which may be fatal. Risk factors can be used to predict CAD, which may subsequently lead to prevention or early intervention. Patient data such as co-morbidities, medication history,...

Annotating longitudinal clinical narratives for de-identification: The 2014 i2b2/UTHealth corpus.

Journal of biomedical informatics
The 2014 i2b2/UTHealth natural language processing shared task featured a track focused on the de-identification of longitudinal medical records. For this track, we de-identified a set of 1304 longitudinal medical records describing 296 patients. Thi...

Adapting existing natural language processing resources for cardiovascular risk factors identification in clinical notes.

Journal of biomedical informatics
The 2014 i2b2 natural language processing shared task focused on identifying cardiovascular risk factors such as high blood pressure, high cholesterol levels, obesity and smoking status among other factors found in health records of diabetic patients...

CRFs based de-identification of medical records.

Journal of biomedical informatics
De-identification is a shared task of the 2014 i2b2/UTHealth challenge. The purpose of this task is to remove protected health information (PHI) from medical records. In this paper, we propose a novel de-identifier, WI-deId, based on conditional rand...

Mining heart disease risk factors in clinical text with named entity recognition and distributional semantic models.

Journal of biomedical informatics
We present the design, and analyze the performance of a multi-stage natural language processing system employing named entity recognition, Bayesian statistics, and rule logic to identify and characterize heart disease risk factor events in diabetic p...

Classification of Medical Datasets Using SVMs with Hybrid Evolutionary Algorithms Based on Endocrine-Based Particle Swarm Optimization and Artificial Bee Colony Algorithms.

Journal of medical systems
The classification and analysis of data is an important issue in today's research. Selecting a suitable set of features makes it possible to classify an enormous quantity of data quickly and efficiently. Feature selection is generally viewed as a pro...

Risk factor detection for heart disease by applying text analytics in electronic medical records.

Journal of biomedical informatics
In the United States, about 600,000 people die of heart disease every year. The annual cost of care services, medications, and lost productivity reportedly exceeds 108.9 billion dollars. Effective disease risk assessment is critical to prevention, ca...

jEcho: an Evolved weight vector to CHaracterize the protein's posttranslational modification mOtifs.

Interdisciplinary sciences, computational life sciences
Protein's posttranslational modification (PTM) represents a major dynamic regulation of protein functions after the translation of polypeptide chains from mRNA molecule. Compared with the costly and labor-intensive wet laboratory characterization of ...