AIMC Topic: Data Mining

Clear Filters Showing 401 to 410 of 1566 articles

Multimodal Learning of Social Image Representation by Exploiting Social Relations.

IEEE transactions on cybernetics
Learning the representation for social images has recently made remarkable achievements for many tasks, such as cross-modal retrieval and multilabel classification. However, since social images contain both multimodal contents (e.g., visual images an...

Systematic auditing is essential to debiasing machine learning in biology.

Communications biology
Biases in data used to train machine learning (ML) models can inflate their prediction performance and confound our understanding of how and what they learn. Although biases are common in biological data, systematic auditing of ML models to identify ...

Computational approach for identification, characterization, three-dimensional structure modelling and machine learning-based thermostability prediction of xylanases from the genome of Aspergillus fumigatus.

Computational biology and chemistry
Identification of thermostable and alkaline xylanases from different fungal and bacterial species have gained an interest for the researchers because of its biotechnological relevance in many industries, such as pulp, paper, and bioethanol. In this s...

Task Similarity Estimation Through Adversarial Multitask Neural Network.

IEEE transactions on neural networks and learning systems
Multitask learning (MTL) aims at solving the related tasks simultaneously by exploiting shared knowledge to improve performance on individual tasks. Though numerous empirical results supported the notion that such shared knowledge among tasks plays a...

GrantExtractor: Accurate Grant Support Information Extraction from Biomedical Fulltext Based on Bi-LSTM-CRF.

IEEE/ACM transactions on computational biology and bioinformatics
Grant support (GS) in the MEDLINE database refers to funding agencies and contract numbers. It is important for funding organizations to track their funding outcomes from the GS information. As such, how to accurately and automatically extract fundin...

The Design and Development of a Personalized Leisure Time Physical Activity Application Based on Behavior Change Theories, End-User Perceptions, and Principles From Empirical Data Mining.

Frontiers in public health
Many adults do not reach the recommended physical activity (PA) guidelines, which can lead to serious health problems. A promising method to increase PA is the use of smartphone PA applications. However, despite the development and evaluation of mul...

Artificial intelligence in sepsis early prediction and diagnosis using unstructured data in healthcare.

Nature communications
Sepsis is a leading cause of death in hospitals. Early prediction and diagnosis of sepsis, which is critical in reducing mortality, is challenging as many of its signs and symptoms are similar to other less critical conditions. We develop an artifici...

AllergyMap: An Open Source Corpus of Allergy Mention Normalizations.

AMIA ... Annual Symposium proceedings. AMIA Symposium
Allergy mention normalization is challenging because of the wide range of possible allergens including medications, foods, plants, animals, and consumer products. This paper describes the process of mapping free-text allergy information from an elect...

EffiCare: Better Prognostic Models via Resource-Efficient Health Embeddings.

AMIA ... Annual Symposium proceedings. AMIA Symposium
Recent medical prognostic models adapted from high data-resource fields like language processing have quickly grown in complexity and size. However, since medical data typically constitute low data-resource settings, performances on tasks like clinic...