AIMC Topic: Cheminformatics

Clear Filters Showing 1 to 10 of 76 articles

HiRXN: Hierarchical Attention-Based Representation Learning for Chemical Reaction.

Journal of chemical information and modeling
In recent years, natural language processing (NLP) techniques, including large language modeling (LLM), have contributed significantly to advancements in organic chemistry research. Chemical reaction representations provide a link between NLP models ...

Molecular tweaking by generative cheminformatics and ligand-protein structures for rational drug discovery.

Bioorganic chemistry
The purpose of this review is two-fold: (1) to summarize artificial intelligence and machine learning approaches and document the role of ligand-protein structures in directing drug discovery; (2) to present examples of drugs from the recent literatu...

Chemoinformatics for corrosion science: Data-driven modeling of corrosion inhibition by organic molecules.

Molecular informatics
This paper reviews the application of machine learning to the inhibition of corrosion by organic molecules. The methodologies considered include quantitative structure-property relationships (QSPR) and related data-driven approaches. The characterist...

Deciphering Molecular Embeddings with Centered Kernel Alignment.

Journal of chemical information and modeling
Analyzing machine learning models, especially nonlinear ones, poses significant challenges. In this context, centered kernel alignment (CKA) has emerged as a promising model analysis tool that assesses the similarity between two embeddings. CKA's eff...

Research Progresses and Applications of Knowledge Graph Embedding Technique in Chemistry.

Journal of chemical information and modeling
A knowledge graph (KG) is a technique for modeling entities and their interrelations. Knowledge graph embedding (KGE) translates these entities and relationships into a continuous vector space to facilitate dense and efficient representations. In the...

MolPipeline: A Python Package for Processing Molecules with RDKit in Scikit-learn.

Journal of chemical information and modeling
The open-source package scikit-learn provides various machine learning algorithms and data processing tools, including the Pipeline class, which allows users to prepend custom data transformation steps to the machine learning model. We introduce the ...

Molecular similarity in chemical informatics and predictive toxicity modeling: from quantitative read-across (q-RA) to quantitative read-across structure-activity relationship (q-RASAR) with the application of machine learning.

Critical reviews in toxicology
This article aims to provide a comprehensive critical, yet readable, review of general interest to the chemistry community on molecular similarity as applied to chemical informatics and predictive modeling with a special focus on read-across (RA) and...

Cheminformatics analysis of indoleamine and tryptophan 2,3-dioxygenase inhibitors: A descriptor and fingerprint based machine learning approach to disclose selectivity measures.

Computers in biology and medicine
Indoleamine 2,3-dioxygenase (IDO) and tryptophan 2,3-dioxygenase (TDO) are attractive drug targets for cancer immunotherapy. After disappointing results of the epacadostat as a selective IDO inhibitor in phase III clinical trials, there is much inter...

Versatile Deep Learning Pipeline for Transferable Chemical Data Extraction.

Journal of chemical information and modeling
Chemical information disseminated in scientific documents offers an untapped potential for deep learning-assisted insights and breakthroughs. Automated extraction efforts have shifted from resource-intensive manual extraction toward applying machine ...

OpenChemIE: An Information Extraction Toolkit for Chemistry Literature.

Journal of chemical information and modeling
Information extraction from chemistry literature is vital for constructing up-to-date reaction databases for data-driven chemistry. Complete extraction requires combining information across text, tables, and figures, whereas prior work has mainly inv...