MolLM: a unified language model for integrating biomedical text with 2D and 3D molecular representations.
Journal: Bioinformatics (Oxford, England)
Published Date: Jun 28, 2024
Abstract
MOTIVATION: Current deep learning models for the joint representation of molecules and text rely primarily on 1D or 2D molecular formats and neglect 3D structural information, which offers valuable physical insight. This narrow focus limits the models' versatility and adaptability across modalities. Conversely, the few studies that use explicit 3D representations tend to overlook textual data from the biomedical domain.