Machine learning-driven bioavailability prediction in early-stage drug development: a KNIME-based computational workflow for digital health applications.

Journal: Xenobiotica; the fate of foreign compounds in biological systems

Published Date: May 28, 2025

Abstract

Bioavailability prediction remains a significant challenge in early-stage drug development, where conventional experimental approaches are time-consuming and resource-intensive. This study explores the application of machine learning techniques to enhance the efficiency of bioavailability prediction. By leveraging computational workflows within the KNIME Analytics Platform, we aim to automate bioavailability assessment and reduce dependence on costly experimental studies. A dataset comprising 475 drug-like compounds characterised by key molecular descriptors was analysed using multiple machine learning models, including Random Forest, Gradient Boosting, Decision Trees, k-Nearest Neighbours, and neural networks. Model performance was assessed through 5-fold cross-validation, with ensemble models outperforming linear and neural network-based approaches. Random Forest demonstrated the highest predictive performance (R² = 0.87, RMSE = 0.08). Feature importance analysis identified topological polar surface area and solubility as the most influential factors in bioavailability prediction. The findings underscore the potential of integrating open-source tools and machine learning methodologies in pharmaceutical research, improving workflow efficiency while adhering to FAIR (Findable, Accessible, Interoperable, and Reusable) data principles. This approach facilitates rapid and cost-effective bioavailability assessment, supporting AI-driven predictive modelling and digital health applications in drug development.
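
The evaluation scheme described in the abstract (5-fold cross-validation scored by R² and RMSE, followed by a feature importance ranking) can be illustrated with a minimal sketch. This is not the authors' KNIME workflow: it assumes scikit-learn as a stand-in, uses synthetic data rather than the study's 475 compounds, and the descriptor names and target formula are hypothetical, chosen only for illustration.

```python
import numpy as np
from sklearn.ensemble import RandomForestRegressor
from sklearn.linear_model import LinearRegression
from sklearn.model_selection import KFold, cross_validate

# Synthetic stand-in data: NOT the study's dataset.
rng = np.random.default_rng(0)
n_compounds = 475  # same size as the dataset quoted in the abstract
# Hypothetical descriptor names, assumed for illustration only.
feature_names = ["tpsa", "log_solubility", "mol_weight", "logp"]
X = rng.normal(size=(n_compounds, len(feature_names)))

# Hypothetical target: a bounded, nonlinear function of the first two
# descriptors plus noise. The last two descriptors carry no signal.
z = -1.2 * X[:, 0] + 0.9 * X[:, 1] + 0.3 * X[:, 0] * X[:, 1]
y = 1.0 / (1.0 + np.exp(-z)) + rng.normal(scale=0.05, size=n_compounds)

models = {
    "random_forest": RandomForestRegressor(n_estimators=200, random_state=0),
    "linear_regression": LinearRegression(),
}

# 5-fold cross-validation, scored by R^2 and RMSE.
cv = KFold(n_splits=5, shuffle=True, random_state=0)
results = {}
for name, model in models.items():
    scores = cross_validate(
        model, X, y, cv=cv,
        scoring=("r2", "neg_root_mean_squared_error"),
    )
    results[name] = {
        "r2": scores["test_r2"].mean(),
        # scikit-learn reports RMSE negated so that higher is better.
        "rmse": -scores["test_neg_root_mean_squared_error"].mean(),
    }

# Impurity-based feature importances from a forest fitted on all rows.
forest = RandomForestRegressor(n_estimators=200, random_state=0).fit(X, y)
importances = dict(zip(feature_names, forest.feature_importances_))

for name, metrics in results.items():
    print(f"{name}: R2={metrics['r2']:.2f}, RMSE={metrics['rmse']:.3f}")
for name, value in sorted(importances.items(), key=lambda kv: -kv[1]):
    print(f"{name}: {value:.3f}")
```

The numbers printed by this sketch reflect only the synthetic target and say nothing about the reported results. Impurity-based importances are one of several possible ranking methods; the abstract does not state which one the study used.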

Authors

Majdi Hammami
Laboratory of Medicinal and Aromatic Plants, Biotechnology Center of Borj-Cedria, Hammam-Lif, Tunisia.

Walid Yeddes
Laboratory of Aromatic and Medicinal Plants, Center of Biotechnology of Borj-Cédria, BP-901, 2050 Hammam-Lif, Tunisia.

Hamza Gadhoumi
Faculty of Sciences of Tunis, University of Tunis El Manar, El Manar, Tunis 2092, Tunis, Tunisia.

Raghda Yazidi
Laboratory of Medicinal and Aromatic Plants, Biotechnology Center of Borj-Cedria, Hammam-Lif, Tunisia.

Moufida Saidani Tounsi
Laboratory of Aromatic and Medicinal Plants, Center of Biotechnology of Borj-Cédria, BP-901, 2050 Hammam-Lif, Tunisia.

Kamel Msaada
Laboratory of Medicinal and Aromatic Plants, Biotechnology Center of Borj-Cedria, Hammam-Lif, Tunisia.


External Resources

PubMed ID: 40391875
