Machine learning reveals novel compound for the improved production of chitooligosaccharides in Escherichia coli.
Journal:
New biotechnology
PMID:
39827984
Abstract
In order to improve predictability of outcome and reduce costly rounds of trial-and-error, machine learning models have been of increasing importance in the field of synthetic biology. Besides applications in predicting genome annotation, process parameters and transcription initiation frequency, such models have also been of help for pathway optimization. The latter is a common strategy in metabolic engineering and improves the production of a desirable compound by optimizing enzyme expression levels of the production pathway. However, engineering steps might not lead to sufficient improvement, and bottlenecks may remain hidden among the hundreds of metabolic reactions occurring in a living cell, especially if the production pathway is highly interconnected with other parts of the cell's metabolism. Here, we use the synthesis of chitooligosaccharides (COS) to show that the production from such complex pathways can be improved by using machine learning models and feature importance analysis to find new compounds with an impact on COS production. We screened Escherichia coli libraries of engineered transcription regulators with an expected broad range of metabolic diversity and trained several machine learning models to predict COS production titers. Subsequent feature analysis led to the finding of iron, whose addition we could show improved COS production in vivo up to two-fold. Additionally, the analysis revealed important clues for future engineering steps.