Integrating Multiple Data Sources With Interactions in Multi-Omics Using Cooperative Learning.
Journal:
Statistics in medicine
Published Date:
Jun 1, 2025
Abstract
Modeling with multiomics data presents multiple challenges, such as the high dimensionality of the problem ( ), the presence of interactions between features, and the need for integration between multiple data sources. We establish an interaction model that allows for the inclusion of multiple sources of data from the integration of two existing methods, pliable lasso and cooperative learning. The integrated model is tested both on simulation studies and on real multiomics datasets for predicting labor onset and cancer treatment response. The results show that the model is effective in modeling multisource data in various scenarios where interactions are present, both in terms of prediction performance and selection of relevant variables.