Applying machine learning to classify table olives using bacterial metataxonomic data.
Journal:
NPJ science of food
Published Date:
Jul 4, 2025
Abstract
In recent years, metataxonomic analysis has been increasingly used to characterize microbial communities in fermented foods. Moreover, advances in bioinformatics and machine learning (ML) have expanded resources for analyzing these metataxonomic data. Particularly tree-based algorithms are valuable for their interpretability. This work compares the use of three tree-based ML algorithms-Classification and Regression Tree, Random Forest (RF), and Extreme Gradient Boosting- for the analysis of a database composed of 442 samples of 16S rRNA bacterial profiles obtained from table olives. Our findings show that ML techniques can effectively classify bacterial profiles based on olive processing type, cultivar, country of origin, and isolation matrix. The RF model achieved the highest accuracy, reaching 97% in the best cases, with a kappa coefficient above 0.8 for most categories. This approach holds potential applications in the table olive sector and in other food products, where the industrial application of ML techniques could enhance traceability, authenticity, and quality control.
Authors
Keywords
No keywords available for this article.