Innovative data techniques for centrifugal pump optimization with machine learning and AI model.
Journal:
PloS one
Published Date:
Jun 10, 2025
Abstract
In modern centrifugal pump machines (CPM), a data acquisition system encompassing software- hardware interfacing is essential for parameter recording. The quality of recorded data plays a crucial role and directly influences the data transformation phase in machine learning (ML) and deep learning (DL) models. The Dewesoft FFT DAQ system is designed to extract the high-quality data from the CPM based on sensor fusion technology. The data recorded from DAQ system undergoes thorough in-depth analysis, processing & transformation before being incorporated into machine learning (ML) or artificial intelligence models. This paper emphasizes the importance of data cleaning, pre-processing, and applying appropriate methodologies to transform raw data into a valuable resource that can be utilized by ML and AI models. Key techniques include Exploratory Data Analysis (EDA), Data Visualization, and Feature Engineering (FE), which collectively enhance data interpretability. Following these transformations, hypothesis testing validates the data's integrity, ensuring reliability for subsequent modeling. The validated data is employed to train machine learning classifiers and deep learning algorithms, targeting a 27.25% enhancement in operational efficiency based on F1 score. Additionally, it decreases model training time by 180 seconds, facilitating predictive maintenance of critical performance metrics and minimizing downtime. The assessment of model performance relies on Precision, Recall, and F1 score. This approach leverages recent advancements in data science to derive actionable insights from CPM data, facilitating more informed decision-making and optimization of pump operations.