Predictive modeling of hemoglobin refractive index using Gaussian process regression with interpretability through partial dependence plots.

Journal: PloS one
Published Date:

Abstract

Accurately predicting the refractive index of hemoglobin across various wavelengths and concentrations is critical for advancing optical diagnostic techniques in biological and clinical applications. This study introduces a predictive model based on Gaussian Process Regression (GPR) to estimate the refractive index of hemoglobin in both oxygenated and deoxygenated states, covering wavelengths from 400 to 700 nm and concentrations ranging from 0 to 140 g/L. The GPR model effectively captures non-linear relationships, achieving high prediction accuracy with R2 values of 99.4% for the training dataset and 99.3% for the testing dataset. An independent external dataset was used to validate the model's robustness further, yielding an R2 value of 92.80%, RMSE of 0.0042, and MSE of 1.77 × 10 ⁻ ⁵, demonstrating the model's strong generalizability. To enhance interpretability, Partial Dependence Plots (PDPs) were employed to visualize the influence of wavelength and concentration on refractive index predictions, offering clear insights into hemoglobin's optical behavior. The model's ability to provide accurate and interpretable predictions has significant implications for improving the reliability of biophotonic diagnostic tools, such as optical coherence tomography and reflectance spectroscopy, in clinical settings. By combining machine learning with interpretability techniques, this study advances the understanding of hemoglobin's optical properties and sets a benchmark for predictive modeling in biomedical optics, paving the way for more precise and dependable diagnostic applications.

Authors

  • Mustfa Faisal Alkhanani
    Biology Department, College of Science, University of Hafr Al Batin, Hafr Al Batin, Saudi Arabia.