A scalable machine learning strategy for resource allocation in database.

Journal: Scientific reports

Published Date: Aug 20, 2025

Abstract

Modern cloud computing systems require intelligent resource allocation strategies that balance quality-of-service (QoS), operational costs, and energy sustainability. Existing deep Q-learning (DQN) methods suffer from sample inefficiency, centralization bottlenecks, and reactive decision-making during workload spikes. Transformer-based forecasting models such as Temporal Fusion Transformer (TFT) offer improved accuracy but introduce computational overhead, limiting real-time deployment. We propose LSTM-MARL-Ape-X, a novel framework integrating bidirectional Long Short-Term Memory (BiLSTM) for workload forecasting with Multi-Agent Reinforcement Learning (MARL) in a distributed Ape-X architecture. This approach enables proactive, decentralized, and scalable resource management through three innovations: high-accuracy forecasting using BiLSTM with feature-wise attention, variance-regularized credit assignment for stable multi-agent coordination, and faster convergence via adaptive prioritized replay. Experimental validation on real-world traces demonstrates 94.6% SLA compliance, 22% reduction in energy consumption, and linear scalability to over 5,000 nodes with sub-100 ms decision latency. The framework converges 3.2× faster than uniform sampling baselines and outperforms transformer-based models in both accuracy and inference speed. Unlike decoupled prediction-action frameworks, our method provides end-to-end optimization, enabling robust and sustainable cloud orchestration at scale.

Authors

Fady Nashat Manhary

Department of Computer Science, Faculty of Computers and Information, Assiut University, Assiut, Egypt.
Marghny H Mohamed

Computer and Information Technology, Egypt-Japan University of Science and Technology (E-JUST), New Borg El-Arab City, 21934, Alexandria, Egypt. marghny.mohamed@ejust.edu.eg.
Mamdouh Farouk

Department of Computer Science, Faculty of Computers and Information, Assiut University, Assiut, Egypt.

Keywords

No keywords available for this article.

External Resources

View on PubMed Access via DOI PubMed (40835668)

A scalable machine learning strategy for resource allocation in database.

Abstract

Authors

Keywords

External Resources

Popular Topics

Recent Journals

A scalable machine learning strategy for resource allocation in database.

Abstract

Authors

Keywords

External Resources

Don't Miss the Future of Medicine

Popular Topics

Recent Journals