IoT-Based Reinforcement Learning Using Probabilistic Model for Determining Extensive Exploration through Computational Intelligence for Next-Generation Techniques.

Journal: Computational intelligence and neuroscience
Published Date:

Abstract

Computing intelligence is built on several learning and optimization techniques. Incorporating cutting-edge learning techniques to balance the interaction between exploitation and exploration is therefore an inspiring field, especially when it is combined with IoT. The reinforcement learning techniques created in recent years have largely focused on incorporating deep learning technology to improve the generalization skills of the algorithm while ignoring the issue of detecting and taking full advantage of the dilemma. To increase the effectiveness of exploration, a deep reinforcement algorithm based on computational intelligence is proposed in this study, using intelligent sensors and the Bayesian approach. In addition, the technique for computing the posterior distribution of parameters in Bayesian linear regression is expanded to nonlinear models such as artificial neural networks. The Bayesian Bootstrap Deep -Network (BBDQN) algorithm is created by combining the bootstrapped DQN with the recommended computing technique. Finally, tests in two scenarios demonstrate that, when faced with severe exploration problems, BBDQN outperforms DQN and bootstrapped DQN in terms of exploration efficiency.

Authors

  • Pradeep Kumar Tiwari
    Manipal University Jaipur, Jaipur, India.
  • Pooja Singh
    School of Computing Science & Engineering, Department of CSE, Galgotias University, Greater Noida, UP, India.
  • Navaneetha Krishnan Rajagopal
    Business Studies, University of Technology and Applied Sciences, Salalah, Oman.
  • K Deepa
    Department of Computer Science and Engineering, M. Kumarasamy College of Engineering, Thalavapalayam, Karur, Tamilnadu, India.
  • Sampada Gulavani
    Department of MCA, Bharati Vidyapeeth (Deemed to Be University) Institute of Management, Kolhapur, Maharashtra, India.
  • Amit Verma
    University Centre for Research and Development, Department of Computer Science and Engineering, Chandigarh University Gharuan, Mohali, Punjab, India.
  • Yekula Prasanna Kumar
    Department of Mining Engineering, College of Engineering and Technology, Bule Hora University, Blue Hora 144, Oromia Region, Ethiopia.