Reinforcement Learning and Modeling Techniques: A Review
Hindreen Rashid Abdulqadir & Adnan Mohsin Abdulazeez
The Reinforcement learning (RL) algorithms solve a wide range of problems we faced. The topic of RL has achieved a new, complete standard of public opinion. High difficulty in large-scale real-world implementations is the effective use of large data sets previously obtained in augmented learning algorithms. Q-learning (QL), by learning a conservative Q function that allows a policy to be below the predicted value of the Q function, is introduced by us, which aims to circumvent these restrictions. We revealed technical reinforcement learning in this study. In principle, we demonstrate that QL creates a lower relation to current policy importance and that this can be correlated with guarantees of political learning theoretical change. In reality, QL strengthens the benchmark objective with a simple, standardized Q value which, in addition to existing Q-learning and essential applications, is quickly applied. The findings indicate that all algorithms are needed to learn how to play successfully. In comparison, all dual Q-learning variables have a significantly higher score compared with Q-learning, and the incremental reward function shows no improved effects than the normal reward function. We present an attack mechanism that uses the portability of competing tests to execute policy incentives and to prove their usefulness and consequences by means of a pilot study of a play learning scenario.
Keywords: Machine learning, Reinforcement learning, Modelling – Technique, Q- learning.
|Title:||Reinforcement Learning and Modeling Techniques: A Review|
|Author:||Hindreen Rashid Abdulqadir & Adnan Mohsin Abdulazeez|
|Journal Name:||International Journal of Science and Business|
|ISSN:||ISSN 2520-4750 (Online), ISSN 2521-3040 (Print)|
|Date of Publication:||16/02/2021|
|Paper Type:||Literature Review|
Cite This Article:
Abdulqadir, H. R. & Abdulazeez, A. M. (2021). Reinforcement Learning and Modeling Techniques: A Review. International Journal of Science and Business, 5(3), 174-189. doi: https://doi.org/10.5281/zenodo.4542638
Retrieved from https://ijsab.com/wp-content/uploads/696.pdf
About Author (s)
Hindreen Rashid Abdulqadir (corresponding author), Information Technology Department, Akre Technical College of Informatics, Duhok Polytechnic University, Duhok Kurdistan Region, Iraq. Email: Hindreen.firstname.lastname@example.org
Professor Adnan Mohsin Abdulazeez, Duhok Polytechnic University, Duhok, Kurdistan Region, Iraq. E-mail: email@example.com