[pdf-embedder url=”http://muhathir.blog.uma.ac.id/wp-content/uploads/sites/385/2020/09/MDP.pdf”]
Category Archives: Reinforcement Learning
Case Study: RL in Classic Games
[pdf-embedder url=”http://muhathir.blog.uma.ac.id/wp-content/uploads/sites/385/2020/09/games.pdf”]
Exploration and ExploitationExploration and Exploitation
[pdf-embedder url=”http://muhathir.blog.uma.ac.id/wp-content/uploads/sites/385/2020/09/XX.pdf”]
Integrating Learning and Planning
[pdf-embedder url=”http://muhathir.blog.uma.ac.id/wp-content/uploads/sites/385/2020/09/dyna.pdf”]
Policy Gradient Methods
[pdf-embedder url=”http://muhathir.blog.uma.ac.id/wp-content/uploads/sites/385/2020/09/pg.pdf”]
Value Function Approximation
[pdf-embedder url=”http://muhathir.blog.uma.ac.id/wp-content/uploads/sites/385/2020/09/FA.pdf”]
Model-Free Control
[pdf-embedder url=”http://muhathir.blog.uma.ac.id/wp-content/uploads/sites/385/2020/09/control.pdf”]
Model-Free Prediction
[pdf-embedder url=”http://muhathir.blog.uma.ac.id/wp-content/uploads/sites/385/2020/09/MC-TD.pdf”]
Planning by Dynamic Programming
[pdf-embedder url=”http://muhathir.blog.uma.ac.id/wp-content/uploads/sites/385/2020/09/DP.pdf”]
Introduction to Reinforcement Learning
[pdf-embedder url=”http://muhathir.blog.uma.ac.id/wp-content/uploads/sites/385/2020/09/intro_RL.pdf”]