[Chapter 4] Reinforcement Learning (2) Model-Free Method
Model-FreeRLMethodInmodel-basedmethod,weneedfirstlymodeltheenvironmentbylearning/estimatingthetransitionandrewardfunctions.However,inmodel-freemethod,weconsiderlearningthevalue/utilityfunctionsororact