[Chapter 3] Reinforcement Learning (1) Model-Based Method
ReinforcementLearningFirstly,weassumethatalltheenvironmentsinthefollowingmaterialsareallmodeledbyMarkovdecisionprocesses.Aswehaveknown,anMDPmodelcanberepresentedbyatuple,therewardsarereturnedfromtheen