Paper Notes 1: Deep Recurrent Q-Learning for Partially Observable MDPs
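References:
Foundational papers: Playing Atari with Deep Reinforcement Learning; Human-level control through deep reinforcement learning.
Paper notes: Deep Recurrent Q-Learning for Partially Observable MDPs

My advisor recently asked me to look into the improvements earlier researchers have made to the DQN algorithm. Below is my own understanding of them. First, let me summarize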