Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation
ResearchTopicLearninggoal-directedbehaviorinenvironmentswithsparsefeedbackisamajorchallengeforreinforcementlearningalgorithms.这里有两个名词需要注意:goal-directedbehavior,sparsefeedback这篇文章提出了一种hierarchical-DQN(