YAN-Xi1998

Interpretable Rl Summary

文章目录

Model Approximation Method
- Toward Interpretable Deep Reinforcement Learning with Linear Model U-Trees （2018，ECML/PKDD）
- - - - Intro
      - contributions
      - Model
      - Evaluation
- *Veriﬁable Reinforcement Learning via Policy Extraction（2018，NeurIPS）
- - - - Intro
      - Contributions
      - Model
      - Evaluation
- *Towards Interpretable-AI Policies Induction using Evolutionary Nonlinear Decision Trees for Discrete Action Systems （2019）
- - - - Intro
      - Model
      - Evaluation
- RL-LIM: Reinforcement Learning-based Locally Interpretable Modeling （2019）
- - - - Intro
      - Contributions
      - Model
- Modelling Agent Policies with Interpretable Imitation Learning （2020，TAILOR）
- - - - Intro
      - Model
      - Evaluation
- Improved Policy Extraction via Online Q-Value Distillation （2020，IJCNN）
- - - - Intro
      - Contributions
      - Model
      - Evaluation
- Neural-to-Tree Policy Distillation with Policy Improvement Criterion （2021）
- - - - Intro
      - Contributions
      - Model
      - Evaluation
- Can You Trust Your Autonomous Car? Interpretable and Veriﬁably Safe Reinforcement Learning（2021, IEEE Intelligent Vehicles Symposium (IV))
- - - - Intro
      - Contributions
      - Model
      - Evaluation
- *Explaining by Imitating: Understanding Decisions by Interpretable Policy Learning (ICLR, 2021)
- - - - Intro
      - Contributions
- *POETREE: Interpretable Policy Learning with Adaptive Decision Trees (2022)
- - - - Intro
      - **contributions**
      - Model
      - Evaluation
Self-interpretable Modeling Method
- - - *Distilling a Neural Network Into a Soft Decision Tree （2017，CEx@AIIA）
    - *Optimization Methods for Interpretable Differentiable Decision Trees in Reinforcement Learning（2019）
    - - Intro
      - Contributions
      - Model
      - Evaluation
    - Conservative Q-Improvement: Reinforcement Learning for an Interpretable Decision-Tree Policy （2019）
    - - Intro
      - Contributions
      - Model
      - Evaluation
Post-hoc Interpretation Method
- *Programmatically Interpretable Reinforcement Learning （2018, ICML）
- *Explainable Reinforcement Learning Through a Causal Lens （2019, AAAI）
- - - - Intro
      - Contributions
      - Evaluation
- *Generation of Policy-Level Explanations for Reinforcement Learning （2019, AAAI）
- - - - Intro
      - Contributions
      - Model
- TripleTree: A Versatile Interpretable Representation of Black Box Agents and their Environments（2020，AAAI）
- - - - Intro
      - **Key Features**
      - Model
      - Evaluation
- Feature-Based Interpretable Reinforcement Learning based on State-Transition Models （2021，IEEE）
- - - - Intro
      - Method Propertity
      - Model
      - Evaluation
- *EDGE: Explaining Deep Reinforcement Learning Policies （2021, NeuralPS）
- - - - Intro
      - Contributions
      - Model
      - Posterior Inference and Parameter Learning
      - Evaluation

*代表重要文章

Model Approximation Method

这种方法一般可以分类为策略蒸馏和模仿学习（这两类其实差不多)，都关注于如何用一个可解释的模型，去逼近以及解释DRL中的黑盒代表的映射函数。针对于模仿学习中的distributional shift 问题，其中VIPER利用了DAgger，而LMUT利用了进化算法，都能取得和DRL相似的performance 并且决策树的规模不会太大。

Toward Interpretable Deep Reinforcement Learning with Linear Model U-Trees （2018，ECML/PKDD）

Intro

这篇文章和我们做的方向一致。是第一篇讲强化学习和模仿学习结合的文章，通过LMUT模仿学习Q值。但并没有解决distrubution drift的问题并且在cartpole上的表现一般。

contributions

To our best knowledge, the ﬁrst work that extends interpretable mimic learning to Reinforcement Learning.
A novel on-line learning algorithm for LMUT, a novel model tree to mimic a DRL model.
We show how to interpret a DRL model by analyzing the knowledge stored in the tree structure of LMUT.

Model

文章提出了两种生成数据（用于监督学习）的方式：

**Experience Training：**记录所有在DRL训练时产生的状态和动作（这个优点是状态会比较符合环境中的分布，但缺点是很多动作并不是最优值因为会导致训练效果不好）。

**Active Play：**通过一个训练好的模型与环境交互生成数据（优点是动作都是最优，确实的状态的分布是偏移的）。

Linear Model U-Trees

可以用于回归问题，每个叶子结点是一个线性模型。LMUT同时也记录reward $r$ 和状态转移概率 $p$

整个训练分为两个阶段

Data Gathering Phase: 收集transitions用于拟合线性模型和分裂结点。

Node Splitting Phase：

LMUT扫描所有的叶子结点并且利用SGD更新线性模型，如果SGD的提升不明显，则分裂该结点。对一个batch的数据，每个叶子结点只考虑一次分裂。

SGD weight update

对每个动作都建立一棵LMUT。

$Q^{UT}(I_t| w_N, a_t) = ∑_{j=1}^J I_{tj}w_{Nj}+ w_{N0}$

**splitting criterion: **Variance test-选择一个分裂使得孩子结点上的Q值分布方差最小

Evaluation