Deep Reinforcement Learning (DRL) has recently achieved significant advances in various domains. However, explaining the policy of RL agents remains an open problem due to several factors, one being the complexity of explaining neural network decisions. Recently, a group of works has used decision-tree-based models to learn explainable policies. Soft decision trees (SDTs) and discretized differentiable decision trees (DDTs) have been demonstrated to achieve good performance while retaining the benefit of explainable policies. In this work, we further improve the results for tree-based explainable RL in both performance and explainability. Our proposal, Cascading Decision Trees (CDTs), applies representation learning on the decision path to allow richer expressivity. Empirical results show that in both situations, where CDTs are used as policy function approximators or as imitation learners to explain black-box policies, CDTs can achieve better performance with more succinct and explainable models than SDTs. As a second contribution, our study reveals limitations of explaining black-box policies via imitation learning with tree-based explainable models, due to its inherent instability.