Adaptive Neural Trees

Abstract

Deep neural networks and decision trees operate on largely separate paradigms; typically, the former performs representation learning with pre-specified architectures, while the latter is characterised by learning hierarchies over pre-specified features with data-driven architectures. We unite the two via adaptive neural trees (ANTs), a model that incorporates representation learning into edges, routing functions and leaf nodes of a decision tree, along with a backpropagation-based training algorithm that adaptively grows the architecture from primitive modules (e.g., convolutional layers). We demonstrate that, whilst achieving over 99% and 90% accuracy on MNIST and CIFAR-10 datasets, ANTs benefit from (i) faster inference via conditional computation, (ii) increased interpretability via hierarchical clustering, e.g., learning meaningful class associations, such as separating natural vs. man-made objects, and (iii) a mechanism to adapt the architecture to the size and complexity of the training dataset.
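
To make the structure described in the abstract more concrete, here is a toy sketch of a tree whose routing functions, edges and leaf nodes are all small learnable modules. It is an illustration under stated assumptions, not the authors' implementation: plain linear layers stand in for the convolutional modules, the weights are random, and the training and architecture-growth procedures are omitted. The names `Internal`, `Leaf`, `soft_predict` and `greedy_predict` are hypothetical.

```python
import math
import random


def linear(weights, bias, x):
    """Affine map: one output per row of `weights`."""
    return [sum(w * xi for w, xi in zip(row, x)) + b
            for row, b in zip(weights, bias)]


def relu(v):
    return [max(0.0, a) for a in v]


def softmax(v):
    m = max(v)  # subtract the max for numerical stability
    exps = [math.exp(a - m) for a in v]
    total = sum(exps)
    return [e / total for e in exps]


def sigmoid(a):
    return 1.0 / (1.0 + math.exp(-a))


def random_layer(rng, n_out, n_in):
    """Hypothetical stand-in for a primitive module (random weights, zero bias)."""
    weights = [[rng.uniform(-1, 1) for _ in range(n_in)] for _ in range(n_out)]
    return weights, [0.0] * n_out


class Leaf:
    """Leaf node: a small classifier over the features it receives."""

    def __init__(self, rng, n_in, n_classes):
        self.solver = random_layer(rng, n_classes, n_in)

    def soft_predict(self, x):
        return softmax(linear(*self.solver, x))

    greedy_predict = soft_predict  # a leaf has no routing decision to make


class Internal:
    """Internal node: a routing function plus one feature transform per edge."""

    def __init__(self, rng, n_in, n_hidden, left, right):
        self.router = random_layer(rng, 1, n_in)
        self.left_edge = random_layer(rng, n_hidden, n_in)
        self.right_edge = random_layer(rng, n_hidden, n_in)
        self.left, self.right = left, right

    def p_left(self, x):
        # Probability of sending the sample down the left branch.
        return sigmoid(linear(*self.router, x)[0])

    def soft_predict(self, x):
        # Evaluate both subtrees and mix them by the routing probability.
        p = self.p_left(x)
        l = self.left.soft_predict(relu(linear(*self.left_edge, x)))
        r = self.right.soft_predict(relu(linear(*self.right_edge, x)))
        return [p * a + (1.0 - p) * b for a, b in zip(l, r)]

    def greedy_predict(self, x):
        # Conditional computation: evaluate only the more probable branch.
        if self.p_left(x) >= 0.5:
            return self.left.greedy_predict(relu(linear(*self.left_edge, x)))
        return self.right.greedy_predict(relu(linear(*self.right_edge, x)))


rng = random.Random(0)
tree = Internal(rng, n_in=4, n_hidden=5,
                left=Leaf(rng, 5, 3), right=Leaf(rng, 5, 3))
x = [0.5, -0.2, 0.1, 0.9]
soft = tree.soft_predict(x)      # uses every path through the tree
greedy = tree.greedy_predict(x)  # uses a single root-to-leaf path
```

In this sketch `soft_predict` evaluates both subtrees, whereas `greedy_predict` follows only the more probable branch at each node, which is one way conditional computation can reduce inference cost. Both return a probability distribution over the three hypothetical classes.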
