Learning Representations by Humans, for Humans

Abstract

We propose a new, complementary approach to interpretability, in whichmachines are not considered as experts whose role it is to suggest what shouldbe done and why, but rather as advisers. The objective of these models is tocommunicate to a human decision-maker not what to decide but how to decide. Inthis way, we propose that machine learning pipelines will be more readilyadopted, since they allow a decision-maker to retain agency. Specifically, wedevelop a framework for learning representations by humans, for humans, inwhich we learn representations of inputs ("advice") that are effective forhuman decision-making. Representation-generating models are trained withhumans-in-the-loop, implicitly incorporating the human decision-making model.We show that optimizing for human decision-making rather than accuracy iseffective in promoting good decisions in various classification tasks whileinherently maintaining a sense of interpretability.

Quick Read (beta)

loading the full paper ...