Delphi: Towards Machine Ethics and Norms

  • 2021-10-14 17:38:12
  • Liwei Jiang, Jena D. Hwang, Chandra Bhagavatula, Ronan Le Bras, Maxwell Forbes, Jon Borchardt, Jenny Liang, Oren Etzioni, Maarten Sap, Yejin Choi
  • 222

Abstract

What would it take to teach a machine to behave ethically? While broadethical rules may seem straightforward to state ("thou shalt not kill"),applying such rules to real-world situations is far more complex. For example,while "helping a friend" is generally a good thing to do, "helping a friendspread fake news" is not. We identify four underlying challenges towardsmachine ethics and norms: (1) an understanding of moral precepts and socialnorms; (2) the ability to perceive real-world situations visually or by readingnatural language descriptions; (3) commonsense reasoning to anticipate theoutcome of alternative actions in different contexts; (4) most importantly, theability to make ethical judgments given the interplay between competing valuesand their grounding in different contexts (e.g., the right to freedom ofexpression vs. preventing the spread of fake news). Our paper begins to address these questions within the deep learningparadigm. Our prototype model, Delphi, demonstrates strong promise oflanguage-based commonsense moral reasoning, with up to 92.1% accuracy vetted byhumans. This is in stark contrast to the zero-shot performance of GPT-3 of52.3%, which suggests that massive scale alone does not endow pre-trainedneural language models with human values. Thus, we present Commonsense NormBank, a moral textbook customized for machines, which compiles 1.7M examples ofpeople's ethical judgments on a broad spectrum of everyday situations. Inaddition to the new resources and baseline performances for future research,our study provides new insights that lead to several important open researchquestions: differentiating between universal human values and personal values,modeling different moral frameworks, and explainable, consistent approaches tomachine ethics.

 

Quick Read (beta)

loading the full paper ...