Diffprivlib: The IBM Differential Privacy Library

Abstract

Since its conception in 2006, differential privacy has emerged as thede-facto standard in data privacy, owing to its robust mathematical guarantees,generalised applicability and rich body of literature. Over the years,researchers have studied differential privacy and its applicability to anever-widening field of topics. Mechanisms have been created to optimise theprocess of achieving differential privacy, for various data types andscenarios. Until this work however, all previous work on differential privacyhas been conducted on a ad-hoc basis, without a single, unifying codebase toimplement results. In this work, we present the IBM Differential Privacy Library, a generalpurpose, open source library for investigating, experimenting and developingdifferential privacy applications in the Python programming language. Thelibrary includes a host of mechanisms, the building blocks of differentialprivacy, alongside a number of applications to machine learning and other dataanalytics tasks. Simplicity and accessibility has been prioritised indeveloping the library, making it suitable to a wide audience of users, fromthose using the library for their first investigations in data privacy, to theprivacy experts looking to contribute their own models and mechanisms forothers to use.

Quick Read (beta)

loading the full paper ...