Preserving Privacy in Large Language Models: A Survey on Current Threats and Solutions

Abstract

Large Language Models (LLMs) represent a significant advancement inartificial intelligence, finding applications across various domains. However,their reliance on massive internet-sourced datasets for training brings notableprivacy issues, which are exacerbated in critical domains (e.g., healthcare).Moreover, certain application-specific scenarios may require fine-tuning thesemodels on private data. This survey critically examines the privacy threatsassociated with LLMs, emphasizing the potential for these models to memorizeand inadvertently reveal sensitive information. We explore current threats byreviewing privacy attacks on LLMs and propose comprehensive solutions forintegrating privacy mechanisms throughout the entire learning pipeline. Thesesolutions range from anonymizing training datasets to implementing differentialprivacy during training or inference and machine unlearning after training. Ourcomprehensive review of existing literature highlights ongoing challenges,available tools, and future directions for preserving privacy in LLMs. Thiswork aims to guide the development of more secure and trustworthy AI systems byproviding a thorough understanding of privacy preservation methods and theireffectiveness in mitigating risks.

Quick Read (beta)

loading the full paper ...