Natural Language Understanding with Privacy-Preserving BERT

Conference on Information and Knowledge Management (2021)

Abstract
Privacy preservation remains a key challenge in data mining and Natural Language Understanding (NLU). Previous research shows that the input text or even text embeddings can leak private information. This concern motivates our research on effective privacy preservation approaches for pretrained Language Models (LMs). We investigate the privacy and utility implications of applying dχ-privacy, a variant of Local Differential Privacy, to BERT fine-tuning in NLU applications. More importantly, we further propose privacy-adaptive LM pretraining methods and show that our approach can boost the utility of BERT dramatically while retaining the same level of privacy protection. We also quantify the level of privacy preservation and provide guidance on privacy configuration. Our experiments and findings lay the groundwork for future explorations of privacy-preserving NLU with pretrained LMs.
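For context, below is a minimal sketch of the standard dχ-privacy mechanism for text embeddings (Feyisetan et al., WSDM 2020), the kind of local mechanism the abstract refers to: each embedding vector is perturbed with noise whose density is proportional to exp(-η·||z||). The η value and the 768-dimensional embedding size are illustrative assumptions, not parameters taken from this paper.

import numpy as np

def dchi_privacy_perturb(embedding: np.ndarray, eta: float) -> np.ndarray:
    """Perturb one embedding vector with dχ-private noise.

    Noise is sampled by drawing a direction uniformly on the unit
    sphere and a magnitude from Gamma(d, 1/eta); together this gives
    a density proportional to exp(-eta * ||z||) over R^d.
    """
    d = embedding.shape[0]
    # Uniform random direction on the unit sphere.
    direction = np.random.normal(size=d)
    direction /= np.linalg.norm(direction)
    # Noise magnitude: Gamma with shape d and scale 1/eta.
    magnitude = np.random.gamma(shape=d, scale=1.0 / eta)
    return embedding + magnitude * direction

# Example: perturb a batch of BERT-sized (768-d) token embeddings.
# The random batch here is a stand-in for real token embeddings.
embeddings = np.random.randn(4, 768)
private = np.stack([dchi_privacy_perturb(e, eta=0.1) for e in embeddings])

Smaller η yields larger noise magnitudes and hence stronger privacy at the cost of utility; the paper's contribution is pretraining methods that recover utility under a fixed privacy level.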
Keywords
Differential privacy, Natural language understanding, Language model, Key (cryptography), Private information retrieval, Internet privacy, Computer science, Privacy preserving, Privacy protection