TY - GEN
T1 - Protecting Privacy in Classifiers by Token Manipulation
AU - Harel, Re'em
AU - Elboher, Yair
AU - Pinter, Yuval
N1 - Publisher Copyright:
© 2024 Association for Computational Linguistics.
PY - 2024/1/1
Y1 - 2024/1/1
N2 - Using language models as a remote service entails sending private information to an untrusted provider. In addition, potential eavesdroppers can intercept the messages, thereby exposing the information. In this work, we explore the prospects of avoiding such data exposure at the level of text manipulation. We focus on text classification models, examining various token mapping and contextualized manipulation functions in order to see whether classifier accuracy may be maintained while keeping the original text unrecoverable. We find that although some token mapping functions are easy and straightforward to implement, they heavily influence performance on the downstream task, and via a sophisticated attacker can be reconstructed. In comparison, contextualized manipulation provides an improvement in performance.
AB - Using language models as a remote service entails sending private information to an untrusted provider. In addition, potential eavesdroppers can intercept the messages, thereby exposing the information. In this work, we explore the prospects of avoiding such data exposure at the level of text manipulation. We focus on text classification models, examining various token mapping and contextualized manipulation functions in order to see whether classifier accuracy may be maintained while keeping the original text unrecoverable. We find that although some token mapping functions are easy and straightforward to implement, they heavily influence performance on the downstream task, and via a sophisticated attacker can be reconstructed. In comparison, contextualized manipulation provides an improvement in performance.
UR - http://www.scopus.com/inward/record.url?scp=85204432308&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:85204432308
T3 - PrivateNLP 2024 - 5th Workshop on Privacy in Natural Language Processing, Proceedings of the Workshop
SP - 29
EP - 38
BT - PrivateNLP 2024 - 5th Workshop on Privacy in Natural Language Processing, Proceedings of the Workshop
A2 - Habernal, Ivan
A2 - Ghanavati, Sepideh
A2 - Ravichander, Abhilasha
A2 - Jain, Vijayanta
A2 - Thaine, Patricia
A2 - Igamberdiev, Timour
A2 - Mireshghallah, Niloofar
A2 - Feyisetan, Oluwaseyi
PB - Association for Computational Linguistics (ACL)
T2 - 5th Workshop on Privacy in Natural Language Processing, PrivateNLP 2024 - Co-located with ACL 2024
Y2 - 15 August 2024
ER -