GitHub - justusmattern27/neighbour-mia

Membership Inference Attacks against Language Models via Neighbourhood Comparison

This is the code for the paper Membership Inference Attacks against Language Models via Neighbourhood Comparison.

Prerequisites:

To run our code, you need to have a model you want to attack (in path_to_attack_model)as well as a dataset consisting of training members and non members. in attack.py, examples for news, twitter and wikipedia data are provided. In the code, we assume that the first n lines of the text file are members and the n remaining ones are non-training-members.

How it works:

The code will use a BERT based model to generate neighbours and compute the likelihoods of neighbours and the original texts under the probability distribution of the provided, gpt2-based attack model. It will return these scores in a pickle file.

To parallelize the workload, you should provide a --proc-id as an argument

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
Readme.md		Readme.md
attack.py		attack.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Membership Inference Attacks against Language Models via Neighbourhood Comparison

Prerequisites:

How it works:

About

Releases

Packages

Languages

justusmattern27/neighbour-mia

Folders and files

Latest commit

History

Repository files navigation

Membership Inference Attacks against Language Models via Neighbourhood Comparison

Prerequisites:

How it works:

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages