This repo contains the code to attack both BERT-QA and BERT-classification models. Our BERT models are fine-tuned on the respective datasets using the Transformers library.
We put our BERT-classification code in the `bert_classification` directory. You can reproduce the results by first running:

```
cd bert_classification
python make_dataset.py
```

and then:

```
python main.py --dataset data_bert-base-uncased.pkl --task-name XXX --devi 3210 --lr 2e-5 --num-epochs 4 --batch-size 32 --dropout 0.1
```
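If you want a rough picture of what `main.py` does with these flags, here is a minimal fine-tuning sketch using the Transformers API. The pickle layout (a list of `(text, label)` pairs) and the two-label setup are assumptions for illustration only; the actual script is driven entirely by the command-line options shown above.

```python
# Minimal sketch of the fine-tuning step behind main.py (assumptions noted below).
import pickle
import torch
from torch.optim import AdamW
from transformers import BertTokenizer, BertForSequenceClassification

device = torch.device("cuda" if torch.cuda.is_available() else "cpu")

with open("data_bert-base-uncased.pkl", "rb") as f:
    train_examples = pickle.load(f)          # assumed: list of (text, label) pairs

tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")
model = BertForSequenceClassification.from_pretrained(
    "bert-base-uncased", num_labels=2).to(device)   # num_labels is an assumption
optimizer = AdamW(model.parameters(), lr=2e-5)      # same lr as the command above

model.train()
for epoch in range(4):                        # matches --num-epochs 4
    for text, label in train_examples:
        inputs = tokenizer(text, return_tensors="pt",
                           truncation=True, max_length=128).to(device)
        loss = model(**inputs, labels=torch.tensor([label]).to(device)).loss
        optimizer.zero_grad()
        loss.backward()
        optimizer.step()
```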
We put the BERT-QA code in the root directory. You can fine-tune BERT on the QA dataset (SQuAD v1.1) by running:

```
python run_squad.py --train_file train-v1.1.json --predict_file dev-v1.1.json --model_type bert --model_name_or_path bert-base-cased --output_dir official1 --do_train --do_eval --evaluate_during_training --per_gpu_train_batch_size 3 --per_gpu_eval_batch_size 4 --learning_rate 5e-5 --adam_epsilon 1e-6 --num_train_epochs 3 --save_steps 3000 --logging_steps 3000 --eval_all_checkpoints
```
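After training, the checkpoint saved to `official1` can be loaded with the standard Transformers question-answering classes. The snippet below is a small sketch of answer-span extraction on a made-up question/context pair; this is the kind of prediction that the QA attack perturbs.

```python
# Sketch: load the checkpoint written to official1/ and extract an answer span.
# The question and context here are hypothetical examples.
import torch
from transformers import BertTokenizer, BertForQuestionAnswering

tokenizer = BertTokenizer.from_pretrained("official1")
model = BertForQuestionAnswering.from_pretrained("official1")
model.eval()

question = "Who wrote the play?"
context = "The play was written by William Shakespeare in 1603."
inputs = tokenizer(question, context, return_tensors="pt")

with torch.no_grad():
    outputs = model(**inputs)

start = torch.argmax(outputs.start_logits)        # most likely start token
end = torch.argmax(outputs.end_logits) + 1        # most likely end token (exclusive)
answer = tokenizer.decode(inputs["input_ids"][0][start:end])
print(answer)
```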
See `attack_qa.py` and `attack_classifier.py` for more information. You can also try different attack scenarios (position-targeted attack vs. answer-targeted attack) and different attack methods (word-level vs. sentence-level) to see their effects.
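As a rough, generic illustration of the word-level setting (this is not the specific algorithm implemented in `attack_classifier.py`), a greedy substitution loop could look like the sketch below; the candidate-substitute table and the victim model here are purely hypothetical.

```python
# Generic word-level attack sketch: greedily replace words with candidate
# substitutes until the classifier's prediction flips. Illustration only.
import torch
from transformers import BertTokenizer, BertForSequenceClassification

tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")
model = BertForSequenceClassification.from_pretrained("bert-base-uncased")
model.eval()

def predict(text):
    """Return the predicted label index for a piece of text."""
    inputs = tokenizer(text, return_tensors="pt", truncation=True)
    with torch.no_grad():
        return model(**inputs).logits.argmax(dim=-1).item()

def word_level_attack(text, candidates):
    """candidates: dict mapping a word to possible substitutes (hypothetical)."""
    original_label = predict(text)
    words = text.split()
    for i, word in enumerate(words):
        for sub in candidates.get(word, []):
            perturbed = " ".join(words[:i] + [sub] + words[i + 1:])
            if predict(perturbed) != original_label:
                return perturbed          # prediction flipped: attack succeeded
    return None                           # no successful perturbation found

# Toy usage with a made-up substitution table:
adv = word_level_attack("the movie was great", {"great": ["decent", "fine"]})
print(adv)
```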