learning-not-to-learn-tensorflow

This repo is a TensorFlow implementation of the following paper presented at CVPR 2019 -- Learning Not to Learn: Training Deep Neural Networks with Biased Data
Please notice that this is an unofficial implementation and may not fully reproduce the original results. If you are looking for the official PyTorch code from the authors, direct to github.com/feidfoe/learning-not-to-learn

Introduction

This code demonstrates unlearning of bias from a classification model, particularly under a modified version of MNIST dataset named the Colored-MNIST. Here, the training set is artificially injected with bias such that class categories have strong correlation to color, meanwhile the test set is not contaminated with such bias. In this setting, colors work as false discriminative signal against baseline training methods, thus leading to low test accuracy of the classifier.
The authors propose a novel training method such that the classifier model learns from the training data but unlearns from bias. The key ideas are: adoption of bias prediction model, and a regularizing loss function based on mutual information between feature embedding and bias. The bias predictor is trained to predict bias label from the feature extractor that is shared with the classifier. The classifer is trained adversarially against the bias predictor so that the feature extractor unlearns bias information.

Setup

Python 3
TensorFlow 2.1
Pillow

Download Dataset

Download Colored-MNIST dataset
Please refer to the paper for more information on the dataset.

Train model

Train model by learning-not-to-learn method.

python main.py --phase=train \
               --data_dir=./dataset/colored-mnist/{FILENAME}.npy \
               --max_epoch=100 \
               --batch_size=128 \
               --lr=0.001 \
               --loss_lambda=0.01

To train model by baseline method, add --train_baseline argument.

python main.py --phase=train \
               --data_dir=./dataset/colored-mnist/{FILENAME}.npy \
               --max_epoch=100 \
               --batch_size=128 \
               --lr=0.001 \
               --train_baseline

Once you begin, you can launch TensorBoard on ./logs/ directory to monitor training.

tensorboard --logdir=./logs/

Test model

python main.py --phase=test \
               --data_dir=./dataset/colored-mnist/{FILENAME}.npy \
               --batch_size=128

Reference

Byungju Kim, Hyunwoo Kim, Kyungsu Kim, Sungjin Kim, Junmo Kim, "Learning Not to Learn: Training Deep Neural Networks with Biased Data", in CVPR, 2019

Name		Name	Last commit message	Last commit date
Latest commit History 68 Commits
.gitattributes		.gitattributes
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
colored-mnist-example.png		colored-mnist-example.png
figure1.png		figure1.png
main.py		main.py
model.py		model.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

learning-not-to-learn-tensorflow

Introduction

Setup

Download Dataset

Train model

Test model

Reference

About

Releases

Packages

Languages

License

hushon/learning-not-to-learn-tensorflow

Folders and files

Latest commit

History

Repository files navigation

learning-not-to-learn-tensorflow

Introduction

Setup

Download Dataset

Train model

Test model

Reference

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages