text classification in tensorflow

Implementing multi models for Text Classification in TensorFlow.

Models are used to perform sentiment analysis on movie reviews from the Large Movie Review Dataset, which contains 25,000 highly polar movie reviews for training, and 25,000 for testing.
In this task, given a movie review, the model attempts to predict whether it is positive or negative. This is a binary classification task.

Preprocess

Load positive and negative sentences from the raw data files.
Clean the text data.
Pad each sentence to the maximum sentence length.
Word vector mapping, Each sentence becomes a bag of word vectors.

Models

1. FastText

text classification: Bag of Tricks for Efficient Text Classification, 2016.07

2. TextCNN

3. BiLSTM

Bidirectional Recurrent Neural Networks, 1997

4. TextRCNN

Learn more contextual information than conventional window-based neural networks.

Recurrent Convolutional Neural Networks for Text Classification, 2015

5. Hierarchical Attention Networks

Hierarchical Attention Networks for Document Classification, 2016

6. Seq2seq with Attention

Neural Machine Translation by Jointly Learning to Align and Translate, 2014.09

Performance

epochs = 10
batch_size = 64
max_learning_rate = 0.001
decay_rate = 0.8
decay_steps = 2000
l2_reg_lambda = 1e-3
embedding_trainable = False

Models	fastText	TextCNN	BiLSTM	TextRCNN	HierAtteNet	Seq2seqAttn	EntityNet	DynamicMemory	Transformer
Accuracy	0.834304	0.878276	0.884974	0.840128	0.889314

References

License

This project is licensed under the terms of the MIT license.

Name		Name	Last commit message	Last commit date
Latest commit History 53 Commits
1.FastText		1.FastText
2.TextCNN		2.TextCNN
3.BiLSTM		3.BiLSTM
4.TextRCNN		4.TextRCNN
5.HierarchicalAttentionNetworks		5.HierarchicalAttentionNetworks
6. Seq2seqAttention		6. Seq2seqAttention
conf		conf
imgs		imgs
input		input
utils		utils
.gitignore		.gitignore
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

text classification in tensorflow

Contents

Data and Preprocess

Data

Preprocess

Models

1. FastText

2. TextCNN

3. BiLSTM

4. TextRCNN

5. Hierarchical Attention Networks

6. Seq2seq with Attention

Performance

References

License

About

Releases

Packages

Languages

SunnyMarkLiu/tensorflow_text_classification

Folders and files

Latest commit

History

Repository files navigation

text classification in tensorflow

Contents

Data and Preprocess

Data

Preprocess

Models

1. FastText

2. TextCNN

3. BiLSTM

4. TextRCNN

5. Hierarchical Attention Networks

6. Seq2seq with Attention

Performance

References

License

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages