GitHub - coranholmes/zh-NER-TF: A very simple BiLSTM-CRF model for English Named Entity Recognition (TensorFlow)

A simple BiLSTM-CRF model for Chinese Named Entity Recognition

This repository contains the code for building a very simple character-based BiLSTM-CRF sequence labelling model for the English Named Entity Recognition task. Its goal is to automatically recognize the different types of named entities that appear in labelled data.

This code works with Python 2.7.5 and TensorFlow 1.4.1. The repository https://github.com/guillaumegenthial/sequence_tagging was a great help while writing it.

model

This model is similar to the models presented in papers [1] and [2]. Its structure is shown in the following illustration:

For a Chinese sentence, each character is assigned a tag from the set {O, B-PER, I-PER, B-LOC, I-LOC, B-ORG, I-ORG}.
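To make the tagging scheme concrete, here is a minimal sketch (written for Python 3) of how a tag sequence over this set maps back to entities. The helper name `extract_entities` and the sample sentence are assumptions made for illustration; they are not part of this repository.

```python
def extract_entities(chars, tags):
    """Group characters into (entity_text, entity_type) spans from B-/I-/O tags."""
    entities = []
    current_chars, current_type = [], None
    for ch, tag in zip(chars, tags):
        if tag.startswith("B-"):
            # A B- tag always opens a new entity; close the open one first.
            if current_chars:
                entities.append(("".join(current_chars), current_type))
            current_chars, current_type = [ch], tag[2:]
        elif tag.startswith("I-") and current_type == tag[2:]:
            # An I- tag of the same type continues the open entity.
            current_chars.append(ch)
        else:
            # "O", or an I- tag that does not continue the open entity.
            if current_chars:
                entities.append(("".join(current_chars), current_type))
            current_chars, current_type = [], None
    if current_chars:
        entities.append(("".join(current_chars), current_type))
    return entities


# Hypothetical example sentence: one tag per character.
chars = list("王小明在北京")
tags = ["B-PER", "I-PER", "I-PER", "O", "B-LOC", "I-LOC"]
entities = extract_entities(chars, tags)
print(entities)
```

In this sketch an I- tag that does not continue an open entity of the same type is simply dropped; other conventions (for example, treating it as the start of a new entity) are equally possible.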

The first layer, the look-up layer, transforms each character from a one-hot vector into a character embedding. In this code the embedding matrix is initialized randomly, which is admittedly very simple. Language knowledge could be added later. For example, we could tokenize the sentence and use pre-trained word-level embeddings, so that every character in a token is initialized with that token's word embedding. In addition, the character embedding could be built by combining low-level features (see section 4.1 of paper [2] and section 3.3 of paper [3] for more details).
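The look-up step can be sketched without any framework: multiplying a one-hot vector by the embedding matrix is the same as selecting one row of that matrix. The sketch below (Python 3) uses only the standard library; the function names, the `<PAD>`/`<UNK>` entries, the embedding size, and the uniform range are all assumptions for illustration and may differ from what `data.py` and `model.py` actually do.

```python
import random

EMBEDDING_DIM = 4  # assumed small size, for readability only


def build_vocab(sentences):
    """Map every character to an integer id; ids 0 and 1 are reserved."""
    vocab = {"<PAD>": 0, "<UNK>": 1}
    for sentence in sentences:
        for ch in sentence:
            vocab.setdefault(ch, len(vocab))
    return vocab


def random_embeddings(vocab_size, dim, seed=0):
    """Randomly initialized embedding matrix, one row per vocabulary entry."""
    rng = random.Random(seed)
    return [[rng.uniform(-0.25, 0.25) for _ in range(dim)]
            for _ in range(vocab_size)]


def lookup(sentence, vocab, embeddings):
    """Replace each character by its embedding row (unknown chars map to <UNK>)."""
    ids = [vocab.get(ch, vocab["<UNK>"]) for ch in sentence]
    return [embeddings[i] for i in ids]


vocab = build_vocab(["北京", "上海"])
embeddings = random_embeddings(len(vocab), EMBEDDING_DIM)
sentence = "北京北"
vectors = lookup(sentence, vocab, embeddings)
print(len(vectors), len(vectors[0]))
```

In a trained model these rows are parameters updated by backpropagation; here they stay at their random initial values.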

The second layer, the BiLSTM layer, uses both past and future input information at every position and extracts features automatically.

The third layer, the CRF layer, assigns a tag to each character in a sentence. If we used a Softmax layer for labelling, we might get ill-formed tag sequences, because Softmax labels each position independently. We know that 'I-LOC' cannot follow 'B-PER', but Softmax does not. Compared to a Softmax layer, a CRF layer can use sentence-level tag information and model the transition behavior between each pair of tags.
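The difference between position-wise labelling and CRF-style decoding can be illustrated with a small Viterbi sketch (Python 3, standard library only). Everything here is an assumption for illustration: the emission scores are made up, the tag set is shortened, and the transition scores are hand-set to forbid invalid pairs, whereas in a real CRF layer they are learned during training.

```python
TAGS = ["O", "B-PER", "I-PER", "B-LOC", "I-LOC"]
FORBIDDEN = -1e4  # large penalty standing in for a learned low score


def transition_score(prev_tag, curr_tag):
    """Hand-set score: I-X may only follow B-X or I-X."""
    if curr_tag.startswith("I-") and prev_tag[2:] != curr_tag[2:]:
        return FORBIDDEN
    return 0.0


def greedy_decode(emissions, tags):
    """Pick the best tag at each position independently (Softmax-style)."""
    return [tags[max(range(len(tags)), key=row.__getitem__)]
            for row in emissions]


def viterbi_decode(emissions, tags):
    """Find the tag sequence maximizing emission plus transition scores."""
    n = len(tags)
    score = list(emissions[0])
    backpointers = []
    for t in range(1, len(emissions)):
        new_score, pointers = [], []
        for j in range(n):
            best_i = max(
                range(n),
                key=lambda i: score[i] + transition_score(tags[i], tags[j]),
            )
            new_score.append(score[best_i]
                             + transition_score(tags[best_i], tags[j])
                             + emissions[t][j])
            pointers.append(best_i)
        score = new_score
        backpointers.append(pointers)
    # Follow the back-pointers from the best final tag.
    path = [max(range(n), key=score.__getitem__)]
    for pointers in reversed(backpointers):
        path.append(pointers[path[-1]])
    path.reverse()
    return [tags[i] for i in path]


# Made-up scores for a three-character sentence; columns follow TAGS.
emissions = [
    [0.1, 2.0, 0.0, 0.5, 0.0],
    [0.2, 0.0, 1.0, 0.0, 1.2],
    [2.0, 0.0, 0.1, 0.0, 0.3],
]
greedy = greedy_decode(emissions, TAGS)
decoded = viterbi_decode(emissions, TAGS)
print(greedy)
print(decoded)
```

With these scores the independent decoder places 'I-LOC' directly after 'B-PER', while the Viterbi decoder prefers 'I-PER' at the second position because the transition penalty outweighs the small emission advantage of 'I-LOC'.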

data format

id\001PS Vita Little Big Planet / R1 (English)\001["B-brand", "O", "O", "O", "O", "O", "O", "O"]
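A line in this format could be parsed as sketched below (Python 3). The sketch assumes that the three fields are separated by the control character \001 (`"\x01"`), that the last field is a JSON list, and that the text is tokenized on whitespace, which is inferred from the eight tags matching the eight whitespace-separated tokens in the sample. The function name `parse_line` is hypothetical and is not taken from `data.py`.

```python
import json


def parse_line(line, sep="\x01"):
    """Split one data line into (record_id, tokens, tags)."""
    record_id, text, tags_json = line.rstrip("\n").split(sep)
    tokens = text.split()          # assumed whitespace tokenization
    tags = json.loads(tags_json)   # tag list is stored as JSON
    if len(tokens) != len(tags):
        raise ValueError("token/tag count mismatch: %d vs %d"
                         % (len(tokens), len(tags)))
    return record_id, tokens, tags


sample = ('id\x01PS Vita Little Big Planet / R1 (English)\x01'
          '["B-brand", "O", "O", "O", "O", "O", "O", "O"]')
record_id, tokens, tags = parse_line(sample)
for token, tag in zip(tokens, tags):
    print(token, tag)
```

Checking the token and tag counts up front catches malformed lines before they reach training.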

train

run python main.py

test

run python main.py --action test

references

[1] Bidirectional LSTM-CRF Models for Sequence Tagging

[2] Neural Architectures for Named Entity Recognition

[3] Character-Based LSTM-CRF with Radical-Level Features for Chinese Named Entity Recognition

[4] https://github.com/guillaumegenthial/sequence_tagging
