
How to use examples

prepare your model

Modify the corresponding parts of cpu_example.py (or gpu_example.py), depending on how you want to load your model:

  1. I want to use a PyTorch huggingface/transformers model.
model_id = "bert-base-uncased"
  2. I want to use a PyTorch saved model.

We can load a model from the directory of a pre-trained model.

model_id = "your_saved_model"  # directory of the saved model
  3. I want to use a TensorFlow checkpoint model. First convert the checkpoint into an npz file:
cd /workspace
python tools/convert_huggingface_bert_tf_to_npz.py bert-base-uncased /workspace/bert_tf.npz

Then update the corresponding line in bert_example.py (a minimal usage sketch follows this list):

tt_model = turbo_transformers.BertModel.from_npz(
    '/workspace/bert_tf.npz', cfg)
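
The snippet below is a minimal, hedged sketch of how the converted model might be exercised end to end. It is not the contents of bert_example.py: it assumes that cfg is a transformers.BertConfig, that the tokenizer comes from huggingface/transformers, and that the turbo model can be called with a tensor of input ids; check bert_example.py for the authoritative call signature.

import torch
import transformers
import turbo_transformers

# Build the config the npz weights were converted from (assumption: bert-base-uncased).
cfg = transformers.BertConfig.from_pretrained("bert-base-uncased")
tt_model = turbo_transformers.BertModel.from_npz('/workspace/bert_tf.npz', cfg)

# Tokenize a sample sentence with the matching huggingface tokenizer.
tokenizer = transformers.BertTokenizer.from_pretrained("bert-base-uncased")
input_ids = torch.tensor(
    [tokenizer.encode("turbo transformers example", add_special_tokens=True)])

# Assumption: the model is callable on input ids; the exact return values
# (sequence output, pooled output, ...) depend on the turbo_transformers version.
output = tt_model(input_ids)
print(output)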

run examples

python bert_example.py

Attention: If you want to use Turbo with the C++ backend instead of onnxruntime, note that directly linking against the MKL bundled with a conda-installed PyTorch leads to poor performance in our hand-crafted C++ version. You should install an official MKL and set the MKL path in CMakeLists.txt. As a less elegant alternative, you can uninstall OpenNMT-py and downgrade torch to 1.1.0.

I have prepared a BERT-only runtime image on Docker Hub:

thufeifeibear/turbo_transformers_cpu:bert_only_v0.1

How to customize your post-processing layers after the BERT encoder

Chinese Version

TurboTransformers accelerates the embedding + BERT encoder + pooler, which are the major hotspots, so users may have to customize the less time-consuming post-processing layers according to their own needs. We take a classification task as an example; it requires a Linear layer after the pooler.

  1. First of all, we need to prepare a BERT classification model trained with huggingface (it can be any kind of huggingface model, such as BertPreTrainedModel or BertForSequenceClassification; here we take BertForSequenceClassification as an example). The sequence classification model used in the code example can be downloaded from Baidu Cloud (link: https://pan.baidu.com/s/1WzMIQ2I3ncXb9aPLTJ7QNQ password: hj18); place it in the same directory as the example file.
  2. Write a new class to replace the original huggingface implementation. This class needs to implement four functions: __init__, __call__, from_torch, and from_pretrained. For the implementation code and a description of the class, refer to bert_for_sequence_classification_example.py; a hedged sketch of its shape follows this list.
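
For orientation, here is a minimal sketch of such a class. It is not the code from bert_for_sequence_classification_example.py: it assumes a turbo_transformers.BertModel.from_torch constructor (mirroring the from_npz constructor used above) and assumes the turbo model returns the sequence output and the pooled output when called; verify both points against the example file and your installed version.

import transformers
import turbo_transformers

class BertForSequenceClassification:
    # Post-processing wrapper: accelerated BERT encoder + plain PyTorch Linear classifier.

    def __init__(self, bert, classifier):
        self.bert = bert              # turbo_transformers.BertModel
        self.classifier = classifier  # torch.nn.Linear taken from the torch model

    def __call__(self, input_ids):
        # Assumption: the turbo model returns (sequence_output, pooled_output).
        _, pooled_output = self.bert(input_ids)
        return self.classifier(pooled_output)

    @staticmethod
    def from_torch(torch_model):
        # Reuse the accelerated encoder; keep the cheap classification head in PyTorch.
        bert = turbo_transformers.BertModel.from_torch(torch_model.bert)
        return BertForSequenceClassification(bert, torch_model.classifier)

    @staticmethod
    def from_pretrained(model_id_or_path):
        torch_model = transformers.BertForSequenceClassification.from_pretrained(
            model_id_or_path)
        torch_model.eval()
        return BertForSequenceClassification.from_torch(torch_model)

A model prepared this way can then be loaded with BertForSequenceClassification.from_pretrained("your_saved_model") and called on tokenized input ids, much like the huggingface original.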