Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
data		data
lda_model		lda_model
utils		utils
README.md		README.md
mgr_clustering.ipynb		mgr_clustering.ipynb
mgr_lda.ipynb		mgr_lda.ipynb
nn-char.py		nn-char.py
nn-vect.py		nn-vect.py
requirements.txt		requirements.txt
svm.py		svm.py
train_bert.py		train_bert.py
vae.py		vae.py

Repository files navigation

BERT on Celtic dataset

Usage: Train BERT on Celtic

Here is an example of using this package.

Install requirements

pip install -r /path/to/requirements.txt

Train RoBERTa-large model

python train_bert.py --model-name roberta-large --dataset-path ./data/1preproc.tsv

The plots of the training:
1. roberta-large, uncased (data/1preproc.tsv) dataset:
  
  https://wandb.ai/wsosnowski/huggingface/reports/Celtic-Bert-base--VmlldzoyMDg1OTkz?accessToken=9se65uycy7vrssfnxn9ag2uxvf0wvcg1d47o5xszuswa2zlutxt34yvq15a57ior
2. roberta-large, cased (data/final.tsv) dataset:
  
  https://wandb.ai/wsosnowski/huggingface/reports/BERT-on-cased-celtic-dataset---VmlldzoyMDg2MDcz?accessToken=6p31gbb5i1djqgolzjzodbthplsblwfrrolz19kzan2pnp2uickh30uwh81u8zuf

Autors

Olha Dovbnia [email protected]
Witold Sosnowski [email protected]
Anna Wróblewska [email protected]

About

No description, website, or topics provided.

Custom properties

Report repository

Releases

No releases published

Packages

No packages published

Languages