Skip to content

Latest commit

 

History

History
26 lines (22 loc) · 1.23 KB

README.md

File metadata and controls

26 lines (22 loc) · 1.23 KB

Wug InDirect Evidence Test (WIDET)

Repository Contents

  • WIDET data: /data
  • Additional train instances: /data/additional_train
    • Additional train instances that we mainly employed (<wug#123>): /data/additional_train/tag
      • Direct evidence: de.txt, Lexically indirect evidence: lexie.txt, Syntactically Indirect evidence: synie.txt
    • Additional train instances adding morphology into a tag (<wug#321>s): /data/additional_train/tag_w_morph
    • Additional train instances adding morphology into a tag (wug): /data/additional_train/wug
  • Evaluation instances:
    • Evaluation instances that we mainly employed: data/eval/tag
  • Tokenizer added <wug#\n>: /data/tokenizer/wikipedia_vocab9k
  • Pretrain data used in this work: /data/pretrain

License

WIDET is distributed under a CC-BY license.