A PLM-based Multi-choice QA model for the Classical Chinese Poem dataset, BAAI CUGE benchmark. For PKU course No.04812160 (Computational Linguistics).

ZacharyChenpk/CL-Course

CL-Course

This project is based on the backbone repository: https://github.com/TobiasLee/CCPM-baseline.

Dependencies

pip install -r requirements.txt

Getting Started

  1. (Optional) Edit the module import on line 52 of main.py to choose which model file is used, for example:
from model_allcat_tag import unwrapped_preprocess_function, MyModule, DataCollatorForMultipleChoice, MyTokenizer, MyOptimizer
  2. (Optional) Adjust the parameters and the available CUDA device in run_model.sh.

chmod +x run_model.sh
./run_model.sh
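
As a rough aid for the optional import edit, the sketch below builds the import line for a given model variant. The variant-to-module mapping is taken from the Files Structure listing; the assumption that every model module exports the same five names as model_allcat_tag is not confirmed by this README, and `import_line` is a hypothetical helper that is not part of the repository.

```python
# Variant names and module files as given in the Files Structure listing.
# PLM-Match is left out because it is started from main_sim.py, not main.py.
VARIANT_TO_MODULE = {
    "PLM-CLS": "model",
    "PLM-All": "model_allcat",
    "PLM-All-Tag": "model_allcat_tag",
}

# Names imported in the README example for main.py line 52.
# Assumption: the other model modules export the same names.
IMPORTED_NAMES = [
    "unwrapped_preprocess_function",
    "MyModule",
    "DataCollatorForMultipleChoice",
    "MyTokenizer",
    "MyOptimizer",
]


def import_line(variant: str) -> str:
    """Return the import statement to place in main.py for `variant`."""
    module = VARIANT_TO_MODULE[variant]  # raises KeyError for unknown variants
    return f"from {module} import {', '.join(IMPORTED_NAMES)}"


if __name__ == "__main__":
    print(import_line("PLM-All-Tag"))
```

Pasting the printed line over the existing import in main.py would switch the model file, provided the assumption about shared names holds.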

Files Structure

zip
├── CCPM-data ### Original and split data
│   ├── split_test.jsonl
│   ├── split_valid.jsonl
│   ├── test_public.jsonl
│   ├── train.jsonl
│   └── valid.jsonl
├── data_split.py ### code for splitting valid.jsonl
├── main.py ### entry of all models except PLM-Match
├── main_sim.py ### entry of PLM-Match model
├── model_allcat.py ### PLM-All
├── model_allcat_tag.py ### PLM-All-Tag
├── model.py ### PLM-CLS
├── model_sim.py ### PLM-Match
├── README.md
├── requirements.txt
└── run_model.sh
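
The listing says data_split.py splits valid.jsonl, and the CCPM-data folder contains split_valid.jsonl and split_test.jsonl. The following is a minimal sketch of how such a split could be done; it is not the repository's actual script. The 50/50 ratio, the shuffling, the fixed seed, and the function name `split_jsonl` are all assumptions made for illustration.

```python
import json
import random
from pathlib import Path


def split_jsonl(src, out_first, out_second, ratio=0.5, seed=0):
    """Shuffle the records of a JSONL file and write them to two files.

    `ratio` is the fraction of records written to `out_first` (assumed value).
    Returns the number of records written to each output file.
    """
    text = Path(src).read_text(encoding="utf-8")
    lines = [ln for ln in text.splitlines() if ln.strip()]
    for ln in lines:
        json.loads(ln)  # fail early if a line is not valid JSON

    random.Random(seed).shuffle(lines)  # deterministic shuffle for a given seed
    cut = int(len(lines) * ratio)

    Path(out_first).write_text(
        "".join(ln + "\n" for ln in lines[:cut]), encoding="utf-8"
    )
    Path(out_second).write_text(
        "".join(ln + "\n" for ln in lines[cut:]), encoding="utf-8"
    )
    return cut, len(lines) - cut


if __name__ == "__main__":
    # Paths follow the Files Structure listing; which half serves as the new
    # validation set and which as the test set is an assumption.
    n_valid, n_test = split_jsonl(
        "CCPM-data/valid.jsonl",
        "CCPM-data/split_valid.jsonl",
        "CCPM-data/split_test.jsonl",
    )
    print(n_valid, n_test)
```

Keeping the seed fixed makes the split reproducible, which matters when results on the held-out half are compared across model variants.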