diff --git a/.spellcheck-en-custom.txt b/.spellcheck-en-custom.txt index eb773134..1f9c3ab9 100644 --- a/.spellcheck-en-custom.txt +++ b/.spellcheck-en-custom.txt @@ -5,8 +5,8 @@ Backport backported CLI codebase -composable Composable +composable config configs customizable @@ -29,6 +29,7 @@ JSON Langchain's LLM LLMBlock +mbta MCQ Merlinite Mixtral diff --git a/docs/dataset_formats.md b/docs/dataset_formats.md index 0d6ae7cf..ea1bbbbd 100644 --- a/docs/dataset_formats.md +++ b/docs/dataset_formats.md @@ -48,9 +48,17 @@ The fields in a knowledge contribution are: Examples of these files used in CI are found [here](https://github.com/instructlab/instructlab/tree/main/scripts/test-data): -* [e2e-qna-freeform-skill](https://github.com/instructlab/instructlab/blob/main/scripts/test-data/e2e-qna-freeform-skill.yaml) -* [e2e-qna-grounded-skill](https://github.com/instructlab/instructlab/blob/main/scripts/test-data/e2e-qna-grounded-skill.yaml) -* [e2e-qna-knowledge](https://github.com/instructlab/instructlab/blob/main/scripts/test-data/e2e-qna-knowledge.yaml) +* Compositional skills + * Freeform + * [e2e-qna-freeform-palindrome-skill](https://github.com/instructlab/instructlab/blob/main/scripts/test-data/compositional_skills/freeform/e2e-qna-freeform-palindrome-skill.yaml) + * [e2e-qna-freeform-siblings-skill](https://github.com/instructlab/instructlab/blob/main/scripts/test-data/compositional_skills/freeform/e2e-qna-freeform-siblings-skill.yaml) + * Grounded + * [e2e-qna-grounded-employee-skill](https://github.com/instructlab/instructlab/blob/main/scripts/test-data/compositional_skills/grounded/e2e-qna-grounded-employee-skill.yaml) + * [e2e-qna-grounded-punctuation-skill](https://github.com/instructlab/instructlab/blob/main/scripts/test-data/compositional_skills/grounded/e2e-qna-grounded-punctuation.yaml) +* Knowledge + * [e2e-qna-knowledge-mbta](https://github.com/instructlab/instructlab/blob/main/scripts/test-data/knowledge/e2e-qna-knowledge-mbta.yaml) + * [e2e-qna-knowledge-phoenix](https://github.com/instructlab/instructlab/blob/main/scripts/test-data/knowledge/e2e-qna-knowledge-phoenix.yaml) + * [e2e-qna-knowledge](https://github.com/instructlab/instructlab/blob/main/scripts/test-data/knowledge/e2e-qna-knowledge.yaml) ### Pregenerated Dataset (Input)