Include year_letter_clade & more in metadata #373

Open: wants to merge 1 commit into base: master

@corneliusroemer (Member) commented Dec 12, 2022

We currently output only composite clade names, e.g. `21L (Omicron)`.
Nextclade now also produces atomic clades, i.e. the Nextstrain-only and
WHO-only names: `21L` and `Omicron`.

Nextclade will at some point switch `clade` from the legacy value to the Nextstrain value.
This commit insulates ingest-metadata from that upcoming change
by using `clade_legacy` in place of `clade` in the column_map as
the source of `Nextstrain_clade`.
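
A minimal sketch of the remapping described above, assuming the column map is a plain dict from Nextclade output columns to metadata columns. The names `COLUMN_MAP` and `rename_columns` are hypothetical and are not taken from the actual script:

```python
# Hypothetical column map: Nextclade output column -> metadata column.
# Reading from `clade_legacy` keeps `Nextstrain_clade` stable even if
# Nextclade later changes what the plain `clade` column contains.
COLUMN_MAP = {
    "clade_legacy": "Nextstrain_clade",
}


def rename_columns(row: dict, column_map: dict = COLUMN_MAP) -> dict:
    """Return only the mapped columns of `row`, under their new names."""
    return {new: row[old] for old, new in column_map.items() if old in row}
```

With this shape, a row that contains both `clade` and `clade_legacy` yields a `Nextstrain_clade` taken from `clade_legacy` only, so a later change to `clade` would not affect it.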

At some point we may drop or deprecate the composite `Nextstrain_clade`, or keep it for backwards compatibility.

New scripts should use `year_letter_clade` from now on; it is the best unique clade key for the Nextstrain clade schema.

This commit also adds:

  • `year_letter_clade`, sourced from `clade_nextstrain`
  • `who_variant`, sourced from `clade_who`
  • `clade_display_name`, calculated as e.g. `21L (Omicron)` or `20C`
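
One way the three new columns could be derived from a Nextclade row, as a sketch only. The function names are hypothetical, and the rule that an empty `clade_who` yields the bare clade is an assumption inferred from the `21L (Omicron)`/`20C` examples, not the code in this commit:

```python
def clade_display_name(year_letter_clade: str, who_variant: str = "") -> str:
    """Combine the Nextstrain clade with the WHO variant, if there is one."""
    # Assumption: a missing or blank WHO variant means "show the clade alone".
    who_variant = (who_variant or "").strip()
    if who_variant:
        return f"{year_letter_clade} ({who_variant})"
    return year_letter_clade


def add_clade_columns(row: dict) -> dict:
    """Return a copy of `row` with the three new metadata columns added."""
    year_letter = row.get("clade_nextstrain", "")
    who = row.get("clade_who", "")
    return {
        **row,
        "year_letter_clade": year_letter,
        "who_variant": who,
        "clade_display_name": clade_display_name(year_letter, who),
    }
```

Keeping `year_letter_clade` and `who_variant` as separate columns means the display name can always be rebuilt from them, which fits the recommendation to use `year_letter_clade` as the key.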

Tested with `config/debug_sample_genbank.yaml`, where it worked.

Test runs:

  • GenBank 3864fece-2ed7-496a-88cf-603f47bcf708
  • GISAID a7ded173-ade8-43aa-91fd-0b24d273aa31
