-
Download generated files into nextclade data workflow repo:
scp -rC [email protected]:~/nextclade_data_workflows/sars-cov-2/output output
-
Plug them into nextclade.org advanced view.
-
Filter to new nodes and check that:
- clades are clean
- no big outliers
-
Check
tag.json
is up to date (ideally update inprofiles/tag.json
for posterity) -
Check
qc.json
does not regress (ideally update inprofiles/qc.json
for posterity) [beware, codons are 0 indexed] -
Potentially run
scripts/common_stops.py
andscripts/common_frameshifts.py
to add new stops/frameshifts that have become more common toqc.json
-
Download metadata to
data/metadata_raw.tsv
-
Run snakemake workflow with following commands/targets:
snakemake --profile=profiles/clades pre-processed/frameshifts.tsv -R select_frameshifts snakemake --profile=profiles/clades pre-processed/stops.tsv -R select_stops
-
Format most commons stops/fs into qc.json JSON format using
python3 scripts/common_stops.py python3 scripts/common_frameshifts.py
-
Manually check resul for plausibility and add to qc.json
-
Go to nextclade_data_workflow repo
-
Checkout branch, open PR to master
-
Copy output from workflow repo to data repo
cp -r output/sars-cov-2/references/MN908947/versions/ ../../nextclade_data/data/datasets/sars-cov-2/references/MN908947/versions
-
Update
changelog.md
-
Get Ivan to review
-
Merge into master
Follow release guidelines as outlined here: https://github.com/nextstrain/nextclade_data#dataset-release-process