-
Notifications
You must be signed in to change notification settings - Fork 2
Open
Description
Current State
During coverage aggregation, bgzipped results are concatenated together as the process runs.
| template 'mlr_agg.sh' |
bravo_data_prep/workflows/coverage/templates/mlr_agg.sh
Lines 28 to 31 in 3349238
| # Aggregate depths from depth file chunks | |
| mlr -N --tsv 'nest' --ivar ";" -f 3 \${PIPES[@]} |\ | |
| sort --numeric-sort --key=2 |\ | |
| bgzip >> ${result_file} |
Tabix sometimes fails to produce a valid index for concatenated summary data occasionally. An index gets written, has the contig name, but can't be used to get data by region. tabix file.tsv.gz chr22:10000100-10000200 | wc -l gets 0.
Work around currently involves re-writing entire bgzipped file.
Action items
- Generate small reproducible example of tabix not producing an index of data.
- Sort out solution that is more efficient that re-writing the entire gzipped file.
Reference
Suspected to be related to:
Metadata
Metadata
Assignees
Labels
No labels