Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Capture uploaded allele correctly for VCF input #1744

Open
wants to merge 14 commits into
base: postreleasefix/114
Choose a base branch
from

Conversation

likhitha-surapaneni
Copy link
Contributor

@likhitha-surapaneni likhitha-surapaneni commented Aug 20, 2024

Ticket: ENSVAR-5858

  • Fix Unit tests
  • Review header and output columns

@likhitha-surapaneni likhitha-surapaneni marked this pull request as draft August 20, 2024 10:50
@likhitha-surapaneni likhitha-surapaneni marked this pull request as ready for review August 21, 2024 09:20
@nakib103 nakib103 self-requested a review August 21, 2024 09:27
modules/Bio/EnsEMBL/VEP/Parser.pm Outdated Show resolved Hide resolved
modules/Bio/EnsEMBL/VEP/Parser.pm Show resolved Hide resolved
@nuno-agostinho nuno-agostinho removed their request for review September 6, 2024 12:47
@likhitha-surapaneni likhitha-surapaneni changed the base branch from postreleasefix/113 to main December 4, 2024 15:30
modules/Bio/EnsEMBL/VEP/Parser.pm Outdated Show resolved Hide resolved
modules/Bio/EnsEMBL/VEP/OutputFactory.pm Show resolved Hide resolved
t/OutputFactory.t Outdated Show resolved Hide resolved
Comment on lines +970 to +971
# Updating a flag to minimise multi-allelic variants in split_variants/rejoin_variants
$vf->{minimised} = 1;
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Multi-allelic is not getting minimised for default format. For example - 1 961320 961324 GCAGG/GCA/GCAG +

But in the output still getting MINIMISED=1, (without the PR they are also not minimised but there is no MINIMISED=1).

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Hi @nakib103 , can you please test this example with the latest commit. The allele is expected to be similar to when running --minimal

@likhitha-surapaneni likhitha-surapaneni changed the base branch from main to postreleasefix/114 December 17, 2024 13:44
@dglemos dglemos requested review from dglemos and nuno-agostinho and removed request for nuno-agostinho January 30, 2025 09:57
@dglemos dglemos self-assigned this Jan 30, 2025
@dglemos
Copy link
Contributor

dglemos commented Jan 31, 2025

The MINIMISED is missing from the header, so in the VCF output even when the variant is minimised this info is not in the file. The only outputs with this info are the default and json.

@dglemos
Copy link
Contributor

dglemos commented Jan 31, 2025

When the input is in VCF format, example:
1 961320 . G GCAGGCTCGGCC . . .

In the vep default output, the Uploaded_variation is represented with minimised alleles:
1_961321_-/CAGGCTCGGCC
In this scenario matching the alleles is more difficult.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants