Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Capture uploaded allele correctly for VCF input #1744

Open
wants to merge 11 commits into
base: main
Choose a base branch
from

Conversation

likhitha-surapaneni
Copy link
Contributor

@likhitha-surapaneni likhitha-surapaneni commented Aug 20, 2024

Ticket: ENSVAR-5858

  • Fix Unit tests
  • Review header and output columns

@likhitha-surapaneni likhitha-surapaneni marked this pull request as draft August 20, 2024 10:50
@likhitha-surapaneni likhitha-surapaneni marked this pull request as ready for review August 21, 2024 09:20
@nakib103 nakib103 self-requested a review August 21, 2024 09:27
modules/Bio/EnsEMBL/VEP/Parser.pm Outdated Show resolved Hide resolved
modules/Bio/EnsEMBL/VEP/Parser.pm Show resolved Hide resolved
@nuno-agostinho nuno-agostinho removed their request for review September 6, 2024 12:47
@likhitha-surapaneni likhitha-surapaneni changed the base branch from postreleasefix/113 to main December 4, 2024 15:30
my $alt_allele_count;

foreach my $alt(@alleles) {
if (length($ref_allele_string) != length($alt) or $original_allele_string =~ /^-/){
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@likhitha-surapaneni , it seems we are checking only if ref allele is - with $original_allele_string =~ /^-/ unlike for any allele like before. Was it intended?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

$original_allele_string contains both ref_allele and alt_alleles in the same string separated by "/"

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

true, but with ^ you are only checking - allele for ref and missing deletions?

modules/Bio/EnsEMBL/VEP/OutputFactory.pm Show resolved Hide resolved
Comment on lines +1681 to +1683
# print("Before rejoin\n");
# use Data::Dumper;
# print(Dumper($ib->buffer));
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

should we remove the comments?

Comment on lines +970 to +971
# Updating a flag to minimise multi-allelic variants in split_variants/rejoin_variants
$vf->{minimised} = 1;
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Multi-allelic is not getting minimised for default format. For example - 1 961320 961324 GCAGG/GCA/GCAG +

But in the output still getting MINIMISED=1, (without the PR they are also not minimised but there is no MINIMISED=1).

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants