Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Split-read SV calling improvements #15

Merged
merged 49 commits into from
Oct 25, 2024
Merged

Split-read SV calling improvements #15

merged 49 commits into from
Oct 25, 2024

Conversation

jonperdomo
Copy link
Collaborator

@jonperdomo jonperdomo commented Apr 12, 2024

Improvements to split-read SV calling:

  • Read sequences are not aligned multiple times. We choose the best sequence based on mismatch rates, and trim the other sequence
  • Only the highest probability SV is kept when comparing copy number prediction in split alignment regions

Also added an ethnicity argument for gnomAD allele frequencies, and fixed errors insertion and duplication breakpoints

@jonperdomo jonperdomo linked an issue Apr 12, 2024 that may be closed by this pull request
@jonperdomo
Copy link
Collaborator Author

Distribution of scoring values for HG002 SVs from model trained on HG002 SV gold standard:
Distribution of score values for HG002 SVs with logreg model trained on Truvari TP and FN

@jonperdomo jonperdomo removed a link to an issue Oct 21, 2024
@jonperdomo jonperdomo changed the title Add initial scoring model Split-read SV calling improvements Oct 21, 2024
@jonperdomo jonperdomo marked this pull request as ready for review October 21, 2024 18:59
@jonperdomo jonperdomo marked this pull request as draft October 21, 2024 18:59
@jonperdomo jonperdomo marked this pull request as ready for review October 25, 2024 16:21
@jonperdomo jonperdomo merged commit 2812c1e into main Oct 25, 2024
1 check passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant