##Recombination complicates the natural history of lentiviruses
Lentiviruses notoriously recombine, which effectively means that different portions of the lentiviral genome have different evolutionary--and phylogenetic--histories. Thus, in order to reconstruct the history of cross-species transmission among primate lentiviruses, we had to first assess the extent of recombination between SIV lineages. We used GARD to look for evidence of recombination breakpoints across a modified version of the LANL compendium alignment of HIV and SIVs.
To ease computational intensity, we ran GARD on 3kb regions of the genome (with 1kb overlaps on either end). We repeated this with the windows offset in order to control for proximity to region edges. Alignment coordinates listed. All analyses used the 012234 nucleotide model and 3 bins of site variation.
Window | Start | End | N breakpoints found |
---|---|---|---|
A | 1 | 3000 | 1 |
B | 2000 | 5000 | 2 |
C | 4000 | 7000 | 3 |
D | 6000 | 9000 | 2 |
E | 8000 | 11000 | 3 |
F | 10000 | end | 3 |
G | 1 | 2500 | 1 |
H | 1500 | 4500 | 3 |
I | 3500 | 6500 | 3 |
J | 5500 | 8500 | 2 |
K | 7500 | 10500 | 2 |
L | 9500 | 12500 | 3 |
M | 11500 | end | 2 |
Consensus breakpoint coordinates: 1, 2317, 2987, 3593, 4888, 5824, 6320 (omitted), 7474, 8136, 9084, 9734, 10716, 11068 (omitted), 11766, 13500
We then split the full dataset alignments along these breakpoints (all alignments held the compendium fixed and discarded insertions, coordinates are the same) and built maximum likelihood trees with RAxML for each segment. Tracing each taxa across segments allows us to visualize how each taxa's phylogenetic placement varies across breakpoints.