
EIAV Project Data

Robert J. Gifford edited this page Dec 3, 2024 · 8 revisions

Genome-Length Reference Sequences

The EIAV extension layer includes a set of genome-length EIAV reference sequences from NCBI Nucleotide, held in a source named "ncbi-refseqs-eiav".

Details for each isolate and sequence can be found in the accompanying data file.

A reference phylogeny was reconstructed from these sequences by maximum likelihood, using RAxML. An annotated phylogeny PDF is provided in this repository.

The reference phylogeny defines the following genotypes and subtypes:

  • Genotype American
  • Genotype Asian
  • Genotype European1
  • Genotype European2
  • Genotype Spain
  • Genotype Brazil
  • Genotype UK

Nuccore Sequences

The EIAV extension layer contains a regularly updated set of EIAV sequences downloaded from NCBI Nucleotide (GenBank), held in a source named "ncbi-nuccore-eiav". This set excludes the genome-length references listed above.

These sequences are linked to standardized metadata extracted from GenBank XML using GLUE's GenbankXmlPopulator module. In cases where isolate data are missing from GenBank entries, we have supplemented these fields with values obtained through our EIAV origins investigation. Missing data are added to the Lentivirus-GLUE database using GLUE's textFilePopulator module, configured in this XML file.
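
The supplementation rule described above (curated values fill only the fields that GenBank leaves empty) can be sketched in Python. This is an illustration of the logic only, not the GLUE module itself: the function name, the field names and the placeholder values are all assumptions made for the example.

```python
from typing import Dict, Optional

Record = Dict[str, Optional[str]]


def supplement_missing(genbank: Record, curated: Record) -> Record:
    """Return a copy of `genbank` with empty fields filled from `curated`.

    Values already present in the GenBank-derived record are never overwritten.
    """
    merged = dict(genbank)
    for field_name, value in curated.items():
        is_missing = merged.get(field_name) in (None, "")
        if is_missing and value not in (None, ""):
            merged[field_name] = value
    return merged


# Hypothetical records; field names and values are placeholders.
genbank_record = {
    "host_species": "Equus caballus",
    "country": None,
    "collection_year": "",
}
curated_record = {
    "host_species": "horse",
    "country": "ExampleCountry",
    "collection_year": "ExampleYear",
}

merged = supplement_missing(genbank_record, curated_record)
print(merged)
```

Here the GenBank value for `host_species` is kept, while the two empty fields take the curated values. The input record is left unmodified.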

Genotype and subtype assignments for all nuccore sequences, previously calculated using a maximum likelihood-based genotyping tool provided with this extension, are also imported from a tabular file.
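
As a rough illustration of what importing assignments from a tabular file involves, the sketch below reads a tab-delimited table into a lookup keyed by sequence ID. The column names (`sequenceID`, `genotype`, `subtype`) and the example rows are assumptions. The actual file layout used by the extension may differ.

```python
import csv
import io
from typing import Dict, Optional, TextIO, Tuple

Assignment = Tuple[str, Optional[str]]


def load_assignments(handle: TextIO) -> Dict[str, Assignment]:
    """Read tab-delimited rows into {sequence_id: (genotype, subtype)}.

    An empty subtype column is stored as None.
    """
    reader = csv.DictReader(handle, delimiter="\t")
    assignments: Dict[str, Assignment] = {}
    for row in reader:
        seq_id = row["sequenceID"].strip()
        genotype = row["genotype"].strip()
        subtype = row["subtype"].strip() or None
        assignments[seq_id] = (genotype, subtype)
    return assignments


# Hypothetical table contents; the sequence IDs are placeholders.
example_table = io.StringIO(
    "sequenceID\tgenotype\tsubtype\n"
    "SEQ_0001\tAmerican\t\n"
    "SEQ_0002\tEuropean1\t\n"
)

assignments = load_assignments(example_table)
print(assignments)
```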

Additionally, this EIAV extension layer includes comprehensive metadata linked to each EIAV sequence. Metadata categories include:

  • Sequence information (length, publication date)
  • Taxonomic data (genotype, subtype)
  • Isolate data (host species, sampling date, location, isolation source)

Alignment Tree

This extension layer establishes a dedicated alignment tree, rooted on bovine viruses, complementing the genus-level alignment tree created during the Lentivirus-GLUE build.

The alignment tree defines the following clade categories:

  1. Subgenus: Equine
  2. Genotype: Follows established nomenclature regarding American, Asian and European clades.
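
To make the clade categories concrete, the following Python sketch models the hierarchy as a small tree and lists the nodes in each category. It assumes that each genotype in the reference phylogeny is a direct child of the Equine subgenus node. The class and function names are illustrative and are not GLUE identifiers.

```python
from dataclasses import dataclass, field
from typing import Iterator, List, Tuple


@dataclass
class CladeNode:
    name: str
    category: str  # e.g. "subgenus" or "genotype"
    children: List["CladeNode"] = field(default_factory=list)


def walk(node: CladeNode, depth: int = 0) -> Iterator[Tuple[int, CladeNode]]:
    """Yield (depth, node) pairs in pre-order."""
    yield depth, node
    for child in node.children:
        yield from walk(child, depth + 1)


# Assumed layout: one subgenus node with the genotypes as direct children.
genotype_names = [
    "American", "Asian", "European1", "European2", "Spain", "Brazil", "UK",
]
tree = CladeNode(
    "Equine",
    "subgenus",
    [CladeNode(name, "genotype") for name in genotype_names],
)

genotypes = [node.name for _, node in walk(tree) if node.category == "genotype"]

for depth, node in walk(tree):
    print("  " * depth + f"{node.category}: {node.name}")
```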
