This repository contains the code used to perform the analyses for this manuscript. Most of the analyses are run through a Snakemake workflow. The homologous sequence search additionally requires setting up a DIAMOND database built from the Genome Taxonomy Database (GTDB).
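
As a rough illustration of the database setup step, the sketch below assembles a `diamond makedb` command in Python. The file names (`gtdb_proteins.faa`, `gtdb_db`) and the helper `build_makedb_command` are hypothetical placeholders, not names taken from this repository; the paths and options the workflow actually expects may differ.

```python
import shlex


def build_makedb_command(protein_fasta, db_prefix, threads=4):
    """Assemble a `diamond makedb` call as an argument list.

    protein_fasta and db_prefix are placeholder paths (assumptions),
    not files shipped with this repository.
    """
    return [
        "diamond", "makedb",
        "--in", protein_fasta,      # input protein FASTA (e.g. GTDB proteins)
        "--db", db_prefix,          # output database prefix (.dmnd is appended)
        "--threads", str(threads),  # number of CPU threads to use
    ]


if __name__ == "__main__":
    cmd = build_makedb_command("gtdb_proteins.faa", "gtdb_db")
    # Print the command for inspection; pass `cmd` to subprocess.run(cmd, check=True)
    # once DIAMOND is installed and the input file exists.
    print(shlex.join(cmd))
```

The command is only printed here, so the sketch can be inspected without DIAMOND installed or GTDB downloaded.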