Assembly Flye Long Read

This pipeline performs assembly of long read data using Flye and polishing carried out by Racon/Medaka. QC statistics are provided using QUAST. This pipeline assumes data has been basecalled using Guppy and does not support trimming of adapters as the latest versions of Guppy trim these. Reads of short length or poor quality are removed within the pipeline to ensure high quality assembly either by nanoq or chopper based on preference.

Inputs:

A manifest is supplied using the --manifest option.

The manifest is a file consisting of three columns in CSV format.

ID,long_fastq,genome_size

ID: name of the sample output directory including assembly and quast outputs
long_fastq: path to the basecalled, adapter trimmed and demultiplexed read associated to that ID
genome_size: approximate size of the genome; used by flye to reduce memory use while producing the draft assemblies (optional - leave blank if not supplied or unknown)

Outputs:

Within the runName directory you will have a copy of the submitted manifest and a quast summary TSV for all samples within the submitted manifest.

Each sample output directory has the following subdirectories:

Draft_assembly - Contains your draft assembly directly from flye and the flye.log file for extra information on the assembly.
lr_polished_assembly - Contains the consensus fasta final assembly from medaka and quast summary based on this.

Unique to the current release:

Currently the pipeline supports optional parameters within Flye and Racon processes. This will need to be removed at the time of production.
Currently the pipeline publishes all assemblies and quast summaries under the runName directory. This needs to be changed before central use is enabled.

Name		Name	Last commit message	Last commit date
Latest commit History 145 Commits
modules		modules
subworkflows		subworkflows
tests		tests
.gitignore		.gitignore
.gitlab-ci.yml		.gitlab-ci.yml
.gitmodules		.gitmodules
.pre-commit-config.yaml		.pre-commit-config.yaml
.talismanrc		.talismanrc
LICENSE		LICENSE
README.md		README.md
assembly_flye_long_read.sh		assembly_flye_long_read.sh
main.nf		main.nf
module.template		module.template
nextflow.config		nextflow.config
nf-test.config		nf-test.config

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Assembly Flye Long Read

Inputs:

Outputs:

Unique to the current release:

About

Uh oh!

Releases

Packages

Contributors 2

Uh oh!

Languages

License

sanger-pathogens/assembly_flye_long_read

Folders and files

Latest commit

History

Repository files navigation

Assembly Flye Long Read

Inputs:

Outputs:

Unique to the current release:

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Uh oh!

Languages

Packages