GitHub - cauyrd/transIndel: Indel caller for DNA-seq or RNA-seq

Introduction

transIndel is used to detect indels (insertions and deletions) from DNA-seq or RNA-seq data by parsing chimiric alignments from BWA-MEM.

Prerequisites

Samtools/1.0 or newer (http://www.htslib.org/) Python 3.6 or newer (https://www.python.org/) Python packages:

Pysam/0.13.0 or newer (https://pypi.org/project/pysam)
HTSeq/0.6.1 or newer (https://pypi.python.org/pypi/HTSeq)

Getting Soure Code

git clone git://github.com/cauyrd/transIndel.git
cd transIndel

Running transIndel

STEP 1: Build new BAM file with redefined CIGAR string

analyzing DNA-seq data (whole genome seq/exome-seq/targeted capture)

 python transIndel_build_DNA.py -i input_bam_file -o output_bam_file [options]

analyzing RNA-seq data

 python transIndel_build_RNA.py -i input_bam_file -r reference_genome_fasta -g gtf_file -o output_bam_file [options]

Options:

-h, --help            show this help message and exit
-i INPUT, --input INPUT
                    Input BAM file
-o OUTPUT, --output OUTPUT
                    Output BAM file
-r REF, --ref REF     reference genome used for analyzing RNA-seq data
-g GTF, --gtf GTF     gene annotatino file used for analyzing RNA-seq data
-s SPLICE_BIN, --splice_bin SPLICE_BIN
                    splice site half bin size (default: 20)
-m MAPQ, --mapq MAPQ  minimal MAPQ of read from BAM file for supporting
                    Indel (default: 15)
-l LENGTH, --length LENGTH
                     Maximum deletion length to be detected (default:1000000)
-v, --version         show program's version number and exit

Input:

input_bam_file   			:input BAM file is produced by BWA-MEM and is sorted and indexed.
reference_genome_fasta (for RNA-seq)    :reference genome in FastA format
gtf_file (for RNA-seq)    		:gene annotation file in GTF format

Output:

your_output_bam_file			:BAM file for CIGAR string redefinement.

transIndel generates the following optional fields in output BAMs

Tag| Meaning
--------------------------------------------------------------------------------------
OA | original representative alignment; format: (pos,CIGAR)
JM | splicing junction reads; infered from GTF or splicing motif (used in RNA-seq BAM)

STEP 2: Call indel

Option 1: using transIndel_call.py script

 python transIndel_call.py -i input_bam_from_transIndel_build -o output_vcf_filename_prefix [options]

Options:

-h, --help            show this help message and exit
-i INPUT, --input INPUT
                        Input BAM file
-o OUTPUT, --output OUTPUT
                    output VCF file prefix
-c AO, --ao AO        minimal observation count for Indel (default: 4)
-d DEPTH, --depth DEPTH
                 minimal depth to call Indel (default: 10)
-f VAF, --vaf VAF     minimal variant allele frequency (default: 0.1)
-l LENGTH, --length LENGTH
                 minimal Indel length (>=1) to report (default: 10)
-m MAPQ, --mapq MAPQ  minimal MAPQ of read from BAM file to call Indel
                 (default: 15)
-t REGION             Limit analysis to targets listed in the BED-format
                 FILE or a samtools region string
-v, --version         show program's version number and exit

Input:

input_bam_file   			:input BAM file is produced by transIndel_build.py

Output:

output_vcf_file   			:Reported Indels with VCF format

Option 2: using existing variant caller (e.g. VarDict, freebayes, GATK)
```
 following the specific variant caller's manual
```

Name		Name	Last commit message	Last commit date
Latest commit History 25 Commits
LICENSE		LICENSE
README.md		README.md
transIndel_build_DNA.py		transIndel_build_DNA.py
transIndel_build_RNA.py		transIndel_build_RNA.py
transIndel_call.py		transIndel_call.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Introduction

Prerequisites

Getting Soure Code

Running transIndel

STEP 1: Build new BAM file with redefined CIGAR string

Options:

Input:

Output:

STEP 2: Call indel

Options:

Input:

Output:

About

Releases 2

Packages

Languages

License

cauyrd/transIndel

Folders and files

Latest commit

History

Repository files navigation

Introduction

Prerequisites

Getting Soure Code

Running transIndel

STEP 1: Build new BAM file with redefined CIGAR string

Options:

Input:

Output:

STEP 2: Call indel

Options:

Input:

Output:

About

Resources

License

Stars

Watchers

Forks

Releases 2

Packages 0

Languages

Packages