Instructions to use HTS pipeline

This pipeline works for raw files coming from the Bauer center sequencing facility. It works in the HMS Orchestra cluster as of 10/20/2015

(IMPORTANT !!!!!!!!!!the master branch is now deprecated and is not been updated and probably won't work correctly, use sabre branch).

Make sure to have in the directory where you plan to analyze the results, aside from the sequencing file, the following files:

analysis.sh
analysis.py
log_parser.py
BDGtoWIG.py
fastq_trimmer.py
a setup.cfg file (check example file)
sizes ** In the setup file don't change the first 3 lines unless you know what you're doing, the next lines should contain the name you want to give your sample followed by the Input barcode name (eg:BAR40) followed by the IP barcode name, each one separated by a tab. If you don't have input samples or you have very few reads on the input just put as input bar code the same as your sample barcode (the peak calling won't give you any meaningfull results but without inputs you shouldn't be able to peak call, but you would still have the track pileup)

The easiest way to have all the files is going to your orchestra directory and running: git clone https://github.com/LuisSoares/HTSbauer.git And then copy the Bauer Center Raw file to the HTS directory.

Update July 2016, the fastest way is now to use the sabre branch of the git repository (this branch seems stable so it will probably be merged with master soon)

Sugestion: Personal folders in orchestra are restricted to 100GB, at one point the analysis will need more than 30GB so it is better that you run the analysis in a folder in the temporary /n/scratch2 filesystem where you have several terabytes available (beware that scratch is only temporary!).

To Run the script:

Start your ssh session
Goto the Directory with your required files
run: "source analysis.sh "BAUER_CENTER_SEQUENCING_FILE" "
wait around 6/7 hours (You will receive an email once is over) (less than 1 hour if sabre branch is used)

The script will output the following when using the sabre branch:

mochiview directory with wig files ready to load in mochiview
Jbrowser directory with bigwig files to load in yeast genome database browser
fastqc directory with one html and one zip file for quality controls for which barcode
temp directory containing the bam files for each barcode as well as the peak calling files for each barcode
a IGV directory with sorted and indexed bam files to load in IGV (
demultiplex and mapping and duplication pdf files with statistics of these two steps
log.out file with the entire log of the script run

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Instructions to use HTS pipeline

(IMPORTANT !!!!!!!!!!the master branch is now deprecated and is not been updated and probably won't work correctly, use sabre branch).

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 41 Commits
.gitignore		.gitignore
BDGtoWIG.py		BDGtoWIG.py
Readme.md		Readme.md
analysis.py		analysis.py
analysis.sh		analysis.sh
fastq_trimmer.py		fastq_trimmer.py
log_parser.py		log_parser.py
setup.cfg		setup.cfg

LuisSoares/HTSbauer

Folders and files

Latest commit

History

Repository files navigation

Instructions to use HTS pipeline

(IMPORTANT !!!!!!!!!!the master branch is now deprecated and is not been updated and probably won't work correctly, use sabre branch).

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages