-
Notifications
You must be signed in to change notification settings - Fork 21
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Rename output.gd file with the name of the fastq file or part of it #275
Comments
I often use bash for loops to rename the 'output.gd' files based on what I set the output fold as when I started the run. So for the breseq commands: After the breseq runs are complete, i would execute the following command inside the Output_folder: I actually tend to get more elaborate putting this as part of a small bash script i have named "genome_diff_collection.sh"
If you aren't using unique output directories at the breseq command line, I have some ideas of how you could use grep to capture the "#=READSEQ" lines and edit them in a text editor or excel to give you a list of copy commands that you would paste onto the prompt to rename everything for you. |
Thank you Daniel for sharing your code. I think thats the way to go |
To add a couple of other ways of accomplishing workflows that involve many samples to the thread...
Start by creating GenomeDiff files with
Then, name these like Now, you can use After the output is present for all samples, commands like this should now work for comparing everything, because each
With this method, you still have to create these initial GenomeDiff files that describe each sample. I think @danieldeatherage has a script for creating these from a folder of FASTQ and some metadata.
This is a script I created for the common situation of needing to do something once for each directory inside of the current directory or once for each file inside of the current directory. It accomplishes what @danieldeatherage's bash script does, but with some other functionality (by invoking a Perl script). Download this and put it in your To get the full help, use:
I always recommend running things in test mode first to print what commands it will run if you are not in test mode first (so you don't do something crazy by mistake).
Now, if you are in a folder that contains many folders of breseq output for different samples named
What's happening is the script enters each directory and runs this command, replacing You can use the |
Dear all,
Does anybody have an idea on how to rename hundreds output.gd files with the name of the fastq file they are coming from?
So far I am doing it manually.
Thank you for your time
The text was updated successfully, but these errors were encountered: