-
Notifications
You must be signed in to change notification settings - Fork 6
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Create beerDEcoded.yaml #7
base: main
Are you sure you want to change the base?
Changes from 1 commit
8b2621a
464a318
582a990
fd93de0
196ae73
9f102af
53a8253
File filter
Filter by extension
Conversations
Jump to
Diff view
Diff view
There are no files selected for viewing
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,188 @@ | ||
id: beerDEcoded | ||
name: BeerDEcoded - StreetScienceCommunity | ||
description: BeerDEcoded - StreetScienceCommunity | ||
title_default: <b>BeerDEcoded</b> | ||
steps: | ||
- title: <b>BeerDEcoded - StreetScienceCommunity</b> | ||
content: >- | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. idea maybe the introduction could contain a bit more info about the beer microbiome. Like what microorganisms can be found in a beer. Why are they important. A more general introduction is hopefully already done before and then we could have some repetition issues here. There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. I suggest this introduction for the tour: "Beer contains DNA that comes from its ingredients (hops, malts, yeast) and some hundred microbes. There are 1,000+ yeasts used for brewing and 200+ hop varieties, each one bearing a different DNA and contributing to differentiate its properties. Thanks for new technologies of sequencing it is easier nowadays to explore the potential of genome sequencing to understand the contribution of various species to product characteristics. The sequencing of the full genome of 157 brewing yeast strains was, for example, recently reported (Gallone B, Steensels J, Prahl T, et al. 2016). Based on the identification of strains present in beer with desired characteristics, controlled experiments in which the microbial composition of the brew is altered could allow us to investigate if the presence of specific microorganisms affects flavour. The origin of each yeast species can be investigated; i.e. whether they come with the ingredients or from the environment at the production site. Furthermore, plant DNA, such as malt and hop varieties, can be found in beer samples, and the bacterial diversity can be mapped. There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. In general: sounds very good. Maybe break it up into two boxes, so it is not too much text in one.
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more.
What if we add also information about 3 steps of the process: There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. That is a good idea. There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. done |
||
What is a beer microbiome? There are collections of small living | ||
creatures. These small creatures are called bacteria and they are | ||
everywhere. In our gut, in the soil, even on vending machines. Some of | ||
these bacteria are actually very good for us. And some others can make us | ||
very ill. Bacteria come in different shapes and sizes but they have the | ||
same components. One crucial component is the DNA, the blueprint of life. | ||
The DNA encodes the shape and size and many other things unique for a | ||
bacterial species. Because of the encoding information the DNA can be used | ||
to identify what kind of bacteria the DNA is from. Therefore, within a | ||
sample form soil, our gut or beer one can specify what kind of species are | ||
inside the sample. Follow this tour to learn more about this kind of | ||
analysis. <br><br> Read and Follow the instructions before clicking | ||
<b>'Next'</b>.<br><br> Click <b>'Prev'</b> in case you missed out on any | ||
step." | ||
backdrop: true | ||
- title: <b>Create a new history</b> | ||
element: '#history-new-button' | ||
intro: >- | ||
Let's start by creating a new history.<br>Click on the button <b>'Create | ||
new history'</b><br> | ||
position: left | ||
preclick: | ||
- '#center-panel' | ||
- title: <b>Rename the history</b> | ||
element: '#current-history-panel > div.controls' | ||
intro: Change the name of the new history to <b>'BeerDEcoded'</b>. | ||
position: left | ||
- title: <b>Uploading the input data</b> | ||
content: We need to upload data. We will start with a <b>fastq</b> file. | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Idea : incorporate a box shortly repainting the structure of a fastq file. Contains reads. One read is stored in 4 lines. What do the lines mean? ... We should have it explained somewhere before I guess, but for this formats, repetition does not harm. There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. I suggest to add: "FASTQ format is a text-based format for storing both a biological sequence and its corresponding quality scores. Both the sequence letter and the quality score are encoded with a single ASCII character for brevity. A FASTQ file normally uses four lines per sequence. You can find more info in the Wikipedia article. There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. good :-) |
||
backdrop: true | ||
- title: <b>Uploading the input data</b> | ||
element: .upload-button | ||
intro: >- | ||
We will import fastq into the history we just created.<br><br> Click | ||
<b>'Next'</b> and the tour will open Galaxy Upload Manager and take you to | ||
the Upload screen. | ||
position: right | ||
postclick: | ||
- .upload-button | ||
- title: <b>Uploading the input data</b> | ||
intro: Upload the data by clicking the <b>'Paste/Fetch Data'</b> button. | ||
- title: <b>Uploading the input data</b> | ||
content: >- | ||
Load the data into your history by providing the links or choose your | ||
fastq file from your computer. <br>Click on <b>'Start'</b> to upload the | ||
data into your Galaxy history. | ||
- title: <b>Uploading the input data</b> | ||
content: >- | ||
The upload may take awhile.<br><br> Hit the <b>Close</b> button when you | ||
see that the files are uploaded into your history. | ||
- title: <b>Uploading the input data Complete !</b> | ||
content: 'Now that your data is ready, let''s use some tools.<br>' | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Idea Now that your data is ready, let's analyze it! There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. done |
||
backdrop: true | ||
- title: <b>Assign taxonomic classifications</b> | ||
content: >- | ||
One of the key steps in metagenomic data analysis is to identify the taxon | ||
to which the individual reads belong. Taxonomic classification tools are | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. belongs There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more.
done |
||
based on microbial genome databases to identify the origin of each | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. are using microbial ... There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more.
done |
||
sequence.<br> | ||
backdrop: true | ||
- title: <b>Taxonomic classification with Kraken2</b> | ||
content: >- | ||
To perform the taxonomic classification we will use Kraken2. This tool | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Maybe explain the tool in a more simple linguage. Words like: minimizer method k-mers, deterministic, memory, processing time and dustmasker. It could help to draw the tasks or steps of the tool if this is possible. There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. I suggest the explanation: "This tool uses k-mers (the read’s subsequences of length k) to assign a taxonomic label to the sequence (if possible). There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. A figure would be nice. Maybe a pile of readers split in k-mers and then searched in the DB. There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. This looks very nice. Thanks! |
||
uses the minimizer method to sample the k-mers (all the read’s | ||
subsequences of length k) in a deterministic fashion in order to reduce | ||
memory constumption and processing time. In addition, it masks | ||
low-complexity sequences from reference sequences by using dustmasker.<br> | ||
backdrop: true | ||
- title: <b>Taxonomic classification with Kraken2</b> | ||
element: .toolMenuContainer | ||
intro: Available tools appear here in the <b>tool menu</b>. | ||
position: right | ||
- title: <b>Taxonomic classification with Kraken2</b> | ||
element: '#__BVID__106' | ||
content: >- | ||
You can use <b>'tool search'</b> to locate tools.<br><br> Search for | ||
<b>'Kraken2'</b> and select it.<br><br> Tools may take a couple of moments | ||
to load. | ||
placement: right | ||
- title: <b>Taxonomic classification with Kraken2</b> | ||
element: .toolMenuContainer | ||
intro: >- | ||
Open <b>'Kraken2'</b> tool<br><br> Click <b>'Next'</b> to continue our | ||
tour. | ||
position: right | ||
- title: <b>Assign taxonomic labels with Kraken2</b> | ||
element: 'div[tour_id="use_names"]' | ||
intro: Have a look at the tool's parameters of the <b>'Kraken2'</b> tool. | ||
placement: right | ||
- title: <b>Assign taxonomic labels with Kraken2</b> | ||
intro: >- | ||
Please select the following parameters:<br><br> <b>'Single or paired | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Would be nice to have a short explanation for each or some of the parameters to change. It could be marked as side info, so the games do not have to read them, but if they would like to learn more they can. There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more.
I added the info about parameters: Single or paired reads, Confidence and database. |
||
reads'</b>: Single <br> <b>'Input sequences'</b>: Uploaded dataset<br> | ||
<b>'Print scientific names instead of just taxids'</b>: Yes<br> | ||
<b>'Confidence'</b>: 0.0<br> <b>'In “Create Report”'</b>:<br> | ||
<b>'Print a report with aggregrate counts/clade to | ||
file'</b>: Yes<br> <b>'Format report output like | ||
Kraken 1’s kraken-mpa-report'</b>: Yes<br> <b>'Select a Kraken2 | ||
database'</b>: fungi2019-03' <br><br>Click <b>'Next'</b> and the tour will | ||
<b>'Execute'</b> the Kraken2 tool for you.' | ||
postclick: | ||
- '#execute' | ||
- title: <b>Assign taxonomic labels with Kraken2</b> | ||
element: '#current-history-panel > div.controls' | ||
intro: >- | ||
Congratulations, you have created two files. It contains Classification | ||
and Report file.<br><br> It will remain stored in your history.<br><br> | ||
Click the <b>'eye'</b> icon to view the data once the history item turns | ||
green. | ||
position: left | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. I propose to add one additional step with additonal information about Classification file anf Report file format of Kraken2:
Kraken 2's standard sample report file format is tab-delimited with one line per taxon. The fields of the output, from left-to-right, are as follows:
" There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Would be good to also link your source. (Kraken manual) There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. done |
||
- title: <b>Analyze taxonomic assigment</b> | ||
content: >- | ||
Once we have assigned the corresponding taxa to the sequence, the next | ||
step is to properly visualize the data, for which we will use the Krona | ||
pie chart tool (<a href="https://doi.org/10.1186/1471-2105-12-385">Ondov | ||
et al. 2011</a>).<br> | ||
backdrop: true | ||
- title: <b>Adjust dataset format</b> | ||
element: '#__BVID__106' | ||
intro: >- | ||
But before that, we need to adjust the format of the data output from | ||
Kraken2.<br><br> Search and select the <b>'Reverse'</b> tool. | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. add: It can happen, that the output format of the tool needs to be changed in order for the next tool to read the data. Galaxy offers several "manipulation" tools. There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. It could help to explain what format changes should happen, so they understand why they are using the tools. There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more.
done There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more.
I added the info about format changes we need and info about what info kraken2 output contains. |
||
position: right | ||
- title: <b>Adjust dataset format</b> | ||
intro: >- | ||
You have selected the Reverse tool.<br> Ensure that for <b>'Input tabular | ||
dataset'</b> you have selected the report output file from | ||
Kraken2.<br><br> <b>'Execute'</b> the Reverse tool when you are ready. | ||
position: right | ||
- title: <b>Adjust dataset format</b> | ||
element: '#__BVID__106' | ||
intro: Search and select the <b>'Replace Text'</b> tool. | ||
position: right | ||
- title: <b>Adjust dataset format</b> | ||
intro: >- | ||
You have selected the Replace Text tool.<br><br> Please select the | ||
following parameters:<br><br> <b>'File to process'</b>: Output dataset | ||
'out_file' from 'Reverse' tool<br> <b>'In “Replacements”'</b>:<br> | ||
<b>'Replacement 1'</b>:<br> | ||
<b>'Find pattern'</b>: | ||
\|<br> <b>'Replace | ||
with:'</b>: \t<br> <b>'Replacement 2'</b>:<br> | ||
<b>'Find pattern'</b>: | ||
[a-z]__<br> <b>'Replace | ||
with:'</b>: Empty <br><br> <b>'Execute'</b> the Reverse tool when you are | ||
ready. | ||
position: right | ||
- title: <b>Visualize the taxonomical classification with Krona</b> | ||
content: >- | ||
<b>Krona</b> allows hierarchical data to be explored with zooming, | ||
multi-layered pie charts. With this tool, we can easily visualize the | ||
composition of the bacterial communities and compare how the populations | ||
of microorganisms are modified according to the conditions of the | ||
environment. | ||
backdrop: true | ||
- title: <b>Visualize metagenomics analysis results</b> | ||
element: '#__BVID__106' | ||
intro: Search and select the <b>'Krona pie chart'</b> tool. | ||
position: right | ||
- title: <b>Visualize metagenomics analysis results</b> | ||
intro: >- | ||
You have selected the 'Krona pie chart' tool.<br><br> Please select the | ||
following parameters:<br><br> <b>'What is the type of your input | ||
data'</b>: Tabular<br> <b>'Input file'</b>: Output dataset 'out_file' from | ||
'Replace Text' tool<br> <b>'Provide a name for the basal rank'</b>: | ||
Root<br> <b>'Combine data from multiple datasets?'</b>: No<br> <br><br> | ||
<b>'Execute'</b> the 'Krona pie chart' tool when you are ready. | ||
position: right | ||
- title: <b>Visualize metagenomics analysis results</b> | ||
element: '#current-history-panel > div.controls' | ||
intro: >- | ||
Let’s take a look at the result by clicking eye icon. Using the search bar | ||
we can check if certain taxa are present. | ||
position: left | ||
preclick: | ||
- '#center-panel' | ||
- title: <b>A tour BeerDEcoded</b> | ||
intro: >- | ||
You have reached the end of the tour.<br><br>Thank you for going through | ||
our tutorial. | ||
backdrop: true |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Workflow finds yeast strains contained in a sequenced beer sample.