Skip to content

Commit 6ee96c9

Browse files
author
Winston Hide
committed
Update chapman_bcbio.tex
added concepts of remaining up to date. Need a table of sites and architectures upon which the code is actively deployed
1 parent b34cf2e commit 6ee96c9

File tree

1 file changed

+7
-6
lines changed

1 file changed

+7
-6
lines changed

papers/bcbio-nextgen/chapman_bcbio.tex

Lines changed: 7 additions & 6 deletions
Original file line numberDiff line numberDiff line change
@@ -39,7 +39,7 @@
3939
and cancer tumor/normal pairings. However, rapidly changing best
4040
practice approaches in alignment and variant calling, coupled with
4141
large data sizes, make it a challenge to develop scalable, accurate
42-
pipelines. Coordinated community development overcomes these
42+
pipelines that can remain up to date. Coordinated community development overcomes these
4343
challenges by sharing testing and updates across groups relying on the
4444
same infrastructure.
4545

@@ -69,7 +69,7 @@ \section*{Introduction}
6969
mechanism to assess variant quality and interfaces with downstream tools for
7070
variant analysis. Practically, it installs with a single command on multiple
7171
computing architectures, scales to large whole genome analyses, and is community
72-
developed. The goal is to provide a platform for moving from raw sequencing data
72+
developed. The goal is to provide a robust platform for moving from raw sequencing data
7373
to high-quality variant calls that evolves as algorithms and sequencing
7474
technologies change.
7575

@@ -116,7 +116,8 @@ \section*{Introduction}
116116

117117
\item Community developed: Due to the focus on solving the problems
118118
of setting up and maintaining a complex analysis pipeline, multiple
119-
sequencing centers and research laboratories use bcbio-nextgen. We
119+
sequencing centers and research laboratories use bcbio-nextgen <<<SUCH AS
120+
AND REFER TO A TABLE OF THE SITES AT WHICH IT IS EMPLOYED TOGETEHR WITH THE ARCHITECTURES>>>>. We
120121
actively encourage contributors to the code base and make it easy to
121122
get started with a fully automated installer and updater that
122123
prepares all third party software and reference genomes.
@@ -213,9 +214,9 @@ \section*{Validation}
213214
calling without recalibration and realignment, both HaplotypeCaller and
214215
FreeBayes perform as good or better without these steps.
215216

216-
The main benefit of validation is to enables experiments that quantitatively
217+
The main benefit of validation is to enable experiments that quantitatively
217218
assess widely held approaches. We expect best practices to change with new
218-
releases and algorithms, and the automated assessment mechanism allows
219+
releases and algorithms. The automated assessment mechanism allows
219220
bcbio-nextgen to track and adapt to continuously improving tools.
220221

221222
\FloatBarrier
@@ -265,7 +266,7 @@ \section*{Scaling}
265266
memory usage and disk IO to maximize the throughput of multiple simultaneous
266267
processes. An input configuration files specifies available memory usage for
267268
programs that allow memory restrictions, and expected memory usage for those
268-
that do not. These inputs allow an accurate estimate of memory consumption and
269+
that do not. These inputs allow for an accurate estimate of memory consumption and
269270
bcbio-nextgen avoids overscheduling jobs relative to available memory on each
270271
machine. Similarly, simultaneous disk IO on shared filesystems is a common
271272
bottleneck during processing. bcbio-nextgen minimizes this by use of streaming

0 commit comments

Comments
 (0)