Releases: picrust/picrust2
PICRUSt2 v2.5.3
Minor compatibility changes:
- Add file for editable pip install to continue working
- Addressed pandas deprecation warnings
- Change dependencies to be compatible with latest QIIME 2 environment (including latest SEPP and specifying version of dendropy that is compatible).
PICRUSt2 v2.5.2
-
Fixed dtype warning issue reported here (Note that this did not seem to affect the output either way)
-
Changed lower bound for h5py to enable compatibility with QIIME 2 conda environment (and also enable glpk and r-castor to be compatible with higher versions)
-
flake8-identified formatting cleanups
-
Cleaned-up redundant info and version info across files (specifically, just list version number, license, etc., in setup.pyrather than in all files)
-
Removed unnecessary script
print_picrust2_config.py
PICRUSt2 v2.5.1
Fixed incompatibility issues related to newer versions of dependencies, specifically numpy and pandas.
PICRUSt2 v2.5.0
-
Fixed important bug where the "norm_taxon_function_contrib" column of the contrib (i.e., stratified) output tables contained wrong values (e.g., #206). This was especially clear from the fact that the values were frequently NA or > 1 (when they should all be proportional). Many thanks to @ragavishn and Vincent Darbot for bringing this to my attention.
-
Added several checks to input sequence abundance table (especially when it is provided as a simple TSV table). This should provide more informative error messages regarding how the table is malformed and how it can be fixed.
-
Added check for whitespace in input FASTA file headers to provide informative message (otherwise this often results in issues with hmmalign / placement workflow that isn't obvious from the errors)
-
Bumped versions of gappa and castor dependencies to latest versions.
-
Added "cython" as a dependency to avoid installation problems (thanks @crusher083!)
PICRUSt2 v2.4.2
Minor update, primarily to add functionality with Python 3.8. Also now tests are run with Circle CI instead of Travis CI.
PICRUSt2 v2.4.1
Quick release to fix bug that prevented showing shuffle_predictions.py
help documentation.
PICRUSt2 v2.4.0
Major changes:
-
Added SEPP functionality, which can now be used as a low-memory alternative to EPA-ng for placing sequences
-
Added in norm_taxon_function_contrib column which is the same as the column ContributionPercentOfSample in the PICRUSt1 metagenome contributions output (which is also present in the converted legacy file now too)
-
Added a new script (shuffle_predictions.py), which can be used to quickly get shuffled trait predictions for comparing with actual predictions
Minor changes:
-
Added edge_exponent as an HSP option (which is particularly useful sometimes when running SEPP)
-
Removed white space from end of FASTA headers to avoid common error
-
Allow for cases where no pathways are output
-
Automatically check if any tips match trait table when quotes removed and remove if so
-
Check if only very few rows of trait table found in tree
-
Throw error if all NAs for any trait and warning if any NA in prediction table
-
Default reference FASTA is now unzipped to make it compatible with SEPP
-
Clarified message about sequences below NSTI cut-off to make it less confusing
-
Fixed recurrent typo of "Enzyme Classification" to be "Enzyme Commission"
-
Suppress printing log now when running MinPath (it was dominating the log output previously and making it hard to see other, more pertinent, messages)
-
Increased version numbers used for dependencies and made minor syntax changes to get them working, esp. pandas. This was done so that the tool would be compatible with the latest QIIME 2 version.
PICRUSt2 v2.3.0-b
Major changes
-
Now screen out study sequences that align poorly to reference sequences (taken to be 80% of original length by default). These sequences aren't placed at all and are listed in standard error.
-
Fixed bug where a mix of integers and floats were all converted to floats when used as sequence ids.
Minor changes
-
Improved how stderr and stdout of subprocess commands is captured when running
picrust2_pipeline.py
. -
Will output range of study sequence lengths when
--verbose
option set. -
Bumped r-castor version from 1.4.1 to 1.5.0
-
Bumped epa-ng version from 0.3.5 to 0.3.6
-
Bumped gappa version from 0.4.0 to 0.5.1
-
Set required hmmer version range to be >=3.1.b2,<=3.2.1 rather than just 3.2.1 (Confirmed that study sequence alignments are identical with these three hmmalign versions: 3.1b2, 3.2, 3.2.1)
-
Will now check for a column called "taxonomy" and throw a warning if it exists.
-
Added more verbose options and changed option to be
--verbose
rather than--print_cmds
for clarity in individual scripts. -
Throw error if space found in input FASTA filepath (causes problems for downstream placement tools)
-
Create pipeline output directory after running basic sanity checks (like checking that all files exist)
-
Set upperbound of 0.25.0 for pandas version for now to avoid massive spike in system time usage with newest versions
PICRUSt2 v2.2.0-b
Major:
-
Long-format ("contributional") stratified output is now the default output stratified table, which greatly reduces memory usage. The wide-format table can still be output when the "--wide_table" option is set.
-
Added option to skip copy number normalization to metagenome_pipeline.py and picrust2_pipeline.py along with new unittests.
-
Fixed bug that caused error when running --per_seq_contrib with picrust2_pipeline.py.
-
Bumped r-castor to 1.3.6 -> 1.4.1 and gappa from 0.2.0 -> 0.4.0
-
Convert ASV ids to be strings to avoid cases where they are treated as floats.
-
Changed output format of "metagenome contributions" filetype. Now both relative and raw ASV abundances are output and column names are simplified.
Minor:
-
Removed exact pinning for most conda dependencies to fit with bioconda distribution best-practices.
-
Added pre-print citation in setup.py
-
Added garbage collector (i.e.
gc()
) calls to castor R script to help reduce memory usage. -
Now output an additional unstrat pathway abundance table based on regrouping per-seq-contrib stratified output when --per_sequence_contrib used.
-
Added ability for convert_table.py to convert to legacy contributional format.
-
Made option usage for add_descriptions.py clearer.
-
Moved all MetaCyc pathway descriptions to single file.
-
Gzipped all testfiles.
-
Added test for --per-seq-contrib when running full pipeline.
-
Fixed bug where samples with no ASVs / predictions were blank in the weighted NSTI output (now output as 0).
PICRUSt2 v2.1.4-b
Major changes:
- Added metagenome_contrib option to pipeline script (i.e. a similar breakdown of taxonomic contributors to functions as output in PICRUSt1).
- Changed default output files to be gzipped, which greatly saves on disk usage.
- Bumped to castor version 1.3.6.
Thanks to @efratmuller for suggesting the option to remove intermediate files and for an implementation in R for getting metagenome contributions to compare against.
Minor changes:
- Added flag to picrust2_pipeline.py to optionally remove intermediate files.
- Changed pd.read_table calls to pd.read_csv to avoid future warning.
- Removed unused tutorial folders.
- Removed unnecessary future imports.
- Changed pipeline timing to only be printed if verbose option set.
- Minor typo and help documentation changes.