All notable changes to this project will be documented in this file.
- Initial online documentation;
- Migrated the codebase to Python 3.8+;
- new extention for
embedding
module with ESM transformer-based protein embedding model;
requirements.txt
updated;- Updated
README.md
;
- K-mer transformer with parallelization support;
- Embedding for bacteriophages nucleotides with new Example in Jupyter Notebook format;
requirements.txt
with pandarallel, scikit-learn, gensim and numpy;
- Initial Travis CI integration;
- Bacteriophage structural protein classifier;
- Updated
README.md
;
- Examples in Jupyter Notebook format;
- Improved
setup.py
script and dependencies; - Updated
README.md
;
- Added
setup.py
script for Python Index Packages (PyPI);
embedding
module with BERT pre-trained embedding model for proteins with support for single protein vectorization as well as for proteins sets which are averagend to bacteriophage level;
features
module with protein-based (amino-acid) feature extractor for single protein sequence as well as for multifasta;crawlers
module with MillardLab website support;
requirements.txt
with fake-useragent, biopython, lxml, python-requests and pandas;