GitHub - danhalligan/rosalind.info: Solutions to the bioinformatic coding challenges at rosalind.info

This repository contains solutions to bioinformatics coding challenges from rosalind.info. Problems are organised by the various different locations:

Python Village: initial problems to learn a few basics about the Python programming language.
Bioinformatics Stronghold: problems to discover the algorithms underlying a variety of bioinformatics topics.
Bioinformatics Armory: unlike the stronghold in the Armory we solve problems by using existing tools.
Bioinformatics Textbook Track: problems associated with Bioinformatics Algorithms: An Active Learning Approach.
Algorithmic Heights: exercises to accompany the book Algorithms.

Running the solutions

This repository is written as a python module and uses poetry and typer.

Solutions for each problem are located in individual files inside the directory for each location.

You can install the versions of dependencies used here with:

poetry install

To run solutions within this environment run, e.g.:

poetry run rosalind ini2 rosalind_ini2.txt

To run the solution on the provided "Sample Dataset" from rosalind.info (which should reproduce the "Sample Output"), run the solution in "test" mode:

poetry run rosalind --test ini2

Testing

pytest-snapshot is used to test solutions to problems. In many cases solutions generated will and should exactly match the "Sample Output" given at rosalind.info. In cases, where e.g. ordering is not important, the expected solutions (in tests/expected) have been updated to match code used here, but are equally valid solutions.

To run the tests use:

poetry run pytest

To update the tests (adding or modifying snapshots / expected output) use:

poetry run pytest --snapshot-update

Note that some solutions (that use Entrez) require an email address. This should be set as an environment variable, e.g.:

export [email protected]

About

My rosalind profile: https://rosalind.info/users/danhalligan/

Solutions

Python Village

Bioinformatics Stronghold

Notes

For "QRTD" I have cheated by using tqDist. For the solution to run you will need to install quartet_dist and have it available in your path. Well done to anyone else who solved this properly!

Bioinformatics Armory

Notes

For "MEME" and "CLUS" I have not written a solution. Use the web interface as instructed.
For "SUBO", you need to run the online interface, identify the 32-40 bp and then can use the solution here to count the occurrences of this in the sequences.

Name		Name	Last commit message	Last commit date
Latest commit History 134 Commits
.github/workflows		.github/workflows
.vscode		.vscode
images		images
rosalind		rosalind
template		template
tests		tests
.flake8		.flake8
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
LICENSE		LICENSE
README.md		README.md
poetry.lock		poetry.lock
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Running the solutions

Testing

About

Solutions

Python Village

Bioinformatics Stronghold

Notes

Bioinformatics Armory

Notes

Bioinformatics Textbook Track

Algorithmic Heights

About

Releases

Packages

Languages

License

danhalligan/rosalind.info

Folders and files

Latest commit

History

Repository files navigation

Running the solutions

Testing

About

Solutions

Python Village

Bioinformatics Stronghold

Notes

Bioinformatics Armory

Notes

Bioinformatics Textbook Track

Algorithmic Heights

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages