[Improvement suggestion] Optimization of step 4 in phagcn
#56
Comments
Hi @valentynbez, in our own tests, a FASTA file containing one million sequences longer than 10 kbp takes no more than five hours to produce predictions with 40 threads on our HPC. I am not sure what happened in your run, but you can check whether the program is still running by seeing whether intermediate files are still being generated in your 'midfolder'. If not, it may be stuck because of a memory problem (I suppose). In any case, you can first split your vOTU file into smaller files and run them separately. Hope this helps. Best,
Thanks for your answer.
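The splitting step suggested above can be sketched in plain Python. This is only a minimal illustration, not part of PhaBOX: the function name `split_fasta`, the `chunk_NNNN.fasta` naming scheme, and the default of 10,000 records per file are all assumptions chosen for the example.

```python
from pathlib import Path


def split_fasta(fasta_path, out_dir, records_per_file=10000):
    """Split a FASTA file into chunks of at most `records_per_file` records.

    Hypothetical helper (not part of PhaBOX). Returns the chunk paths in order.
    """
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    chunk_paths = []
    out_handle = None
    n_records = 0
    with open(fasta_path) as fasta:
        for line in fasta:
            if line.startswith(">"):
                # Start a new chunk file every `records_per_file` headers.
                if n_records % records_per_file == 0:
                    if out_handle is not None:
                        out_handle.close()
                    chunk_path = out_dir / f"chunk_{len(chunk_paths) + 1:04d}.fasta"
                    chunk_paths.append(chunk_path)
                    out_handle = open(chunk_path, "w")
                n_records += 1
            # Lines before the first header are skipped.
            if out_handle is not None:
                out_handle.write(line)
    if out_handle is not None:
        out_handle.close()
    return chunk_paths
```

Each resulting chunk can then be used as the input of a separate run, and the per-chunk predictions combined afterwards.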
Sure, the code is indeed the same. Another potential solution is to use the code below to replace the original
Currently, I am trying to classify ~200k vOTUs. The software has been stuck for 2 weeks.
The issue lies in this chunk of code:
PhaBOX/src/phabox2/phagcn.py
Lines 135 to 173 in 6733c8b
Optimizing the underlying steps would be helpful for large-scale data.