Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Line limit of bedtools getfasta and other sub-commands? #1107

Open
YuanfengZhang opened this issue Nov 11, 2024 · 0 comments
Open

Line limit of bedtools getfasta and other sub-commands? #1107

YuanfengZhang opened this issue Nov 11, 2024 · 0 comments

Comments

@YuanfengZhang
Copy link

When passing a sorted bed file containing more than 20 billion lines of genomic regions to getfasta, I only got the results of chr1, chr10 and chr11. The rows behind them were ignored. By the way, I'm sure the RAM is abundant (1TB 3200MHz).

I searched the closed issues and bedtools documentation but didn't get a proper answer. I know I can avoid this by separating single file to multiple small files, but I'm just curious that does bedtools have a line limit which is not described in the doc, or it's more likely to be a buffer/tmp file issue on my Ubuntu server?

Thanks for your help in advance.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant