-
Notifications
You must be signed in to change notification settings - Fork 62
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Bus errors when writing DataFrame
#474
Comments
Further details, as shared on Slack already: |
that's pretty extreme, why isn't the table transposed? otherwise I don't know what could Bus error indicate, Arrow.jl is not super friendly with memory but this doesn't seem to be OOM like? |
The table isn't transposed because in general the columns are more heterogeneous. But this should be well below any hard limits, I've even written tables with 100000 columns before without any problems. |
I am getting a similar error when writing a TypedTable. This TypedTable is only 10x23. I got this error from running The result is a zero byte file at Arrow version 2.6.2, Julia version 1.8.5. |
is it possible to provide the schema of the 10x23 table? or better can you write a snippet reproducer to generate dummy data? |
I think I resolved the issue. It has to do with how Arrow reads Tables from disk. It does not load the entire table into memory but uses only a view. If you write this view to the same file, it causes this bus error.
I wonder if this is also the case in @simsurace's error. |
I don't believe that was the case. I was writing some intermediate results from a large calculation to a new file name, and the error occurred intermittently. |
Can confirm the same issue that @stuartthomas25 identified. My quite obvious workaround is to write the table to a temporary file and then overwrite the original. However, julia |
I intermittently get bus errors with crash to terminal when writing
DataFrame
s to.arrow
:This was with Julia 1.9.1 and Arrow.jl v2.6.2
The text was updated successfully, but these errors were encountered: