Replies: 1 comment
-
Hi, First of all i moved this thread to the discussion section since this is not an issue/bug but a question. Ok now about your question:
This is all in the basics of Vaex, and i really think you should at least quickly go through the tutorial to understand how Vaex works, since under the hood the behavior is different from other dataframe libraries. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Hi there,
![1660456076379](https://user-images.githubusercontent.com/49967236/184524313-bb5a3b87-7cbe-4c62-ae33-97680e70ec98.png)
![1660456146677](https://user-images.githubusercontent.com/49967236/184524331-39b28670-2682-4df5-8347-c7bf32b27cc0.png)
I have a large dataset of about 20 million samples. The task is simple: I want to get sum of a column.
The weird thing is like this: when I run codes below, the time estimate is 19 s.
but when I run sum command individually, it takes about 1s.
But, the steps before sum calculation takes only 0.3 s.....so I don't know what happens/what makes the difference between speed in the two figures.
Thoughts? solutions?
Beta Was this translation helpful? Give feedback.
All reactions