Data science tools #785
Replies: 2 comments 4 replies
-
Apache Arrow is more an underlying technology for many higher level tools like https://github.com/pola-rs/polars and https://arrow.apache.org/datafusion/ In my opinion, the tech is still a bit young, but is great if you are doing interoperability between languages. I think even duckdb-rust uses arrow for passing data between the c++ library and rust. |
Beta Was this translation helpful? Give feedback.
-
I've been eyeing pola.rs for a while, and looking to leverage it as an engine for qsv. @baggiponte also suggested using it - #725 Right now, I'm thinking of tipping my toes into it by trying to use it for its time-series capabilities, particularly it temporal groupby and resampling capabilities, as we often need to do so with CSV files from IoT type feeds. And it'd be very convenient to do so from the command-line, without having to write a script/program. Are there any other "data-science" type CSV data-wrangling tasks from the command-line on your wishlist? |
Beta Was this translation helpful? Give feedback.
-
(Continuation of discussion about data science tools.)
How common and useful is Apache Arrow in data science?
Beta Was this translation helpful? Give feedback.
All reactions