Skip to content

GeoParquet Read vs Write Performance #440

Answered by Maxxen
youngpm asked this question in Q&A
Discussion options

You must be logged in to vote

I don't think this is a bug, Parquet is a heavily read-optimized format so I think its expected that writes are much slower depending on the column types/compression codecs used. The "Geo"-Parquet code path only adds a small amount of additional processing to calculate the geo-specific metadata/statistics as required by the geoparquet specification, so I'd only be inclined to investigate write-speed if it turns out that writing geoparquet is significantly slower than normal parquet.

Replies: 3 comments 3 replies

Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
3 replies
@Maxxen
Comment options

Answer selected by youngpm
@Maxxen
Comment options

@youngpm
Comment options

Comment options

You must be logged in to vote
0 replies
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
3 participants