@nuts-rice nuts-rice commented Apr 29, 2025

An attempt to address #120 by adding a script that runs both the Rust and Python benchmarks.

The script so far:

  • Checks for and downloads some GeoJSON data
  • Checks for and downloads some Parquet data
  • Installs the requirements for generate_data.py and rtree.py
  • Runs cargo bench for the default build and with the rayon feature
  • Runs rtree.py
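
The steps listed above could be sketched roughly as follows. This is not the script from this PR, only a minimal illustration: the helper names (`ensure_file`, `bench_commands`, `run_all`) and the `requirements.txt` file name are assumptions, and the dataset URLs are left to the caller.

```python
import subprocess
import sys
import urllib.request
from pathlib import Path


def ensure_file(path: Path, url: str) -> bool:
    """Download `url` to `path` unless `path` already exists.

    Returns True if a download happened, False if the file was already there.
    """
    if path.exists():
        return False
    path.parent.mkdir(parents=True, exist_ok=True)
    urllib.request.urlretrieve(url, str(path))
    return True


def bench_commands() -> list[list[str]]:
    """Commands to run, in order. `requirements.txt` is an assumed file name."""
    return [
        [sys.executable, "-m", "pip", "install", "-r", "requirements.txt"],
        ["cargo", "bench"],                         # default build
        ["cargo", "bench", "--features", "rayon"],  # with the rayon feature
        [sys.executable, "rtree.py"],
    ]


def run_all(datasets: dict[Path, str], dry_run: bool = False) -> list[list[str]]:
    """Fetch any missing datasets, then run every command.

    With dry_run=True nothing is downloaded or executed; the planned
    commands are only returned.
    """
    for path, url in datasets.items():
        if not dry_run:
            ensure_file(path, url)
    commands = bench_commands()
    for cmd in commands:
        if not dry_run:
            subprocess.run(cmd, check=True)  # stop at the first failure
    return commands
```

Calling `run_all({Path("data/example.geojson.zip"): "<dataset URL>"}, dry_run=True)` (both values are placeholders) returns the planned commands without touching the network or running cargo.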

rtree.py is still WIP. I tried to follow bench.py closely, with some fixes for building the R-tree.
The resources used by generate_data.py and bench.py were broken, so I've replaced them with other GeoJSON and Parquet datasets, respectively.

@nuts-rice nuts-rice force-pushed the python_rust_benches branch from f879d42 to 0e2736e Compare April 29, 2025 21:10
Member

I'd prefer a basic pyproject.toml here, managed with uv (https://docs.astral.sh/uv/), so the benches are repeatable.

geoindex-rs
geopandas
pyogrio
pyarrow
Member

You're also missing requests, numpy, and shapely from the dependency list.
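
For illustration, a minimal pyproject.toml covering the four listed dependencies plus the three missing ones might look like the sketch below. The project name, version, and `requires-python` bound are placeholders, not values from this PR, and no version pins are assumed.

```toml
[project]
name = "geo-index-benches"   # hypothetical name
version = "0.1.0"            # placeholder
requires-python = ">=3.9"    # assumption
dependencies = [
    "geoindex-rs",
    "geopandas",
    "pyogrio",
    "pyarrow",
    "requests",
    "numpy",
    "shapely",
]
```

With a file like this, `uv sync` would resolve the dependencies into a lockfile and `uv run rtree.py` would run the benchmark inside that environment.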

import shapely


gdf = gpd.read_file("Utah.geojson.zip", engine="pyogrio")
@nuts-rice nuts-rice marked this pull request as ready for review May 19, 2025 19:55