Releases: apache/hudi-rs

0.2.0

26 Nov 09:32
a97978c

What's Changed

🚀 Features

  • Support loading hudi global configs (#118) by @zzhpro
  • Add base file records' in-memory size to FileStats (#140) by @xushiyan
  • Support partition pruning API (#119) by @KnightChess
  • Add partition filter arg in Python APIs (#153) by @xushiyan
  • Add HudiFileGroupReader with consolidated APIs to read records (#164) by @xushiyan
  • Add TableBuilder API for creating Table instances (#163) by @kazdy
  • Implement datafusion TableProviderFactory (#162) by @kazdy

🐛 Bug Fixes

  • Register object store with datafusion (#107) by @abyssnlp
  • Handle validating table when DropsPartitionFields not present (#142) by @xushiyan
  • Make partition loading more efficient (#152) by @xushiyan
  • Simplify partition filter format by taking tuple of strings (#170)
  • Improve API to get file slice splits (#185)
  • Handle schema retrieval for datafusion api (#187)

🚜 Refactor

  • Extract common test code for creating table (#117) by @gohalo
  • Improve APIs for handling options (#161) by @xushiyan
  • Improve TableBuilder API for taking single option (#171) by @xushiyan
  • Minor improvement to fix coverage report status (#173) by @xushiyan

🛠️ Build

  • Use exact versions for arrow and datafusion (#105) by @xushiyan
  • Bump up datafusion to version 41, arrow to 52.2 (#120) by @yjshen
  • (deps) Update zip-extract requirement from 0.1.3 to 0.2.1 (#130) by @dependabot[bot]
  • (deps) Upgrade datafusion, pyarrow, pyo3, python versions (#149) by @kazdy
  • (deps) Upgrade arrow dependencies (#168) by @kazdy
  • (release) Bump version to 0.2.0-rc.1
  • (deps) Upgrade datafusion and object store (#182)
  • (deps) Upgrade datafusion to 42.2.0 (#192)
  • (release) Bump version to 0.2.0-rc.2

0.1.0

17 Jul 05:13
2c84570

What's Changed

🚀 Features

  • Initial rust implementation to integrate with datafusion (#1) by @xushiyan
  • Add python binding (#21) by @xushiyan
  • Implement HudiTable as python API (#23) by @xushiyan
  • Use object_store for common storage APIs (#25) by @xushiyan
  • Implement Rust and Python APIs to read file slices (#28) by @xushiyan
  • Add APIs for time-travel read (#33) by @xushiyan
  • Implement datafusion API using ParquetExec (#35) by @xushiyan
  • Add HudiConfigs for parsing and managing named configs (#37) by @xushiyan
  • Add config validation when creating table (#49) by @xushiyan
  • Add internal config to skip validation (#51) by @xushiyan
  • Support time travel with read option (#52) by @xushiyan
  • Support taking env vars for cloud storages (#55) by @xushiyan

🐛 Bug Fixes

  • Handle replacecommit for loading file slices (#53) by @xushiyan

📚 Documentation

  • Update readme, contributing guide, and issue template (#57) by @xushiyan
  • Update CONTRIBUTING with minor changes (#58) by @codope

🧪 Testing

  • Add tests crate and adopt testing tables (#30) by @xushiyan
  • Add test cases for different table setup (#36) by @xushiyan
