Insights: apache/iceberg
Overview
30 Pull requests merged by 16 people
-
Build: Bump software.amazon.awssdk:bom from 2.30.26 to 2.30.31
#12439 merged
Mar 9, 2025 -
Docs: fix typo in rest-catalog-open-api.yaml.
#12480 merged
Mar 8, 2025 -
Site: Fix Footer Link
#12478 merged
Mar 7, 2025 -
Core: Provide access to format-version of metadata table
#12462 merged
Mar 7, 2025 -
Parquet: Support unknown and timestamp(9) in generics and internal model
#12463 merged
Mar 6, 2025 -
Data: Expose snapshot-id instead of branch for computing partition stats
#12464 merged
Mar 6, 2025 -
Core: Don't create empty RemovePartitionSpecs MetadataUpdate
#12465 merged
Mar 6, 2025 -
Spec: Add implementation note on `current-snapshot-id`
#12334 merged
Mar 6, 2025 -
Core: Write `null` for `current-snapshot-id` for V3+
#12335 merged
Mar 6, 2025 -
Parquet: Implement Variant writers
#12323 merged
Mar 5, 2025 -
support create table like in flink catalog
#12199 merged
Mar 5, 2025 -
Core: Ensure current and newly added view versions are retained in ViewMetadata build
#12401 merged
Mar 5, 2025 -
Wrap variant in PrimitiveHolder so serialization can result same instance
#12317 merged
Mar 5, 2025 -
Avro: Support timestamp(9) and unknown types
#12455 merged
Mar 5, 2025 -
Spark: Infer partition spec in ADD_FILES procedure for FileTables than taking latest table spec
#12327 merged
Mar 5, 2025 -
Core: Add Variant logical type for Avro
#12238 merged
Mar 4, 2025 -
Core: Change RemoveSnapshots to remove unused schemas
#12089 merged
Mar 4, 2025 -
Docs: Deprecate data_file.distinct_counts in v3
#12182 merged
Mar 3, 2025 -
[1.7.x] Fix Kafka-connect `LICENSE` and `NOTICE`
#12400 merged
Mar 3, 2025 -
[1.7.x] Align azure-storage-blob version in azure-bundle LICENSE
#12445 merged
Mar 3, 2025 -
Build: Bump net.snowflake:snowflake-jdbc from 3.22.0 to 3.23.0
#12437 merged
Mar 3, 2025 -
Build: Bump io.netty:netty-buffer from 4.1.118.Final to 4.1.119.Final
#12440 merged
Mar 3, 2025 -
Fix typo in delete statement: change database name from "dbl" to "db" in spark-writes.md
#12426 merged
Mar 3, 2025 -
Build: Bump jackson-bom from 2.18.2 to 2.18.3
#12434 merged
Mar 3, 2025 -
Build: Bump slf4j from 2.0.16 to 2.0.17
#12436 merged
Mar 3, 2025 -
Build: Bump org.mongodb:bson from 4.11.0 to 4.11.5
#12438 merged
Mar 3, 2025 -
Build: Bump datamodel-code-generator from 0.28.1 to 0.28.2
#12433 merged
Mar 2, 2025 -
Build: Bump org.openapitools:openapi-generator-gradle-plugin from 7.11.0 to 7.12.0
#12435 merged
Mar 2, 2025 -
Build: Ignore README.md/LICENSE/NOTICE in all paths on CI
#12429 merged
Mar 2, 2025 -
Build: Bump mkdocs-material from 9.6.5 to 9.6.6
#12432 merged
Mar 2, 2025
21 Pull requests opened by 15 people
-
Fix partitioning on timestamp field with year and month transformations
#12443 opened
Mar 3, 2025 -
API: Speed up Timestamps#toHumanString
#12447 opened
Mar 3, 2025 -
Support In and notIn operators in ParquetFilters.ConvertFilterToParquet
#12449 opened
Mar 4, 2025 -
Spark-3.5: Add spark action to compute partition stats
#12450 opened
Mar 4, 2025 -
Spark-3.5: Add procedure to compute partition stats
#12451 opened
Mar 4, 2025 -
Add unit test for AddFilesProcedure to check invalid column in partition filter
#12456 opened
Mar 5, 2025 -
Avro: Add variant readers and writers
#12457 opened
Mar 5, 2025 -
Avoid Avro recursive schema for Variant schema.
#12459 opened
Mar 5, 2025 -
Decouple Committer from Kafka and Enable Custom Coordinator Election
#12460 opened
Mar 5, 2025 -
HIVE-28801 Iceberg: Refactor HMS table parameter setting to be able to reuse
#12461 opened
Mar 5, 2025 -
Adds AWS to vendors page
#12468 opened
Mar 6, 2025 -
Flink 1.20: Support Avro and Parquet timestamp(9), unknown, and defaults
#12470 opened
Mar 6, 2025 -
Core: Use InternalData with avro and common DataIterable for readers.
#12476 opened
Mar 7, 2025 -
Spark 3.5: Apply Equality Deletes when Doing Copy on Write
#12479 opened
Mar 7, 2025 -
Docs: Update Iceberg talks with recent Iceberg meetup sessions
#12481 opened
Mar 8, 2025 -
Spark: Use correct statistics file in SparkScan::estimateStatistics(Snapshot)
#12482 opened
Mar 8, 2025 -
Build: Bump mkdocs-material from 9.6.6 to 9.6.7
#12483 opened
Mar 9, 2025 -
Build: Bump testcontainers from 1.20.5 to 1.20.6
#12484 opened
Mar 9, 2025 -
Build: Bump com.google.cloud:libraries-bom from 26.55.0 to 26.56.0
#12485 opened
Mar 9, 2025 -
Build: Bump com.gorylenko.gradle-git-properties:gradle-git-properties from 2.4.2 to 2.5.0
#12486 opened
Mar 9, 2025 -
Build: Bump com.azure:azure-sdk-bom from 1.2.31 to 1.2.32
#12487 opened
Mar 9, 2025
11 Issues closed by 5 people
-
Broken link on the iceberg web page.
#12475 closed
Mar 7, 2025 -
Writing to Iceberg Tables in Polaris not working
#12469 closed
Mar 7, 2025 -
Extends Iceberg table stats API to allow publish data and stats atomically
#6442 closed
Mar 7, 2025 -
Serialize `null` for `current-snapshot-id` when there is no current snapshot for ≥V3
#12310 closed
Mar 6, 2025 -
RewriteDataFiles maintenance action never converges
#6669 closed
Mar 6, 2025 -
ViewMetadataBuilder does not always retain changes in current build
#12113 closed
Mar 5, 2025 -
deadlock when spark call delete row position
#10987 closed
Mar 5, 2025 -
Add option to provide partition spec in spark ADD_FILES procedure
#12325 closed
Mar 5, 2025 -
show table extended not supported for v2 table.
#5782 closed
Mar 4, 2025 -
It's not possible to readStream from an Iceberg table as source when its snapshots expire
#9504 closed
Mar 4, 2025 -
Add aliyun-bundle jar
#10970 closed
Mar 3, 2025
14 Issues opened by 13 people
-
Increase concurrency during rewriting position delete files in Iceberg Tables.
#12477 opened
Mar 7, 2025 -
Support full table scanning of partitioned table is prohibited
#12474 opened
Mar 7, 2025 -
Flink/Spark: add visitor support for variant
#12473 opened
Mar 7, 2025 -
Parquet: Support Variant Array read and write
#12472 opened
Mar 7, 2025 -
Spark RewriteTablePath Procedure not fully using AssumeRoleAwsClientFactory config
#12471 opened
Mar 6, 2025 -
Data Integrity Issue with DELETE Operation Using Copy-on-Write (COW) and Equality Deletes
#12467 opened
Mar 6, 2025 -
Spark can't get information from metadata tables
#12466 opened
Mar 6, 2025 -
Spark returns incorrect results when reading Parquet bloom filters created by Trino
#12458 opened
Mar 5, 2025 -
Flaky test `TestMergeOnReadMerge > testMergeWithSnapshotIsolation()`
#12453 opened
Mar 4, 2025 -
[Feature Request ][iceberg] use AWS IAM role with serviceAccount instated of IAM user
#12448 opened
Mar 4, 2025 -
Spark Procedure Azure Exception Signed expiry time must be after signed start time
#12446 opened
Mar 3, 2025 -
Spark readStream not progressing
#12444 opened
Mar 3, 2025 -
Can not read data when there is a required field under an optional struct
#12441 opened
Mar 3, 2025
48 Unresolved conversations
Sometimes conversations happen on old items that aren’t yet closed. Here is a list of all the Issues and Pull Requests with unresolved conversations.
-
API, Core: Add geometry and geography types support
#12346 commented on
Mar 7, 2025 • 43 new comments -
Handling no coordinator and data loss in ICR mode
#12372 commented on
Mar 7, 2025 • 16 new comments -
Auth Manager API part 6: API enablement
#12197 commented on
Mar 6, 2025 • 16 new comments -
Flink: Dynamic Iceberg Sink Contribution
#12424 commented on
Mar 5, 2025 • 6 new comments -
Retry on NoSuchNamespaceException not found in rename table for rest catalog
#12159 commented on
Mar 5, 2025 • 6 new comments -
pass proxy configuration from environment vars to http client
#12406 commented on
Mar 7, 2025 • 3 new comments -
Docs: Update statements mentioning Hive's alpha/beta versions
#12430 commented on
Mar 6, 2025 • 1 new comment -
OpenAPI: Use more clear language in recommending error responses
#12376 commented on
Mar 7, 2025 • 1 new comment -
Azure: Support vended credentials refresh in ADLSFileIO.
#11577 commented on
Mar 8, 2025 • 1 new comment -
Core: Extended header support for RESTClient implementations
#12194 commented on
Mar 7, 2025 • 1 new comment -
Spark: Rewrite V2 deletes to V3 DVs
#12250 commented on
Mar 7, 2025 • 1 new comment -
WIP File format write
#12164 commented on
Mar 7, 2025 • 0 new comments -
Spark: Support singular form of years, months, days, and hours functions
#12117 commented on
Mar 6, 2025 • 0 new comments -
Spec: Update partition stats for V3
#12098 commented on
Mar 7, 2025 • 0 new comments -
Parquet: Fix Reader leak by removing useless copy
#12079 commented on
Mar 3, 2025 • 0 new comments -
Adding new rewrite manifest spark action to accept custom partition o…
#11881 commented on
Mar 5, 2025 • 0 new comments -
Check if dependencies in libs.versions.toml are the latest supported ones for JDK11
#10852 commented on
Mar 2, 2025 • 0 new comments -
Spark 3.5: Add Comet tests
#12176 commented on
Mar 8, 2025 • 0 new comments -
Spark: DVs + Positional Deletes + Compaction
#12181 commented on
Mar 9, 2025 • 0 new comments -
Core: Make totalRecordCount optional in PartitionStats
#12226 commented on
Mar 7, 2025 • 0 new comments -
Spark: Structured Streaming read limit support follow-up
#12260 commented on
Mar 6, 2025 • 0 new comments -
Core: use ReachableFileCleanup when table has discontinuous snapshots
#12261 commented on
Mar 5, 2025 • 0 new comments -
Spark: Detect dangling DVs properly
#12270 commented on
Mar 7, 2025 • 0 new comments -
Spark-3.5: Add unit tests for ColumnarBatchUtil
#12275 commented on
Mar 3, 2025 • 0 new comments -
AWS: Integrate S3 analytics accelerator library
#12299 commented on
Mar 5, 2025 • 0 new comments -
Iceberg table not able to read data from S3 after few hours using Athena .
#9684 commented on
Mar 3, 2025 • 0 new comments -
Review new ImmutablesReferenceEquality error-prone check
#10855 commented on
Mar 3, 2025 • 0 new comments -
Kafka: runtime integration test failure or flaky
#11046 commented on
Mar 3, 2025 • 0 new comments -
Cannot update map key
#9163 commented on
Mar 4, 2025 • 0 new comments -
Iceberg Spark streaming skips rows of data
#10156 commented on
Mar 4, 2025 • 0 new comments -
AWS Glue Apache Iceberg Data Recovery
#11077 commented on
Mar 4, 2025 • 0 new comments -
I cannot create iceberg tables using databrics.
#10889 commented on
Mar 4, 2025 • 0 new comments -
Flaky test `TestHadoopCommits > testConcurrentFastAppends(File)`
#11047 commented on
Mar 4, 2025 • 0 new comments -
Flaky Spark tests due to initializationError
#12009 commented on
Mar 5, 2025 • 0 new comments -
S3FileIO does not raise exception when failing to drop/purge data
#12420 commented on
Mar 5, 2025 • 0 new comments -
Partition stats task tracker
#8450 commented on
Mar 7, 2025 • 0 new comments -
Iceberg Glue Concurrent Update can result in missing metadata_location
#9411 commented on
Mar 9, 2025 • 0 new comments -
com.esotericsoftware.kryo.KryoException: java.lang.ClassCastException: java.lang.Integer cannot be cast to java.nio.ByteBuffer
#9738 commented on
Mar 9, 2025 • 0 new comments -
Spark Streaming Job with multiple queries MERGE INTO the same target table (Runtime file filtering is not possible)
#11094 commented on
Mar 9, 2025 • 0 new comments -
Core: HadoopFileIO to support bulk delete through the Hadoop Filesystem APIs
#10233 commented on
Mar 5, 2025 • 0 new comments -
GCP: Add Iceberg Catalog for GCP BigQuery Metastore
#11039 commented on
Mar 9, 2025 • 0 new comments -
Core, Spark: Refactor FileRewriter interface to separate planning and execution
#11513 commented on
Mar 8, 2025 • 0 new comments -
Spark: 4.0 snapshot support
#11583 commented on
Mar 6, 2025 • 0 new comments -
Flink: Replace use of deprecated methods
#11658 commented on
Mar 5, 2025 • 0 new comments -
Reduce code duplication in VectorizedParquetDefinitionLevelReader
#11661 commented on
Mar 6, 2025 • 0 new comments -
Open-API: Fix compilation errors in generated Java classes due to mismatched return types
#11806 commented on
Mar 9, 2025 • 0 new comments -
Core: Bulk deletion in RemoveSnapshots
#11837 commented on
Mar 3, 2025 • 0 new comments -
Ignore partition fields that are dropped from the current-schema
#11868 commented on
Mar 6, 2025 • 0 new comments