You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
As a Data steward, I'd like to tag and add relationships like upstream to fields of repeated structs. These kind of tables are very common in big data because they denormalize two tables with a "has many" relationship in a single table because of performance reasons. Nevertheless governance and discoverability is still a requirement for them and OpenMetadata should support them like it supports traditional flatten or nested without repeated tables.
reacted with thumbs up emoji reacted with thumbs down emoji reacted with laugh emoji reacted with hooray emoji reacted with confused emoji reacted with heart emoji reacted with rocket emoji reacted with eyes emoji
-
Use case
As a Data steward, I'd like to tag and add relationships like
upstream
to fields of repeated structs. These kind of tables are very common in big data because they denormalize two tables with a "has many" relationship in a single table because of performance reasons. Nevertheless governance and discoverability is still a requirement for them and OpenMetadata should support them like it supports traditional flatten or nested without repeated tables.BigQuery defines a new type called
repeated
that it is equivalent toarray<struct>>
. https://medium.com/google-cloud/bigquery-explained-working-with-joins-nested-repeated-data-1941646ccb5bSolution
Here we advocate for the adoption of
repeated
in OpenMetadata.Advantages:
user.sessions.clicks.ip
, User has many sessions. Session has many clicks. A click has an IP that it is tagged as PII.Implementation details
array<struct>
torepeated
.References
Beta Was this translation helpful? Give feedback.
All reactions