Best Practices for DataTree with Multiscale Data #9577
It's very satisfying to me to see people taking these same data management concepts and applying them to fusion 😊
Yeah that's hopefully a pre-datatree anti-pattern now.
Instead of answering directly yet, I'm going to use you as a guinea pig and point you to some upcoming documentation I wrote - PR #9501, the latest build is here. I'm hoping that reading it helps answer your question - if it doesn't, then please say so.
Nothing dedicated like that exists for datatree yet, but it could exist. In fact that sounds very similar to calling |
Mostly pinging @TomNicholas, I see `DataTree` is making it into the public API; exciting! I was hoping to get your thoughts on best practices, with a concrete fusion example.
Let's consider magnetic data. We have that at three resolutions:
We also have many shots of data. Shots tend to last between 0.5 seconds and 3 seconds, which isn't a huge range, but it is large enough that we probably shouldn't just NaN-pad the ends and make "shot" a dimension of a single array.
What kind of structure would you recommend with `DataTree`? Should we just have one node for each "shot"? Is there some built-in functionality to "transpose" a variable (i.e. go from multiple arrays of "signalX" to a single array of "signalX" with a new dimension and NaN padding to handle different lengths)?