Expected chunking behaviour for idxmin / idxmax. #9421

KBodolai · 2024-09-02T16:15:47Z

KBodolai
Sep 2, 2024

Hi there! I have a question about the chunking behaviour when using idxmin / idxmax for a chunked array.

What is the expected behaviour for the chunks after we run idxmin over one of the dimensions? Naively I'd expect it to keep the chunks along the other dimensions, but that doesn't seem to be what happens: (Example below with time, x, y)

import numpy as np
import xarray as xr

# create some dummy data and chunk
x, y, t = 1000, 1000, 57
rang = np.arange(t*x*y)
da = xr.DataArray(rang.reshape(t, x, y), coords={'time':range(t), 'x': range(x), 'y':range(y)})
da = da.chunk(dict(time=-1, x=256, y=256))

Now when I look at the array, it looks something like this:

da.idxmin('time')

But after doing idxmin I get the outputs below

My understanding is that it seems to be trying to keep the size of chunks small, But oddly, when we do it for floats:

da = da.astype('float32')

before and after doing the idxmin looks like this:

Is this the expected behavour for this operation? I'm guessing the reshaping in the source code happens here, but I haven't been able to figure out how yet.

Thanks!
K.

Answered by KBodolai

Nov 21, 2024

Just for context for people finding this in the future, @dcherian picked this up almost immediately (thanks!) in #9425 , which I didn't realise until know.

(marking as answered since there's a PR about it)

View full answer

KBodolai · 2024-11-21T09:52:05Z

KBodolai
Nov 21, 2024
Author

Just for context for people finding this in the future, @dcherian picked this up almost immediately (thanks!) in #9425 , which I didn't realise until know.

(marking as answered since there's a PR about it)

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Expected chunking behaviour for idxmin / idxmax. #9421

{{title}}

{{editor}}'s edit

{{editor}}'s edit

Replies: 1 comment

{{title}}

{{editor}}'s edit

{{editor}}'s edit

Select a reply

Expected chunking behaviour for idxmin / idxmax. #9421

KBodolai Sep 2, 2024

Replies: 1 comment

KBodolai Nov 21, 2024 Author

KBodolai
Sep 2, 2024

KBodolai
Nov 21, 2024
Author