-
-
Notifications
You must be signed in to change notification settings - Fork 1.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Groupby reduction with nd labels and a subset of dims returns factorized labels #9803
Comments
Hmmm... we've lost an error message somewhere in all the groupby PRs I've been pushing. Note it doesn't work at all without flox
|
As in, we should fail but we don't?
Good point... For completeness, stacking is a reasonable workaround (with or without flox)
|
Yes we should be erroring with same message in both cases.
This isn't the same! You can't apply a subset of dims without |
Ah yes, ofc
Am I right in thinking we're pretty close to the correct result though? We're just missing putting the labels on? (obviously it would still be work, reasonable to error in the meantime — but is my assessment correct that the difficult piece is working correctly?) |
Yes correct, flox supports this. Though I think I wanted consistent behaviour in both code paths that's all... |
Note this is a dupe of #1013 though this one is clearer about the end-goal. |
What happened?
When grouping by a coord which has multiple dimensions and reducing by a subset of dimensions, the returned dimensions are given as the int values
(0,1)
, rather than the actual labels.Check out MCVE below
What did you expect to happen?
No response
Minimal Complete Verifiable Example
This is as expected:
But then we get
g (g) int64 16B 0 1
if we do.sum('a')
:Notably, removing some of the conditions make it work fine:
MVCE confirmation
Relevant log output
No response
Anything else we need to know?
Happens both on current and earlier versions of xarray — doesn't seem like a new thing with the recent groupby changes
Environment
INSTALLED VERSIONS
commit: 339ed93
python: 3.11.10 (main, Sep 7 2024, 01:03:31) [Clang 15.0.0 (clang-1500.3.9.4)]
python-bits: 64
OS: Darwin
OS-release: 23.6.0
machine: arm64
processor: arm
byteorder: little
LC_ALL: en_US.UTF-8
LANG: None
LOCALE: ('en_US', 'UTF-8')
libhdf5: 1.14.3
libnetcdf: 4.9.2
xarray: 2024.9.1.dev32+gece582dd
pandas: 2.2.2
numpy: 2.0.2
scipy: 1.14.1
netCDF4: 1.7.1.post2
pydap: None
h5netcdf: 1.3.0
h5py: 3.11.0
zarr: 2.18.3
cftime: 1.6.4
nc_time_axis: 1.4.1
iris: None
bottleneck: 1.4.0
dask: 2024.8.2
distributed: 2024.8.2
matplotlib: 3.9.2
cartopy: None
seaborn: 0.13.2
numbagg: 0.8.1
fsspec: 2024.9.0
cupy: None
pint: None
sparse: None
flox: 0.9.12
numpy_groupies: 0.11.2
setuptools: 69.2.0
pip: 24.0
conda: None
pytest: 8.3.3
mypy: 1.11.2
IPython: 8.24.0
sphinx: None
The text was updated successfully, but these errors were encountered: