Not sure if a bug or not... #37
jamestwebber started this conversation in General
Replies: 2 comments 3 replies
I opened #35 because I thought I spotted a bug in the weight standardization code, but @vballoli says it's fine, so I'm opening a discussion (which I never knew existed!) to figure it out.
The code computes the fan-in from `shape[0:]`. This was suspicious to me, as `shape[0:]` just makes a needless copy of `shape`, so `fan_in` comes out as the size of the entire tensor. The code in the deepmind repository reads `shape[:-1]`, which means `fan_in` is the product over all but the last dimension, which makes more sense to me. Maybe I am missing a `pytorch` vs `jax` implementation difference? What's the reason for the discrepancy?
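To make the discrepancy concrete, here is a small sketch; the shapes are hypothetical, chosen only to show how each slicing convention interacts with each framework's weight layout:

```python
import numpy as np

# Hypothetical 3x3 convolution mapping 32 channels to 64 channels.
oihw = (64, 32, 3, 3)  # PyTorch layout: (out, in, kH, kW)
hwio = (3, 3, 32, 64)  # JAX/Haiku layout: (kH, kW, in, out)

# shape[0:] is just the whole shape, so its product is the total
# element count of the weight tensor, not a fan-in.
print(np.prod(oihw[0:]))   # 18432

# The deepmind code's shape[:-1] drops the trailing output-channel
# axis of the HWIO layout, leaving the true fan-in.
print(np.prod(hwio[:-1]))  # 288 = 3 * 3 * 32

# On PyTorch's OIHW layout, the same fan-in comes from dropping the
# leading output-channel axis instead.
print(np.prod(oihw[1:]))   # 288
```

If that layout reading is right, neither `shape[0:]` nor a literal `shape[:-1]` gives the fan-in for a PyTorch weight, which is what the replies below get at.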
-
Thanks for raising this! I misinterpreted the issue yesterday, sorry about that. I've fixed the fan-in. Do let me know if these implementations match.
-
I don't think the weight shape is the same as in Jax. See https://github.com/rwightman/pytorch-image-models/blob/4ea593196414684d2074cbb81d762f3847738484/timm/models/layers/std_conv.py#L79.
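For reference, here is a minimal sketch of weight standardization over PyTorch's OIHW layout, in the spirit of the linked timm code and the deepmind formula; it is an illustrative reimplementation under those assumptions (the class name and epsilon are placeholders), not a copy of either codebase:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class WSConv2d(nn.Conv2d):
    """Illustrative weight-standardized conv for OIHW weights."""

    def forward(self, x):
        w = self.weight                # (out, in, kH, kW)
        fan_in = w[0].numel()          # in * kH * kW, i.e. prod(shape[1:])
        # Standardize each output channel's filter, then scale by the
        # fan-in, mirroring (w - mean) / sqrt(var * fan_in + eps).
        mean = w.mean(dim=(1, 2, 3), keepdim=True)
        var = w.var(dim=(1, 2, 3), keepdim=True, unbiased=False)
        w = (w - mean) / torch.sqrt(var * fan_in + 1e-4)
        return F.conv2d(x, w, self.bias, self.stride,
                        self.padding, self.dilation, self.groups)
```

Because the output-channel axis leads in OIHW but trails in HWIO, `w[0].numel()` (i.e. `prod(shape[1:])`) plays the role here that `prod(shape[:-1])` plays in the JAX code.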