nan loss for shallow networks #2670

Open
numisveinsson opened this issue Jan 18, 2025 · 0 comments
@numisveinsson
When deep supervision is toggled on and the network has only two layers (as happens for smaller patched datasets), the deep supervision weights end up being np.array([0]). When they get normalized, they become NaN, which leads to NaN loss values. Should be easy to fix here.
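
Not the project's actual code, but a minimal sketch of the weighting scheme described above (the function names and the guard are hypothetical); it reproduces the symptom and shows one possible way to avoid the division by zero:

```python
import numpy as np

def deep_supervision_weights(num_outputs: int) -> np.ndarray:
    # each lower-resolution output gets half the weight of the one above it
    weights = np.array([1 / (2 ** i) for i in range(num_outputs)])
    # the lowest-resolution output is excluded from the loss
    weights[-1] = 0
    # normalize so the remaining weights sum to 1
    return weights / weights.sum()

def deep_supervision_weights_fixed(num_outputs: int) -> np.ndarray:
    weights = np.array([1 / (2 ** i) for i in range(num_outputs)])
    if num_outputs > 1:
        # only drop the lowest-resolution output when there is more than one
        weights[-1] = 0
    return weights / weights.sum()

# shallow network with a single supervised output:
# weights start as [1.], get zeroed to [0.], and 0 / 0 yields NaN
print(deep_supervision_weights(1))        # [nan] (numpy also warns about 0/0)
print(deep_supervision_weights(4))        # [0.571, 0.286, 0.143, 0.] -- fine
print(deep_supervision_weights_fixed(1))  # [1.] -- shallow case stays finite
```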

Thank you!

Numi Sveinsson
numisveinsson.com
