
Make Binary cross entropy with logit numerically stable for high logit values #2562

Open · wants to merge 2 commits into base: main

Commits on Oct 14, 2024

  1. Fix documentation

    The documentation of binary_cross_entropy_with_logit says that it
    expects the target to be of type usize, which is wrong and yields a
    runtime error due to a dtype mismatch in the multiplication step; the
    target has to match the input's float dtype instead (a minimal usage
    sketch follows the commit list).
    BeneSim committed Oct 14, 2024 (712adde)
  2. Fix NaN loss for sigmoid(x) == 1

    In the current implementation of binary_cross_entropy_with_logit the
    loss becomes NaN because log(0) is evaluated: a high logit saturates
    the sigmoid at 1.0, the affine transformation maps that to 0.0, and
    the subsequent log yields NaN:
    
    inp.affine(-1., 1.)?.log()?
    ^      ^              ^
    |      |              |
    1.0    |              |
           0.0            |
                          NaN
    
    The proposed implementation is taken more or less directly from
    PyTorch:
    https://github.com/pytorch/pytorch/blob/41977a05314bbf537e1c5d6cf5916a368d1907d9/aten/src/ATen/native/Loss.cpp#L362
    (a sketch of this formulation appears after the commit list).
    BeneSim committed Oct 14, 2024 (fa07a09)
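
For the documentation fix, a minimal usage sketch, assuming candle's candle_nn::loss::binary_cross_entropy_with_logit (the function this PR touches); the key point is that the target tensor is created with the same float dtype as the logits rather than usize:

    use candle_core::{Device, Tensor};
    use candle_nn::loss::binary_cross_entropy_with_logit;

    fn main() -> candle_core::Result<()> {
        let dev = Device::Cpu;
        // Raw logits, not probabilities; the loss applies the sigmoid itself.
        let logits = Tensor::new(&[0.5f32, -1.0, 2.0], &dev)?;
        // The target must be a float tensor matching the logits' dtype;
        // a usize target fails at runtime with a dtype mismatch when the
        // two tensors are multiplied inside the loss.
        let target = Tensor::new(&[1.0f32, 0.0, 1.0], &dev)?;
        let loss = binary_cross_entropy_with_logit(&logits, &target)?;
        println!("{loss}");
        Ok(())
    }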
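
For the NaN fix, a minimal sketch of the numerically stable log-sum-exp rewrite used by the linked PyTorch code, under a hypothetical helper name bce_with_logits_stable; this illustrates the technique rather than reproducing the exact diff:

    use candle_core::{Result, Tensor};

    // loss = max(x, 0) - x * y + ln(1 + exp(-|x|))
    // This rewrite never evaluates ln(0), so the loss stays finite even
    // when sigmoid(x) saturates at exactly 0.0 or 1.0.
    fn bce_with_logits_stable(inp: &Tensor, target: &Tensor) -> Result<Tensor> {
        let max_val = inp.relu()?;              // max(x, 0)
        let log_term = inp.abs()?.neg()?.exp()? // exp(-|x|)
            .affine(1.0, 1.0)?                  // 1 + exp(-|x|)
            .log()?;                            // ln(1 + exp(-|x|))
        let loss = ((max_val - (inp * target)?)? + log_term)?;
        loss.mean_all()                         // mean over all elements
    }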