Loss goes to NaN #5

markus-hinsche · 2021-02-19T09:58:12Z

For a regression task, I am using a mid-size CNN consisting of Conv and MaxPool layers in the first layers and Dense layers in the last layers.

This is how I integrate the evidential loss (Before I used MSE loss):

optimizer = tf.keras.optimizers.Adam(learning_rate=7e-7)
def EvidentialRegressionLoss(true, pred):
    return edl.losses.EvidentialRegression(true, pred, coeff=CONFIG.EDL_COEFF)
model.compile(
    optimizer=optimizer,
    loss=EvidentialRegressionLoss,
    metrics=["mae"]
)

This is how I integrated the layer DenseNormalGamma:

    # lots of ConvLayers
    model.add(layers.Conv2D(filters=256, kernel_size=(3, 3), padding="same", activation="relu"))
    model.add(layers.Conv2D(filters=256, kernel_size=(3, 3), padding="same", activation="relu"))
    model.add(layers.MaxPooling2D(pool_size=(2, 2)))
    model.add(layers.Flatten())
    model.add(layers.Dense(1024, activation="relu"))
    model.add(layers.Dense(128, activation="relu"))

    model.add(edl.layers.DenseNormalGamma(1))  # Instead of Dense(1)

    return model

Here is the issue I am facing:

Before introducing evidential-deep-learning I used 0.0007=7e-4 as a learning rate that worked well.
Now I get loss=NaN with this learning rate, also if I make it smaller (7e-7) I get loss=NaN, mostly already in the very first epoch of training
If I set the learning rate ridiculously low (7e-9) I don't get NaN but of course the network is not learning fast enough

Is there any obvious mistake I make? Any thoughts and help appreciated

The text was updated successfully, but these errors were encountered:

wanzysky · 2021-04-15T11:38:28Z

This is maybe because of

evidential-deep-learning/evidential_deep_learning/losses/continuous.py

Line 35 in 7a22a2c

- alpha*tf.math.log(twoBlambda) \

, where the log is not safe.

Bunnybeibei · 2024-10-09T06:31:16Z

So hou

This is maybe because of

evidential-deep-learning/evidential_deep_learning/losses/continuous.py

Line 35 in 7a22a2c

- alpha*tf.math.log(twoBlambda) \

, where the log is not safe.

I have met the same problem, could you tell me how to solve it？

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Loss goes to NaN #5

Loss goes to NaN #5

markus-hinsche commented Feb 19, 2021

wanzysky commented Apr 15, 2021 •

edited

Loading

Bunnybeibei commented Oct 9, 2024

Loss goes to NaN #5

Loss goes to NaN #5

Comments

markus-hinsche commented Feb 19, 2021

wanzysky commented Apr 15, 2021 • edited Loading

Bunnybeibei commented Oct 9, 2024

wanzysky commented Apr 15, 2021 •

edited

Loading