
About the ot_loss: why optimize the dual term's derivative instead of the OT distance? #29

Open
Lynyanyu opened this issue May 4, 2022 · 1 comment


Lynyanyu commented May 4, 2022

Why is the optimization target set to the derivative of the dual term "<β*, ẑ>" times the prediction, instead of the original OT distance? It would make sense to optimize the full OT loss term "W(z, ẑ)", or its dual "<β*, ẑ>", to force the dot regression to be sparser and more accurate, but why the derivative? Is this mentioned in the paper or the supplementary material?


henvh commented Jan 24, 2024

I actually had the same question. Also, why is the distance matrix for the OT computation defined that way?
