Percentage of correct segments: Ignore silences between words optionally #13

f90 · 2019-07-01T12:57:05Z

Percentage of correct segments is currently computed by also checking the overlap between predicted and actual silence durations between words. However, the Mauch and Jamendo datasets have no annotated word offsets, so checking silences makes no sense in this case.

I had to convert my predictions so that each word's offset equals the onset time of the next word to ensure the metric is calculated correctly in this case, but it feels wrong to have to throw away the model's predicted offsets for this.

It would be better to have a flag in the evaluation code that optionally activates/deactivates checking the silence segments so you don't have to change the predictions you feed in.
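As a rough illustration of what such a flag could look like, here is a minimal sketch. It is not the repository's actual evaluation code: the function name `percentage_correct_segments`, the `check_silence` parameter, and the choice to normalize by total reference word duration when silences are ignored are all assumptions made for this example.

```python
from typing import List, Tuple

Interval = Tuple[float, float]  # (onset, offset) in seconds


def _overlap(a: Interval, b: Interval) -> float:
    """Length of the intersection of two intervals (0.0 if disjoint)."""
    return max(0.0, min(a[1], b[1]) - max(a[0], b[0]))


def _silences(words: List[Interval], total_duration: float) -> List[Interval]:
    """Gaps before the first word, between words, and after the last word."""
    bounds = [0.0]
    for onset, offset in words:
        bounds.extend([onset, offset])
    bounds.append(total_duration)
    return [(bounds[i], bounds[i + 1]) for i in range(0, len(bounds), 2)]


def percentage_correct_segments(
    ref: List[Interval],
    pred: List[Interval],
    total_duration: float,
    check_silence: bool = True,  # hypothetical flag, not an existing option
) -> float:
    """Fraction of time on which reference and prediction agree."""
    if len(ref) != len(pred):
        raise ValueError("ref and pred must contain the same number of words")

    # Time on which the i-th predicted word overlaps the i-th reference word.
    correct = sum(_overlap(r, p) for r, p in zip(ref, pred))

    if check_silence:
        # Also credit matching silences and normalize by the full duration.
        ref_sil = _silences(ref, total_duration)
        pred_sil = _silences(pred, total_duration)
        correct += sum(_overlap(r, p) for r, p in zip(ref_sil, pred_sil))
        return correct / total_duration

    # Assumption: with silences ignored, normalize by reference word time only.
    word_time = sum(offset - onset for onset, offset in ref)
    return correct / word_time
```

With this shape, predictions could be passed in unchanged, and `check_silence=False` would be chosen for datasets that lack annotated word offsets.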

georgid · 2019-10-23T18:42:38Z

Correct: by design, the percentage of correct segments metric needs the offset of every word.

A dataset without word-offset timestamps is practically not suited for this metric. The workaround of using the next word's onset as the offset is feasible, but we have to be careful with the offset of the last word before a non-vocal segment, e.g. an instrumental interlude.

Indeed, the evaluation code currently applies this workaround itself.
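For reference, the next-onset workaround, combined with manually supplied offsets for words that precede a non-vocal segment, could be sketched roughly as follows. The helper `fill_offsets` and its `known_offsets` argument are hypothetical names for this example and do not reflect the existing evaluation code.

```python
from typing import Dict, List, Tuple


def fill_offsets(
    onsets: List[float],
    known_offsets: Dict[int, float],  # hypothetical: word index -> annotated offset
) -> List[Tuple[float, float]]:
    """Build (onset, offset) pairs when most offsets are not annotated.

    A word's offset defaults to the onset of the next word. Words listed in
    known_offsets (e.g. the last word before an instrumental interlude) use
    the manually annotated offset instead.
    """
    intervals = []
    for i, onset in enumerate(onsets):
        if i in known_offsets:
            offset = known_offsets[i]
        elif i + 1 < len(onsets):
            offset = onsets[i + 1]
        else:
            # The final word has no next onset to borrow from.
            raise ValueError("the last word needs an annotated offset")
        intervals.append((onset, offset))
    return intervals
```

For example, `fill_offsets([0.0, 1.0, 5.0], {1: 2.0, 2: 6.0})` keeps the gap between 2.0 s and 5.0 s as silence instead of stretching the second word across it.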

This should be changed by either manually annotating the offset of the last word in a section or skipping this metric altogether.
@f90
