Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Multispeaker WER #34

Open
mpariente opened this issue Nov 17, 2020 · 2 comments
Open

Multispeaker WER #34

mpariente opened this issue Nov 17, 2020 · 2 comments

Comments

@mpariente
Copy link
Contributor

Hi, thanks a bunch for this tool !

When working with speech mixtures, WER can take into account that words from each speaker might be picked up.
There is a description of the method here: https://my.fit.edu/~vkepuska/ece5527/sctk-2.3-rc1/doc/asclite.html

Would you be willing to integrate this feature in Jiwer?

@nikvaessen
Copy link
Collaborator

I think there are two ways of implementing this:

  1. we need a either wrap around asclite which will require shipping its binary for every platform
  2. or write a custom dynamic programming solution, which would be most likely be very slow if implemented in python, or difficult if it needs to be written in C (I don't have much if any experience in writing C and integrating it into a python application).

How would you use this feature? Are there many speech datasets which have this problem?

@mpariente
Copy link
Contributor Author

Thanks for your answer.

How would you use this feature? Are there many speech datasets which have this problem?

All datasets that include overlapping speech have this problem. Few examples: Chime5-6, AMI, wsj0-mix, Librimix. In order to evaluate speech separation algorithm, this seems to be needed.

I'd go with solution 1.
I personally wouldn't ship the binaries but link to the installation instructions. This would be an optional feature of jiwer, and the user would need to make an extra step to benefit from it. WDYT?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants