
Capture whether a Tool implements an ensemble strategy #259

Open
tschaffter opened this issue Sep 5, 2021 · 3 comments
Labels: Enhancement (New feature or request), Priority: Low

Comments

@tschaffter
Member

tschaffter commented Sep 5, 2021

This proposal comes after reading the article On the “usefulness” of the Netflix Prize, which highlights that complicated ensemble methods may not be suitable for production-ready applications. We experienced a similar situation with the Digital Mammography DREAM Challenge, where the final method was an ensemble of 11 tools/docker images. This strategy is adopted in most DREAM challenges, where the final published model is an ensemble of the best-performing models submitted during the competitive or collaborative phase.

There are two bits of information that we may want to capture, possibly as properties of the Tool schemas (a rough sketch follows the list below):

  • Whether the "tool" - the submitted docker image - is an ensemble of the outputs of multiple algorithms.
    • This may also flag tools that may take a long time to run, though we will ultimately capture and report on tool runtime in the future.
  • Whether the submitted tool can be trained, re-trained and/or fine-tuned.
    • Distinguish between a tool that has not yet been trained and must be trained before being used for inference, and a tool that has already been trained and can be further re-trained or fine-tuned, for example periodically on new data.
    • Enabling submitted tools to train on private data from data sites should be possible before the end of 2021.
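
To make this a bit more concrete, here is a rough, non-normative sketch of how the two properties could be modeled. The names (is_ensemble, training_state, TrainingState) and the enum values are placeholders for illustration only, not part of the current Tool schema:

```python
# Hypothetical sketch only: property names and values are placeholders,
# not part of the existing NLP Sandbox Tool schema.
from dataclasses import dataclass
from enum import Enum


class TrainingState(str, Enum):
    UNTRAINED = "untrained"      # must be trained before it can run inference
    PRETRAINED = "pretrained"    # already trained, usable for inference as-is
    RETRAINABLE = "retrainable"  # already trained and can be re-trained/fine-tuned


@dataclass
class ToolTrainingMetadata:
    # True if the submitted docker image ensembles the outputs of multiple algorithms
    is_ensemble: bool
    training_state: TrainingState
```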
tschaffter added the Enhancement (New feature or request) and Priority: Low labels on Sep 5, 2021
@tschaffter
Member Author

tschaffter commented Sep 6, 2021

Curious to hear your thoughts on the additional tool metadata mentioned above. @cascadianblue @yy6linda @ymnliu @chepyle

@tschaffter
Member Author

I'm wondering if there are existing schemas that describe the "trainability"/training state of a model that we could reuse for the NLP Sandbox Tool schema.

@boyleconnor
Contributor
