[BUG: https://mistral.ai/news/mistral-large-2407/] Are there relevant papers, and what metrics are used to evaluate each dataset? For example, is the evaluation metric for MultiPL-E pass@1? #235

886gb · 2024-12-06T08:49:01Z

Are there relevant papers, and what metrics are used to evaluate each dataset? For example, is the evaluation metric for MultiPL-E pass@1?

No response
