Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Meaning of "samples/sec" for DLRM offline #3

Open
calvinqi opened this issue Oct 26, 2020 · 2 comments
Open

Meaning of "samples/sec" for DLRM offline #3

calvinqi opened this issue Oct 26, 2020 · 2 comments

Comments

@calvinqi
Copy link

calvinqi commented Oct 26, 2020

Hi! I was looking at the results in the table here (https://mlperf.org/inference-results-0-7/) and had a question about the meaning of the unit for DLRM offline samples/s.

Is one "sample" taken to be a single data point, i.e. input features for (user, item, metadata) to generate one single prediction probability, or are they somehow batched according to a distribution like this one https://github.com/mlperf/inference/blob/master/recommendation/dlrm/pytorch/tools/dist_quantile.txt? I'm a little confused on the terminology and what's being measured in these benchmarks. Some clarification would be amazing, thank you!

@TheKanter
Copy link
Contributor

Clarifications here: mlcommons/inference_policies#203

@cayleyhamilton
Copy link

The term 'user-item pair' is not defined. The definition of sample hinges on this and even the Ad Criteo dataset does not explicitly define it. Could a definition of 'user-item pair' be added? Thanks.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants