This repository has been archived by the owner on Aug 23, 2022. It is now read-only.
On 14 Nov 2020, at 13:42, zjin-lcf ***@***.***> wrote:
Dumping the output of the adv benchmark shows that the results differ significantly from run to run. Can you reproduce that?
Thanks
Even when a fixed seed is used to initialize the input data, the outputs still differ significantly from run to run.
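One way to put a number on "differ significantly" is to compare two dumped output arrays element by element. The following is a minimal sketch, not part of the benchmark: the function names, the `double` element type (the benchmark's own floating-point type may differ), and the tolerance `tol` are all assumptions made for illustration.

```c
#include <stddef.h>

/* Largest absolute elementwise difference between two output dumps.
   Hypothetical helper; NaN entries compare false and are skipped here. */
double max_abs_diff(const double *run_a, const double *run_b, size_t n) {
  double worst = 0.0;
  for (size_t i = 0; i < n; ++i) {
    double d = run_a[i] - run_b[i];
    if (d < 0.0) d = -d;          /* absolute value without math.h */
    if (d > worst) worst = d;
  }
  return worst;
}

/* Number of entries whose absolute difference exceeds tol
   (tol is an assumed, user-chosen tolerance). */
size_t count_mismatches(const double *run_a, const double *run_b,
                        size_t n, double tol) {
  size_t count = 0;
  for (size_t i = 0; i < n; ++i) {
    double d = run_a[i] - run_b[i];
    if (d < 0.0) d = -d;
    if (d > tol) ++count;
  }
  return count;
}
```

With identical seeded inputs, a deterministic kernel would be expected to give a maximum difference at or near rounding level, so a large value from `max_abs_diff` would point to a race or an out-of-range access rather than to the input data.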
In addition, do you think the index range for s_cubInterpT in the snippet below is right? I am not familiar with your domain. Could you please provide a CPU reference implementation for comparison and verification?
for(int j = 0; j < p_cubNq; ++j; @inner(1))
  for(int i = 0; i < p_cubNq; ++i; @inner(0)) {
    const int id = i + j * p_cubNq;
    if (id < p_Nq * p_cubNq)
      // s_cubInterpT[0][id]: id should range from 0 to p_cubNq, not from 0 to p_Nq * p_cubNq
      s_cubInterpT[0][id] = cubInterpT[id];
  }
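To make the indexing question concrete, here is a small host-side C sketch. It is not taken from the benchmark: it assumes the shared array is declared row-major as `s_cubInterpT[p_Nq][p_cubNq]`, and `P_NQ`, `P_CUBNQ`, and the helper names are made-up values for illustration. It shows which (row, column) pair a flat `id` corresponds to, and how many values of `id` in `[0, p_Nq * p_cubNq)` fall outside row 0.

```c
/* Hypothetical sizes for illustration only; the real p_Nq and p_cubNq
   are compile-time parameters of the benchmark. */
enum { P_NQ = 3, P_CUBNQ = 4 };

/* Row a flat index lands in, assuming a row-major [P_NQ][P_CUBNQ] layout. */
int row_of(int id) { return id / P_CUBNQ; }

/* Column a flat index lands in under the same assumption. */
int col_of(int id) { return id % P_CUBNQ; }

/* Number of flat indices in [0, P_NQ * P_CUBNQ) that lie outside row 0. */
int count_outside_row0(void) {
  int n = 0;
  for (int id = 0; id < P_NQ * P_CUBNQ; ++id)
    if (row_of(id) != 0) ++n;
  return n;
}

/* Copy with explicit (row, col) indices so each access stays within
   the declared bounds of both dimensions. */
void copy_2d(double dst[P_NQ][P_CUBNQ], const double *src) {
  for (int id = 0; id < P_NQ * P_CUBNQ; ++id)
    dst[row_of(id)][col_of(id)] = src[id];
}
```

Under these assumptions, writing `s[0][id]` for `id >= p_cubNq` only reaches the intended element if the compiler lays the rows out contiguously and does not act on the row-0 bound; indexing as `s[id / p_cubNq][id % p_cubNq]` avoids relying on that.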