Cuda is slower than cpu #17
Comments
@dhkdnduq, can you check here? 0. Be sure that the model and the images are on the GPU.
@dhkdnduq, try to reduce the forward pass by cutting out the particular layers you don't need.
@DeepKnowledge1 Thanks for replying. I think it's because of OpenMP; the GPU adds little on top of that parallelism.
(Python and C++ code snippets omitted)
Hello, could I borrow your LibTorch code for PaDiM? Thank you very much indeed.
gpu: RTX 3090
cpu: i5-10400F (6 cores)
with OpenMP
Most of the remaining code is the same; only the Mahalanobis code differs.
This started because the C++ port of the project to LibTorch (CUDA) turned out slower than Python's NumPy.
Timings from image preprocessing through the Mahalanobis loop:
1. OpenCV (CPU): 1.5~2 sec
*GpuMat is not yet supported.
2. LibTorch (CUDA): 0.4~0.45 sec
3. LibTorch (CPU): 0.25~0.35 sec
4. Eigen (CPU): 0.2~0.25 sec
I hope this helps.