Trying to do object detection by Halide(tensorflow ssd_mobilenet_v1_coco ), before I set Halide as backend, fps is around 22~23, cpu rate is around 70~80%, memory usage of gpu ram is around 431MiB(monitor by nvidia-smi)
After I turned on Halide,
net.setPreferableBackend(cv::dnn::DNN_BACKEND_HALIDE);
fps drop down to 6~10fps, usage of cpu and gpu ram still similar to the case without Halide.
I intent to leverage gpu power to speed things up and lower the burden of cpu, but looks like Halide cannot speed things up nor lower the cpu usage.Any suggestion to speed things up by Halide? Thanks