1 | initial version |
That is correct behavior. Most of the OpenCV codebase uses Intel TBB and parallel_for() to stripe matrix operations to make use of multiple cores. Most of the algorithms for histogram of gradients can be parallelized which is why it runs so much faster.