NVIDIA CUDA Toolkit 5.0 Visual Profiler “Enable concurrent kernels profiling” application requirements
I'm using CUDA Toolkit 5.0 because I need to use the new feature of the NVIDIA Visual Profiler of this Toolkit that allows to view in the timeline concurrent kernels executed asynchronously (this is not possible with the CUDA Toolkit 4.2). For this reason, I built (succesfully) the source code of OpenCV 2.4.2 with this Toolkit (5.0) installed on my pc, and I'm able to compile and execute correctly my application with concurrent kernels: some of them are invoked by functions of the module OpenCV_GPU and others are kernels I directly wrote in CUDA. Unfortunately, CUDA 5.0 NVIDIA Visual Profiler can't trace the timeline of my application if I enable the feature: "Enable concurrent kernels profiling". It creates the timeline correctly both for code written using ONLY OpenCV functions and for code written using ONLY CUDA functions. Indeed, it stops working when I mix the two in the same application . I think this may be caused by the fact that OpenCV calls should use the same CUDA Context as the rest of the CUDA code. How can I manage the CUDA Context in order to allow the Profiler to trace the timeline?
OS: Windows 7 64 bit; Compiler: Visual Studio 2010 Professional; Driver: 306.23; Device: GeForce GTX 680 or GeForce GT 650M
Thank you for your attention!