I am wondering in a single frame grab
cap >> frame
If I am using Logitech C920 which has its own hardware encoder, and I am grabbing 720p image, Am I getting 12807203=921600*3=2700000 bytes from the USB port directly?
Or I am getting a compressed image out first and decode it into 2700000bytes?