I don't quite understand how the pixels are summed in each rectangle area described here: Face Detection using Haar Cascades
It says:
Each feature is a single value obtained by subtracting sum of pixels under white rectangle from sum of pixels under black rectangle.
What are summed in each region (rectangle)? The number of pixels or the intensity of all pixels in that region?