I'm going to detect objects using a method proposed by Navneet Dalal, Bill Triggs, and Cordelia Schmid in 2006 (Human Detection Using Oriented Histograms of Flow and Appearance)
In that case they first calculate appearance descriptor using HOG algorithm and motion descriptor using Optical flow. Then they combined these two descriptors to get the final descriptor as I understand. But I couldn't find how they combined that.
So my question is how to combine appearance and motion descriptor to get the final descriptor. (I'm going to use leaner SVM for training)