Hello all,
The algorithm implementing the bilateral filter is the old-fashioned pixel-based version of Banduchi and Tomasi.
Nowadays, there are much faster algorithms for the bilateral filter.
If someone answers, I can give her/him some references and even plain C implementations of those algorithms for creating a fork or whatever.
I don't do it myself because I'm not so good at efficient programing.
Cheers!