How about add so called Wavelet algorythm for building visual audio data ? It has some advantages comparing to FFT.