Electronic – How are phase values able to capture motion from video

fourierfrequencyphasephasorsignal processing

I know that the phase spectrum contains most of the structural information about the image.
But I want to know more about importance of phase spectrum related to video signals.

I have read that temporal variations of phase values are able to capture most of the dynamical characteristics of the video sequence like global motion in the video.but i don't understand how it does?

please consider any real time example say video of rotating wheel or wave (or video of traffic on road.) if i compute its phase spectrum using Fourier Transform,the phase values captures motion of rotating wheel (or complex motion of moving car on road). but i don't understand how it does?By which property of Fourier transform could u explain it to me?

also is there any mathematical relation between motion and phase? please correct if i am getting wrong somewhere.

Best Answer

To understand the effect first consider the simple case of a one dimensional line of length TwoPi. Along that line we consider values of a simple cosine wave of unit amplitude and frequency. When we take the FT of the cosine signal that is spaced along the line, we get a value of 1 for the cosine coefficient of spatial frequency 1. The sine coefficient for spatial frequency 1, along with all other spatial frequency components, should remain zero. The phase vector for the fundamental frequency cosine wave will lie initially along the +x axis.

As the cosine wave is shifted sideways in space, the phase of the spatial frequency component will sweep in a circle from the +cosine through the +sine, then to -cosine, through -sine and back to +cosine. In effect the phasor rotates once each time the wave is moved sideways by one spatial period. The direction of spatial frequency vector rotation is decided by the direction of spatial movement. (This is the "fourier shift theorem" at work).

If the whole of a complex pattern moves sideways the phase of the fundamental will change at a rate proportional to the rate of image movement. Small movements may show up better in higher harmonics, but the highest harmonics will look like noise as the image content will change significantly with larger movements, (unless the panorama wraps around).

A small object that crosses a large fixed background will cause only a small difference in the real cos(1) and imaginary sin(1) coefficients. The point of the phase vectors will move in a small circle due to the small contribution of the part of the image that moves. If you plot all the phase vectors on an Argand diagram then as the image pans, you will see the entire constellation of phase vectors rotating about the centre. But if only a small object moves across the background you will see all the phase vectors rotate in small circles about the tips of their average background values. The rate of rotation will be proportional to the spatial frequency.

The principle of superposition does not usually apply to spatial images because an object that moves against the background does not sum to the background, it replaces background with an object. In effect the moving object removes other information temporarily while substituting it's own. Likewise, when a camera pans, information is lost on one side of the image as new information appears on the other.

So it is easy to detect a transverse movement through phase, but a rotating wheel is hard to detect using phase in a 2D spatial transform, unless it rolls across the image.

Related Topic