Jump to content

Optical transfer function

From Wikipedia, the free encyclopedia

This is an old revision of this page, as edited by Memestream (talk | contribs) at 23:35, 25 April 2011 (Oversampling and Downconversion to maintain MTF: fix wikilink ccd). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

The optical transfer function (OTF) of an imaging system (camera, video system, microscope etc) is the true measure of resolution (image sharpness) that the system is capable of. The common practice of defining resolution in terms of pixel count is not meaningful, as it is the overall OTF of the complete system, including lens and anti-aliasing filter as well as other factors, that defines true performance. In the most common applications (cameras and video systems) it is the Modulation Transfer Function (the magnitude of the OTF), that is most relevant, although the phase component can have a secondary effect. While resolution, as commonly used with reference to camera systems, describes only the number of pixels in an image, and hence the potential to show fine detail, the transfer function describes the ability of adjacent pixels to change from black to white in response to patterns of varying spatial frequency, and hence the actual capability to show fine detail, whether with full or reduced constrast. An image reproduced with an optical transfer function that 'rolls off' at high spatial frequencies will appear 'blurred' in everyday language. Modulation Transfer Function or MTF (the OTF magnitude with phase ignored) is roughly the equivalent of frequency response in an audio system, and can be represented by a graph of light amplitude (brightness) versus spatial frequency (cycles per picture width).

Example

Taking the example of a current High Definition video system, with 1920 by 1080 pixels, Nyquists theorem says that it should be possible, in a perfect system, to resolve fully (with true black to white transitions) nearly 1920 alternate black and white lines, otherwise referred to as a spatial frequency of 960 line pairs per picture width, or 960 cycles per picture width, (definitions in terms of cycles per unit angle or per mm are also possible but generally less clear when dealing with cameras and more appropriate to telescopes etc). In practice this is far from the case, and spatial frequencies that approach the Nyquist rate will generally be reproduced with decreasing amplitude, so that fine detail, though it can be seen, is greatly reduced in contrast. This gives rise to the interesting observation that, for example, a standard definition television picture derived from a film scanner that uses oversampling, as described later, may appear sharper than a high definition picture shot on a camera with a poor Modulation Transfer Function. The two picture show an interesting difference that is often missed, the former having full contrast on detail up to a certain point but then no really fine detail, while the latter does contain finer detail, but with such reduced contrast as to appear inferior overall.

Factors affecting MTF in typical camera systems

In practice, many factors result in considerable blurring of a reproduced image, such that patterns with spatial frequency just below the Nyquist rate may not even be visible, and the finest patterns that can be seen appear 'washed out' as shades of grey, not black and white. A major factor is usually the impossibility of making the perfect 'brick wall' optical filter (often realised as a 'phase plate' or a lens with specific blurring properties in digital cameras and video camcorders). Such a filter is necessary to reduce aliasing by eliminating spatial frequencies above the Nyquist rate, but in practice it will have a response that 'rolls off' seriously before the Nyquist frequency is reached.

Oversampling and Downconversion to maintain MTF

For this reason, the only way in practice to approach the theoretical sharpness possible in a digital imaging system such as a camera is to use more pixels than the final image, and 'downconvert' or 'interpolate' using special digital processing which cuts of high frequencies above the Nyquist rate to avoid aliasing whilst maintaining a resonably flat MTF up the that frequency. This approach was first taken in the 1970s when flying spot scanners, and later CCD line scanners, were developed which sampled more pixels than were needed and then 'downconverted', which is why movies have always looked sharper on television than other material shot with a video camera. The only theoretically correct way to interpolate or downconvert is by use of a steep low-pass spatial filter, realised by convolution with a two-dimensional sinx/x weighting function which requires powerful processing. In practice, various mathematical approximations to this are used to reduce the processing requirement. These appproximations are now implemented widely in video editing systems and in image processing programs such as Photoshop.

Just as standard definition video with a flat MTF is only possible with oversampling, so HD television with full theoretical sharpness is only possible by starting with a camera that has at least twice as many pixels, and then digitally filtering. With movies now being shot in 4k and even 8k video for the cinema, using cameras like the Red, we can expect to see the best pictures on HDTV only from movies or material shot at the higher standard. However much we raise the number of pixels used in cameras, this will always remain true (unless a perfect optical spatial filter can be devised), and the same problem exists of course with stills cameras, where a better image can be expected when, say, a 10 megapixel image is converted to a 5 megapixel image, than could ever be obtained from a even the best 5 megapixel camera. Because of this problem of maintaining a flat MTF, broadcasters like the BBC did for a long time consider maintaining standard definition television, but improving its quality by shooting and viewing with many more pixels (though as previously mentioned, such a system, though impressive, does ultimately lack the very fine detail which, though attenuated, enhances the effect of true HD viewing.

Another factor in digital cameras and camcorders is lens resolution. A lens may be said to 'resolve' 1920 horizontal lines, but this does not mean that it does so with full modulation from black to white. The 'Modulation Transfer Function' (just a term for the magnitude of the optical transfer function with phase ignored) gives the true measure of lens performance, and is represented by a graph of amplitude against spatial frequency.

Lens aperture diffraction also limits MTF. Whilst reducing the aperture of a lens usually reduces aberations and hence improves the flatness of the MTF, there is an optimum aperture for any lens and image sensor size beyond which smaller apertures reduce resolution because of diffraction, which spreads light across the image sensor. This was hardly a problem in the days of plate cameras and even 35mm film, but has become an insurmountable limitation with the very small format sensors used in digital cameras and especially video cameras. First generation HD consumer camcorders used 1/4 inch sensors, for which apertures smaller than about f4 begin to limit resolution. Even professional video cameras mostly use 2/3 inch sensors, prohibiting the use of apertures around f16 that would have been considered normal for film formats. Certain cameras (such as the Pentax K10D) feature an "MTF autoexposure" mode, where the choice of aperture is optimised for maximum sharpness. Typically this means somewhere in the middle of the aperture range.[1]

The Trend to Digital Large-Format SLRs and improved MTF Potential

There has recently been a shift towards the use of large image format digital single lens reflex cameras driven by the need for low-light sensitivity and narrow depth of focus effects. This has led to such cameras becoming preferred by some film makers and television makers over even professional video cameras for use in HD video production, because of their 'filmic' potential. In theory the use of cameras with 16 and 21 megapixel sensors offers the possibility of almost perfect sharpness by downconversion within the camera, with digital filtering to eliminate aliasing. In practise such cameras currently fail in this respect and they do not have the processing power to do what is required. The Canon EOS5D is believed to use only every third pixel, and hence suffers bad aliasing, as it's optical filter is optimised for stills use. The Panasonic Lumix GH2 may do some processing across pixels, producing very sharp images, but with some aliasing. Nevertheless, such cameras produce very impressive results, and appear to be leading the way in video production towards large-format downconvertion with digital filtering becoming the standard approach to the realisation of a flat MTF with true freedom from aliasing.

Measuring Modulation Transfer Function

Although 'sharpness' is often judged on grid patterns of alternate black and white lines, it should strictly be measured using a sine-wave variation from black to white (a blurred version of the usual pattern). Where a square wave pattern is used (simple black and white lines) not only is there more risk of aliasing, but account must be taken of the fact that the fundamental component of a square wave is higher than the amplitude of the square wave itself (the harmonic components reduce the peak amplitude). A square wave test chart will therefore show optimistic results (better resolution of highh spatial frequencies than is actually achieved). The square wave result is sometimes referred to as the 'contrast transfer function' (CTF).

More Advanced Details

OTF may be broken down into the magnitude and phase components as follows:

where

and are spatial frequency in the x- and y-plane, respectively.


Phase is critically important to adaptive optics and holographic systems.

The OTF is the Fourier transform of the incoherent Point Spread Function.

The modulation transfer function represents the Bode plot of an imaging system (such as a microscope or the human eye), and thus depicts the filtering characteristic of the imaging system. The human eye, for instance, acts as a low-pass filter, meaning that very high-frequency components (sharp edges) cannot be perfectly perceived.


See also

References