Jump to content

User:Moderately Sized Greg/sandbox

From Wikipedia, the free encyclopedia

This is an old revision of this page, as edited by Moderately Sized Greg (talk | contribs) at 04:39, 23 December 2024. The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

This is a currently a working draft for some changes for the Optical flow page.

Optical Flow

The optic flow experienced by a rotating observer (in this case a fly). The direction and magnitude of optic flow at each location is represented by the direction and length of each arrow.

Optical flow or optic flow is the pattern of apparent motion of objects, surfaces, and edges in a visual scene caused by the relative motion between an observer and a scene.[1][2] Optical flow can also be defined as the distribution of apparent velocities of movement of brightness pattern in an image.[3]

The concept of optical flow was introduced by the American psychologist James J. Gibson in the 1940s to describe the visual stimulus provided to animals moving through the world.[4] Gibson stressed the importance of optic flow for affordance perception, the ability to discern possibilities for action within the environment. Followers of Gibson and his ecological approach to psychology have further demonstrated the role of the optical flow stimulus for the perception of movement by the observer in the world; perception of the shape, distance and movement of objects in the world; and the control of locomotion.[5]

The term optical flow is also used by roboticists, encompassing related techniques from image processing and control of navigation including motion detection, object segmentation, time-to-contact information, focus of expansion calculations, luminance, motion compensated encoding, and stereo disparity measurement.[6][7]

Estimation

Optical flow can be estimated in a number of ways. Broadly, optical flow estimation approaches can be divided into machine learning based models (sometimes called data-driven models), classical models (sometimes called knowledge-driven models) which do not use machine learning and hybrid models which use aspects of both learning based models and classical models.[8]

Classical Models

Many classical models use the intuitive assumption of brightness constancy; that even if a point moves between frames, its brightness stays constant. [9] To formalise this intuitive assumption, consider two consecutive frames from a video sequence, with intensity , where refer to pixel coordinates and refers to time. In this case, the brightness constancy constraint is

where is the displacement vector between a point in the first frame and the corresponding point in the second frame.[10]

Learning Based Models

Old Text

Sequences of ordered images allow the estimation of motion as either instantaneous image velocities or discrete image displacements.[7] David J. Fleet and Yair Weiss provide a tutorial introduction to gradient based optical flow.[11] Fleet, along with John L. Barron and Steven Beauchemin, also provides a performance analysis of a number of optical flow techniques, emphasizing the accuracy and density of measurements.[12]

The optical flow methods try to calculate the motion between two image frames which are taken at times and at every voxel position. These methods are called differential since they are based on local Taylor series approximations of the image signal; that is, they use partial derivatives with respect to the spatial and temporal coordinates.

For a (2D + t)-dimensional case (3D or n-D cases are similar) a voxel at location with intensity will have moved by , and between the two image frames, and the following brightness constancy constraint can be given:

Assuming the movement to be small, the image constraint at with Taylor series can be developed to get:

higher-order terms

By truncating the higher order terms (which performs a linearization) it follows that:

or, dividing by ,

which results in

where are the and components of the velocity or optical flow of and , and are the derivatives of the image at in the corresponding directions. , and can be written for the derivatives in the following.

Thus:

or

This is an equation in two unknowns and cannot be solved as such. This is known as the aperture problem of the optical flow algorithms. To find the optical flow another set of equations is needed, given by some additional constraint. All optical flow methods introduce additional conditions for estimating the actual flow.

Methods for determination

  • Phase correlation – inverse of normalized cross-power spectrum
  • Block-based methods – minimizing sum of squared differences or sum of absolute differences, or maximizing normalized cross-correlation
  • Differential methods of estimating optical flow, based on partial derivatives of the image signal and/or the sought flow field and higher-order partial derivatives, such as:
  • Discrete optimization methods – the search space is quantized, and then image matching is addressed through label assignment at every pixel, such that the corresponding deformation minimizes the distance between the source and the target image.[15] The optimal solution is often recovered through Max-flow min-cut theorem algorithms, linear programming or belief propagation methods.

Many of these, in addition to the current state-of-the-art algorithms are evaluated on the Middlebury Benchmark Dataset.[16][17] Other popular benchmark datasets are KITTI and Sintel.

Uses

Motion estimation and video compression have developed as a major aspect of optical flow research. While the optical flow field is superficially similar to a dense motion field derived from the techniques of motion estimation, optical flow is the study of not only the determination of the optical flow field itself, but also of its use in estimating the three-dimensional nature and structure of the scene, as well as the 3D motion of objects and the observer relative to the scene, most of them using the image Jacobian.[18]

Optical flow was used by robotics researchers in many areas such as: object detection and tracking, image dominant plane extraction, movement detection, robot navigation and visual odometry.[6] Optical flow information has been recognized as being useful for controlling micro air vehicles.[19]

The application of optical flow includes the problem of inferring not only the motion of the observer and objects in the scene, but also the structure of objects and the environment. Since awareness of motion and the generation of mental maps of the structure of our environment are critical components of animal (and human) vision, the conversion of this innate ability to a computer capability is similarly crucial in the field of machine vision.[20]

The optical flow vector of a moving object in a video sequence.

Consider a five-frame clip of a ball moving from the bottom left of a field of vision, to the top right. Motion estimation techniques can determine that on a two dimensional plane the ball is moving up and to the right and vectors describing this motion can be extracted from the sequence of frames. For the purposes of video compression (e.g., MPEG), the sequence is now described as well as it needs to be. However, in the field of machine vision, the question of whether the ball is moving to the right or if the observer is moving to the left is unknowable yet critical information. Not even if a static, patterned background were present in the five frames, could we confidently state that the ball was moving to the right, because the pattern might have an infinite distance to the observer.

Optical flow sensor

Various configurations of optical flow sensors exist. One configuration is an image sensor chip connected to a processor programmed to run an optical flow algorithm. Another configuration uses a vision chip, which is an integrated circuit having both the image sensor and the processor on the same die, allowing for a compact implementation.[21][22] An example of this is a generic optical mouse sensor used in an optical mouse. In some cases the processing circuitry may be implemented using analog or mixed-signal circuits to enable fast optical flow computation using minimal current consumption.

One area of contemporary research is the use of neuromorphic engineering techniques to implement circuits that respond to optical flow, and thus may be appropriate for use in an optical flow sensor.[23] Such circuits may draw inspiration from biological neural circuitry that similarly responds to optical flow.

Optical flow sensors are used extensively in computer optical mice, as the main sensing component for measuring the motion of the mouse across a surface.

Optical flow sensors are also being used in robotics applications, primarily where there is a need to measure visual motion or relative motion between the robot and other objects in the vicinity of the robot. The use of optical flow sensors in unmanned aerial vehicles (UAVs), for stability and obstacle avoidance, is also an area of current research.[24]

See also

References

  1. ^ Burton, Andrew; Radford, John (1978). Thinking in Perspective: Critical Essays in the Study of Thought Processes. Routledge. ISBN 978-0-416-85840-2.
  2. ^ Warren, David H.; Strelow, Edward R. (1985). Electronic Spatial Sensing for the Blind: Contributions from Perception. Springer. ISBN 978-90-247-2689-9.
  3. ^ Horn, Berthold K.P.; Schunck, Brian G. (August 1981). "Determining optical flow" (PDF). Artificial Intelligence. 17 (1–3): 185–203. doi:10.1016/0004-3702(81)90024-2. hdl:1721.1/6337.
  4. ^ Gibson, J.J. (1950). The Perception of the Visual World. Houghton Mifflin.
  5. ^ Royden, C. S.; Moore, K. D. (2012). "Use of speed cues in the detection of moving objects by moving observers". Vision Research. 59: 17–24. doi:10.1016/j.visres.2012.02.006. PMID 22406544. S2CID 52847487.
  6. ^ a b Aires, Kelson R. T.; Santana, Andre M.; Medeiros, Adelardo A. D. (2008). Optical Flow Using Color Information (PDF). ACM New York, NY, USA. ISBN 978-1-59593-753-7.
  7. ^ a b c Beauchemin, S. S.; Barron, J. L. (1995). "The computation of optical flow". ACM Computing Surveys. 27 (3). ACM New York, USA: 433–466. doi:10.1145/212094.212141. S2CID 1334552.
  8. ^ Zhai, Mingliang; Xiang, Xuezhi; Lv, Ning; Kong, Xiangdong (2021). "Optical flow and scene flow estimation: A survey". Pattern Recognition. 114: 107861. doi:10.1016/j.patcog.2021.107861.
  9. ^ Fortun, Denis; Bouthemy, Patrick; Kervrann, Charles (2015-05-01). "Optical flow modeling and computation: A survey". Computer Vision and Image Understanding. 134: 1–21. doi:10.1016/j.cviu.2015.02.008. Retrieved 2024-12-23.
  10. ^ Brox, Thomas; Bruhn, Andrés; Weickert, Joachim (2004). "High Accuracy Optical Flow Estimation Based on a Theory for Warping". Computer Vision - ECCV 2004. ECCV 2004. Berlin, Heidelberg: Springer Berlin Heidelberg. pp. 25–36.
  11. ^ Fleet, David J.; Weiss, Yair (2006). "Optical Flow Estimation" (PDF). In Paragios, Nikos; Chen, Yunmei; Faugeras, Olivier D. (eds.). Handbook of Mathematical Models in Computer Vision. Springer. pp. 237–257. ISBN 978-0-387-26371-7.
  12. ^ Barron, John L.; Fleet, David J. & Beauchemin, Steven (1994). "Performance of optical flow techniques" (PDF). International Journal of Computer Vision. 12: 43–77. CiteSeerX 10.1.1.173.481. doi:10.1007/bf01420984. S2CID 1290100.
  13. ^ a b Zhang, G.; Chanson, H. (2018). "Application of Local Optical Flow Methods to High-Velocity Free-surface Flows: Validation and Application to Stepped Chutes" (PDF). Experimental Thermal and Fluid Science. 90: 186–199. Bibcode:2018ETFS...90..186Z. doi:10.1016/j.expthermflusci.2017.09.010.
  14. ^ Glyn W. Humphreys and Vicki Bruce (1989). Visual Cognition. Psychology Press. ISBN 978-0-86377-124-8.
  15. ^ B. Glocker; N. Komodakis; G. Tziritas; N. Navab; N. Paragios (2008). Dense Image Registration through MRFs and Efficient Linear Programming (PDF). Medical Image Analysis Journal.
  16. ^ Baker, Simon; Scharstein, Daniel; Lewis, J. P.; Roth, Stefan; Black, Michael J.; Szeliski, Richard (March 2011). "A Database and Evaluation Methodology for Optical Flow". International Journal of Computer Vision. 92 (1): 1–31. doi:10.1007/s11263-010-0390-2. ISSN 0920-5691. S2CID 316800.
  17. ^ Baker, Simon; Scharstein, Daniel; Lewis, J. P.; Roth, Stefan; Black, Michael J.; Szeliski, Richard. "Optical Flow". vision.middlebury.edu. Retrieved 2019-10-18.
  18. ^ Corke, Peter (8 May 2017). "The Image Jacobian". QUT Robot Academy.
  19. ^ Barrows, G. L.; Chahl, J. S.; Srinivasan, M. V. (2003). "Biologically inspired visual sensing and flight control". Aeronautical Journal. 107 (1069): 159–268. doi:10.1017/S0001924000011891. S2CID 108782688 – via Cambridge University Press.
  20. ^ Brown, Christopher M. (1987). Advances in Computer Vision. Lawrence Erlbaum Associates. ISBN 978-0-89859-648-9.
  21. ^ Moini, Alireza (2000). Vision Chips. Boston, MA: Springer US. ISBN 9781461552673. OCLC 851803922.
  22. ^ Mead, Carver (1989). Analog VLSI and neural systems. Reading, Mass.: Addison-Wesley. ISBN 0201059924. OCLC 17954003.
  23. ^ Stocker, Alan A. (2006). Analog VLSI circuits for the perception of visual motion. Chichester, England: John Wiley & Sons. ISBN 0470034882. OCLC 71521689.
  24. ^ Floreano, Dario; Zufferey, Jean-Christophe; Srinivasan, Mandyam V.; Ellington, Charlie, eds. (2009). Flying insects and robots. Heidelberg: Springer. ISBN 9783540893936. OCLC 495477442.