Abstract
3D Imaging, Analysis and Applications is a comprehensive textbook on 3D shape capture, 3D shape processing and how such capture and processing can be used. Eleven chapters cover a broad range of concepts, algorithms and applications and they are split into three parts, as follows: Part I, 3D Imaging and Shape Representation, presents techniques for capture, representation and visualization of 3D data; Part II, 3D Shape Analysis and Processing presents feature-based methods of analysis, registration and shape matching and, finally, Part III, 3D Imaging Applications presents application areas in 3D face recognition, remote sensing and medical imaging. This introduction provides the reader with historical and background information, such as that relating to the development of computer vision; in particular, the development of automated 3D imaging. It briefly discusses general depth estimation principles for 3D imaging, details a selection of seminal papers, sketches applications of 3D imaging and concludes with an outline of the book’s remaining chapters.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
Typically, this term is used when the 3D data is acquired from multiple viewpoint 2D images.
- 2.
Typically, this term is used when a scanner acquired the 3D data, such as a laser stripe scanner.
- 3.
Typically, this term is used when the data is ordered in a regular grid, such as the 2D array of depth values in a range image, or a 3D array of data in volumetric medical imaging.
- 4.
Euclid of Alexandria, Greek mathematician, also referred to as the Father of Geometry, lived in Alexandria during the reign of Ptolemy I (323–283 BC).
- 5.
Alhazen (Ibn al-Haytham), born 965 CE in Basra, Iraq, died in 1040. Introduced the concept of physical optics and experimented with lenses, mirrors, camera obscura, refraction and reflection.
- 6.
Sir Austen Henry Layard (1817–1894), British archaeologist, found a polished rock crystal during the excavation of ancient Nimrud, Iraq. The lens has a diameter of 38 mm, presumed creation date 750–710 BC and now on display at the British Museum, London.
- 7.
Lucius Annaeus Seneca, around 4 BC–65 CE, was a Roman philosopher, statesman, dramatist, tutor and adviser of Nero.
- 8.
Small and thin bi-convex lenses look like lentils, hence the name lens, which is Latin for lentil.
- 9.
Nicéphore Niépce, 1765–1833, is credited as one of the inventors of photography by solar light etching (Heliograph) in 1826. He later worked with Louis-Jacques-Mandé Daguerre, 1787–1851, who acquired a patent for his Daguerreotype, the first practical photography process based on silver iodide, in 1839. In parallel, William Henry Fox Talbot, 1800–1877, developed the calotype process, which uses paper coated with silver iodide. The calotype produced a negative image from which a positive could be printed using silver chloride coated paper [19].
- 10.
The Greek word stereos for solid is used to indicate a spatial 3D extension of vision, hence stereoscopic stands for a 3D form of visual information.
- 11.
Gabriel Lippmann, 1845–1921, French scientist, received the 1908 Nobel price in Physics for his method to reproduce color pictures by interferometry.
- 12.
Sir Charles Wheatstone, 1802–1875, English physicist and inventor.
- 13.
The terms disparity and parallax are sometimes used interchangeably in the literature and this misuse of terminology is a source of confusion. One way to think about parallax is that it is induced by the difference in disparity between foreground and background objects over a pair of views displaced by a translation. The end result is that the foreground is in alignment with different parts of the background. Disparity of foreground objects and parallax then only become equivalent when the distance of background objects can be treated as infinity (e.g. distant stars), in this case the background objects are stationary in the image.
- 14.
Sir David Brewster, 1781–1868, Scottish physicist and inventor.
- 15.
Szeliski, Computer Vision: Algorithms and Applications, p. 10 [49].
- 16.
Intrinsic Image Dimension (IID) describes the local change in the image. Constant image: 0D, linear structures: 1D, point structures: 2D.
- 17.
A pdf version is also available for personal use on the website http://szeliski.org/Book/.
- 18.
This triangle defines an epipolar plane, which is discussed in Chap. 2.
- 19.
Kinect is a trademark of Microsoft.
- 20.
Figures are a preprint from the forthcoming Encyclopedia of Computer Vision [29].
- 21.
Twelve milestones is a small number, with the selection somewhat subjective and open to debate. We are merely attempting to give a glimpse of the subject’s development and diversity, not a definitive and comprehensive history.
- 22.
Zhang’s seminal work is pre-dated by a large body of pioneering work on calibration, such as D.C. Brown’s work in the context of photogrammetry, which dates back to the 1950s and many other works in computer vision, such as the seminal two-stage method of Tsai [53].
- 23.
A geodesic distance between two points on a surface is the minimal across-surface distance.
- 24.
Kinect and XBox are trademarks of Microsoft Corporation.
References
Adelson, E.H., Bergen, J.R.: The plenoptic function and the elements of early vision. In: Landy, M., Movshon, J.A. (eds.) Computational Models of Visual Processing (1991)
Arun, K.S., Huang, T.S., Blostein, S.D.: Least-squares fitting of two 3d point sets. IEEE Trans. Pattern Anal. Mach. Intell. 9(5), 698–700 (1987)
Bartczak, B., Vandewalle, P., Grau, O., Briand, G., Fournier, J., Kerbiriou, P., Murdoch, M., Mller, M., Goris, R., Koch, R., van der Vleuten, R.: Display-independent 3d-TV production and delivery using the layered depth video format. IEEE Trans. Broadcast. 57(2), 477–490 (2011)
Bennet, R.: Representation and Analysis of Signals. Part xxi: The Intrinsic Dimensionality of Signal Collections, Rep. 163. The Johns Hopkins University, Baltimore (1965)
Besl, P., McKay, N.D.: A method for registration of 3D shapes. IEEE Trans. Pattern Anal. Mach. Intell. 14(2), 239–256 (1992)
Bigun, J., Granlund, G.: Optimal orientation detection of linear symmetry. In: First International Conference on Computer Vision, pp. 433–438. IEEE Computer Society, New York (1987)
Blundell, B., Schwarz, A.: The classification of volumetric display systems: characteristics and predictability of the image space. IEEE Trans. Vis. Comput. Graph. 8, 66–75 (2002)
Boyer, K., Kak, A.: Color-encoded structured light for rapid active ranging. IEEE Trans. Pattern Anal. Mach. Intell. 9(1) (1987)
Brewster, S.D.: The Stereoscope: Its History, Theory, and Construction with Applications to the fine and useful Arts and to Education. John Murray, Albemarle Street, London (1856)
Brownson, C.D.: Euclid’s optics and its compatibility with linear perspective. Arch. Hist. Exact Sci. 24, 165–194 (1981). doi:10.1007/BF00357417
Creusot, C.: Automatic landmarking for non-cooperative 3d face recognition. Ph.D. thesis, Department of Computer Science, University of York, UK (2011)
Curtis, G.: The Cave Painters. Knopf, New York (2006)
Faugeras, O.: What can be seen in three dimensions with an uncalibrated stereorig? In: Sandini, G. (ed.) Computer Vision: ECCV’92. Lecture Notes in Computer Science, vol. 588, pp. 563–578. Springer, Berlin (1992)
Faugeras, O., Luong, Q., Maybank, S.: Camera self-calibration: theory and experiments. In: Sandini, G. (ed.) Computer Vision: ECCV’92. Lecture Notes in Computer Science, vol. 588, pp. 321–334. Springer, Berlin (1992)
Faugeras, O.D., Hebert, M.: The representation, recognition and locating of 3-d objects. Int. J. Robot. Res. 5(3), 27–52 (1986)
Forsyth, D., Ponce, J.: Computer Vision: A Modern Approach. Prentice Hall, Upper Saddle River (2003)
Fusiello, A.: Visione computazionale. Appunti delle lezioni. Pubblicato a cura dell’autore (2008)
Gennery, D.B.: A stereo vision system for an autonomous vehicle. In: Proc. 5th Int. Joint Conf. Artificial Intell (IJCAI), pp. 576–582 (1977)
Gernsheim, H., Gernsheim, A.: The History of Photography. Mc Graw-Hill, New York (1969)
Harris, C., Stephens, M.J.: A combined corner and edge detector. In: Alvey Vision Conference (1988)
Hartley, R., Zisserman, A.: Multiple View Geometry in Computer Vision. Cambridge University, Cambridge (2003). ISBN 0-521-54051-8
Hartley, R.I.: In defence of the 8-point algorithm. In: Proceedings of the Fifth International Conference on Computer Vision, ICCV’95, p. 1064. IEEE Computer Society, Washington (1995)
Hartley, R.I.: In defence of the 8-point algorithm. IEEE Trans. Pattern Anal. Mach. Intell. 19(6), 580–593 (1997)
Horn, B.K.P.: Shape from shading: a method for obtaining the shape of a smooth opaque object from one view. Ph.D. thesis, MIT, Cambridge, MA, USA (1970)
Horn, B.K.P.: Closed-form solution of absolute orientation using unit quaternions. J. Opt. Soc. Am. A 4(4), 629–642 (1987)
Johnson, A.E., Hebert, M.: Using spin images for efficient object recognition in cluttered 3d scenes. IEEE Trans. Pattern Anal. Mach. Intell. 21(5), 433–449 (1997)
Jordt, A., Koch, R.: Fast tracking of deformable objects in depth and colour video. In: Proceedings of the British Machine Vision Conference (BMVC) (2011)
King, H.: The History of Telescope. Griffin, London (1955)
Koch, R.: Depth estimation. In: Ikeuchi, K. (ed.) Encyclopedia of Computer Vision. Springer, New York (2013)
Koch, R., Schiller, I., Bartczak, B., Kellner, F., Koeser, K.: Mixin3d: 3d mixed reality with ToF-camera. In: Dynamic 3D Imaging DAGM 2009 Workshop, Dyn3D, Jena, Germany. Lecture Notes in Computer Science, vol. 5742, pp. 126–141 (2009)
Kolb, A., Barth, E., Koch, R., Larsen, R.: Time-of-flight cameras in computer graphics. Comput. Graph. Forum 29(1), 141–159 (2010)
Kolb, A., Koch, R.: Dynamic 3D Imaging. Lecture Notes in Computer Science, vol. 5742. Springer, Berlin (2009)
Kriss, T.C., Kriss, V.M.: History of the operating microscope: from magnifying glass to microneurosurgery. Neurosurgery 42(4), 899–907 (1998)
Lippmann, G.: La photographie integrale (English translation Fredo Durant, MIT-csail). In: Academy Francaise: Photography-Reversible Prints. Integral Photographs (1908)
Longuet-Higgins, H.C.: A computer algorithm for re-constructing a scene from two projections. Nature 293, 133–135 (1981)
Ma, Y., Soatto, S., Kosecka, J., Sastry, S.: An Invitation to 3D Vision: From Images to Geometric Models. Springer, Berlin (2003)
Fischler, M.A., Bolles, R.C.: Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography. Commun. ACM 24, 381–395 (1981)
Marr, D.: Vision. A Computational Investigation Into the Human Representation and Processing of Visual Information. Freeman, New York (1982)
Nayar, S.K., Watanabe, M., Noguchi, M.: Real-time focus range sensor. IEEE Trans. Pattern Anal. Mach. Intell. 18(12), 1186–1198 (1996)
Rioux, M.: Laser range finder based on synchronized scanners. Appl. Opt. 23(21), 3837–3844 (1984)
Savran, A., Alyuz, N., Dibeklioglu, H., Celiktutan, O., Gokberk, B., Sankur, B., Akarun, L.: Bosphorus database for 3d face analysis. In: Biometrics and Identity Management. Lecture Notes in Computer Science, vol. 5372, pp. 47–56 (2008)
Scharstein, D., Szeliski, R.: A taxonomy and evaluation of dense two-frame stereo correspondence algorithms. Int. J. Comput. Vis. 47, 7–42 (2002)
Schölkopf, B., Smola, A.: Learning with Kernels: Support Vector Machines, Regularization, Optimization and Beyond. MIT Press, Cambridge (2002)
Schwarte, R., Xu, Z., Heinol, H.G., Olk, J., Klein, R., Buxbaum, B., Fischer, H., Schulte, J.: New electro-optical mixing and correlating sensor: facilities and applications of the photonic mixer device (PMD). In: Proc. SPIE, vol. 3100 (1997)
Shirai, Y.: Recognition of polyhedrons with a range finder. Pattern Recognit. 4, 243–250 (1972)
Shotton, J., Fitzgibbon, A., Cook, M., Sharp, T., Finocchio, M., Moore, R., Kipman, A., Blake, A.: Real-time human pose recognition in parts from single depth images. In: CVPR (2011)
Smith, A.M.: Alhacen’s theory of visual perception: a critical edition, with English translation and commentary, of the first three books of Alhacen’s de aspectibus, the medieval Latin version of Ibn al-Haytham’s Kitab al-Manazir. Trans. Am. Philos. Soc. 91 (2001)
Sun, J., Ovsjanikov, M., Guibas, L.: A concise and provably informative multi-scale signature based on heat diffusion. Comput. Graph. Forum 28(5), 1383–1392 (2009)
Szeliski, R.: Computer Vision, Algorithms and Applications. Springer, Berlin (2010)
Tanimoto, S., Pavlidis, T.: A hierarchal data structure for picture processing. Comput. Graph. Image Process. 4, 104–113 (1975)
Triggs, B., McLauchlan, P.F., Hartley, R.I., Fitzgibbon, A.W.: Bundle adjustment—a modern synthesis. In: Proceedings of the International Workshop on Vision Algorithms: Theory and Practice, ICCV’99, pp. 298–372. Springer, London (2000). http://portal.acm.org/citation.cfm?id=646271.685629
Trucco, E., Verri, A.: Introductory Techniques for 3-D Computer Vision. Prentice Hall, New York (1998)
Tsai, R.Y.: A versatile camera calibration technique for high accuracy 3d machine vision metrology using off-the-shelf TV cameras and lenses. IEEE J. Robot. Autom. 3(4), 323–344 (1987)
Wheatstone, C.: Contributions to the physiology of vision. Part the first. On some remarkable, and hitherto unobserved, phenomena of binocular vision. In: Philosophical Transactions of the Royal Society of London, pp. 371–394 (1838)
Witkin, A.P.: Scale-space filtering. In: Proceedings of the Eighth International Joint Conference on Artificial Intelligence, vol. 2, pp. 1019–1022. Morgan Kaufmann, San Francisco (1983). http://portal.acm.org/citation.cfm?id=1623516.1623607
Yang, R., Pollefeys, M.: A versatile stereo implementation on commodity graphics hardware. Real-Time Imaging 11, 7–18 (2005)
Zhang, Z.: A flexible new technique for camera calibration. IEEE Trans. Pattern Anal. Mach. Intell. 22(11), 1330–1334 (2000)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag London
About this chapter
Cite this chapter
Koch, R., Pears, N., Liu, Y. (2012). Introduction. In: Pears, N., Liu, Y., Bunting, P. (eds) 3D Imaging, Analysis and Applications. Springer, London. https://doi.org/10.1007/978-1-4471-4063-4_1
Download citation
DOI: https://doi.org/10.1007/978-1-4471-4063-4_1
Publisher Name: Springer, London
Print ISBN: 978-1-4471-4062-7
Online ISBN: 978-1-4471-4063-4
eBook Packages: Computer ScienceComputer Science (R0)