Bibliography

[1]   Updated call for proposals on multi-view video coding. Join Video Team ISO/IEC JTC1/SC29/WG11 MPEG2005/N7567, October 2005.

[2]   Information technology - mpeg video technologies - part3: Representation of auxiliary data and supplemental information. International Standard: ISO/IEC 23002-3:2007, January 2007.

[3]   M. Adams and F. Kossentini. Jasper: A software-based JPEG-2000 codec implementation. In IEEE International Conference on Image Processing, volume 2, pages 53–56, October 2000.

[4]   L. Aimar, L. Merritt, E. Petit, M. Chen, J. Clay, M. Rullgard, R. Czyz, C. Heine, A. Izvorski, and A. Wright. Webpage title: x264 a free H264/AVC encoder. http://www.videolan.org/developers/x264.html, last visited: January 2009.

[5]   S. Birchfield and C. Tomasi. A pixel dissimilarity measure that is insensitive to image sampling. IEEE Transactions on Pattern Analysis and Machine Intelligence, 20(4):401–406, 1998.

[6]   S. Birchfield and C. Tomasi. Depth discontinuities by pixel-to-pixel stereo. International Journal of Computer Vision, 35(3):269–293, 1999.

[7]   M. Bleyer and M. Gelautz. A layered stereo matching algorithm using image segmentation and global visibility constraints. ISPRS Journal of Photogrammetry and Remote Sensing, 59(3):128–150, 2005.

[8]   A. F. Bobick and S. S. Intille. Large occlusion stereo. International Journal of Computer Vision, 33(3):181–200, 1999.

[9]   A. Bourge and C. Fehn. White paper on ISO/IEC 23002-3 auxiliary video data representations. ISO/IEC JTC1/SC29/WG11/N8039, April 2006.

[10]   Y. Boykov, O. Veksler, and R. Zabih. Fast approximate energy minimization via graph cuts. IEEE Transactions on Pattern Analysis and Machine Intelligence, 23(11):1222–1239, 2001.

[11]   B.-B. Chai, S. Sethuraman, and H. S. Sawhney. A depth map representation for real-time transmission and view-based rendering of a dynamic 3D scene. In First International Symposium on 3D Data Processing Visualization and Transmission, pages 107–114, June 2002.

[12]   J.-X. Chai, S.-C. Chan, H.-Y. Shum, and X. Tong. Plenoptic sampling. In International Conference on Computer graphics and interactive techniques, (ACM SIGGRAPH), pages 307–318. ACM Press, 2000.

[13]   Y. Chen, P. Pandit, and S. Yea. Study Text of ISO/IEC 14496-5:2001/PDAM 15 Reference Software for Multiview Video Coding. ISO/IEC JTC1/SC29/WG11 and ITU-T SG16 Q.6, October 2008.

[14]   Y. Chen, Y.-K. Wang, K. Ugur, M. M. Hannuksela, J. Lainema, and M. Gabbouj. The emerging MVC standard for 3D video services. EURASIP Journal on Advances in Signal Processing, (1), January 2009.

[15]   P. A. Chou, T. D. Lookabaugh, and R. M. Gray. Optimal pruning with applications to tree-structured source coding and modeling. IEEE Transactions on Information Theory, 35(2):299–315, March 1989.

[16]   C. Cigla, X. Zabulis, and A. A. Alatan. Region-based dense depth extraction from multi-view video. In IEEE International Conference on Image Processing, volume 5, pages V213–V216, San Antonio, USA, September 2007.

[17]   I. J. Cox, S. L. Hingorani, S. B. Rao, and B. M. Maggs. A maximum likelihood stereo algorithm. Computer Vision and Image Understanding, 63(3):542–567, 1996.

[18]   M. O. de Beeck, E. Fert, , C. Fehn, and P. Kauff. Broadcast Requirements on 3D Video Coding. ISO/IEC JTC1/SC29/WG11 MPEG02/M8040, March 2002.

[19]   P. E. Debevec, G. Borshukov, and Y. Yu. Efficient view-dependent image-based rendering with projective texture-mapping. In Proceedings of the 9th Eurographics Workshop on Rendering 1998, June 1998.

[20]   F. Devernay. Vision stéréoscopique et propriétés différentielles des surfaces. PhD thesis, Ecole Polytechnique, Palaiseau, France, February 1997.

[21]   M. N. Do and M. Vetterli. The finite ridgelet transform for image representation. IEEE Transactions on Image Processing, 12(1):16–28, 2003.

[22]   M. N. Do and M. Vetterli. The contourlet transform: an efficient directional multiresolution image representation. IEEE Transactions on Image Processing, 14(12):2091–2106, 2005.

[23]   D. Donoho. Wedgelets: nearly minimax estimation of edges. Annals of Statistics, 27(3):859–897, March 1999.

[24]   D. Drascic. Skill acquisition and task performance in teleoperation using monoscopic and stereoscopic video remote viewing. In Proceedings of the Human Factors Society 35th Annual Meeting, pages 1367–1371, San Fransisco, USA, September 1991.

[25]   D. Farin, Y. Morvan, and P. H. N. de With. View interpolation along a chain of weakly calibrated cameras. In IEEE Workshop on Content Generation and Coding for 3D-Television, Eindhoven, The Netherlands, June 2006.

[26]   U. Fecker and A. Kaup. H.264/AVC compatible coding of dynamic light fields using transposed picture ordering. In Proceedings of the European Signal Processing Conference (EUSIPCO), volume 1, Antalya, Turkey, September 2005.

[27]   C. Fehn. Depth-image-based rendering (DIBR), compression, and transmission for a new approach on 3d-tv. In Proceedings of the SPIE, Stereoscopic Displays and Virtual Reality Systems XI, volume 5291, pages 93–104, 2004.

[28]   C. Fehn, N. Atzpadin, M. Mueller, O. Schreer, A. Smolic, R. Tanger, and P. Kauff. An advanced 3DTV concept providing interoperabilty and scalabilty for a wide range of multi-baseline geometries. In IEEE International Conference on Image Processing, pages 2961–2964, Atlanta, October 2006.

[29]   C. Fehn, K. Schuur, P. Kauff, and A. Smolic. Coding results for EE4 in MPEG 3DAV. ISO/IEC JTC 1/SC 29/WG 11, MPEG03/M9561, March 2003.

[30]   M. A. Fischler and R. C. Bolles. Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography. Communications of the ACM, 24(6):381–395, 1981.

[31]   M. Flier, A. Mavlankar, and B. Girod. Motion and disparity compensated coding for multi-view video. IEEE Transactions on Circuits and Systems for Video Technology, 17(7):1474–1484, 2007.

[32]   A. Fusiello, V. Roberto, and E. Trucco. Efficient stereo with multiple windowing. In IEEE Conference on Computer Vision and Pattern Recognition, pages 858–863, Puerto Rico, June 1997.

[33]   A. Fusiello, E. Trucco, and A. Verri. A compact algorithm for rectification of stereo pairs. Machine Vision and Applications, 12(1):16–22, 2000.

[34]   B. Girod. The efficiency of motion-compensating prediction for hybrid coding of video sequence. IEEE Journal on Selected Areas in Communications, 5(7):1140–1154, 1987.

[35]   A. M. Gorski. User evaluation of a stereoscopic display for space-training applications. In Proceedings of the SPIE, Stereoscopic Displays and Applications III, volume 1669, pages 236–243, San Jose, USA, June 1992.

[36]   S. J. Gortler, R. Grzeszczuk, R. Szeliski, and M. F. Cohen. The lumigraph. In International conference on Computer graphics and interactive techniques, (ACM SIGGRAPH), pages 43–54. ACM Press, 1996.

[37]   R. Hartley and A. Zisserman. Multiple View Geometry in Computer Vision. Cambridge University Press, 2004.

[38]   B. Heigl, R. Koch, M. Pollefeys, J. Denzler, and L. J. V. Gool. Plenoptic modeling and rendering from image sequences taken by hand-held camera. In Deutsche Arbeitsgemeinschaft fr Mustererkennung-Symposium, pages 94–101, 1999.

[39]   J. Ilgner, J. J.-H. Park, D. Labbé, and M. Westhofen. Using a high-definition stereoscopic video system to teach microscopic surgery. In Proceedings of the SPIE, Stereoscopic Displays and Virtual Reality Systems XIV, volume 6490, page 649008, San Jose, USA, February 2007.

[40]   N. Inamoto and H. Saito. Free viewpoint video synthesis and presentation from multiple sporting videos. In IEEE International Conference on Multimedia and Expo, page 4, Amsterdam, The Netherlands, July 2005.

[41]   T. Kanade and M. Okutomi. A stereo matching algorithm with an adaptive window: theory and experiment. IEEE Transactions on Pattern Analysis and Machine Intelligence, 16(9):920–932, 1994.

[42]   S. B. Kang, R. Szeliski, and J. Chai. Handling occlusions in dense multi-view stereo. In IEEE Conference Computer Vision and Pattern Recognition, volume 1, pages I–103–I–110, 2001.

[43]   P. Kauff, N. Atzpaadin, C. Fehn, M. Mueller, O. Schreer, A. Smolic, and R. Tanger. Depth map creation and image-based rendering for advanced 3DTV services providing interoperability and scalability. Signal Processing: Image Communication, 22(2):217–234, 2007.

[44]   A. Kaup and U. Fecker. Analysis of multi-reference block matching for multi-view video coding. In Proceedings of 7th Workshop Digital Broadcasting, pages 33–39, Erlangen, Germany, September 2006.

[45]   J. H. Kim, P. Lai, J. Lopez, A. Ortega, Y. Su, P. Yin, and C. Gomila. New coding tools for illumination and focus mismatch compensation in multiview video coding. IEEE Transactions on Circuits and Systems for Video Technology, 17(11):1519–1535, 2007.

[46]   T. Koga, K. Iinuna, A. Hirano, Y. Iijima, and T. Ishiguro. Motion Compensated Interframe Coding for Video Conferencing. In Proceedings of National Telecommunication, volume 4, pages G5.3.1–G5.3.5, New Orleans, LA, December 1981.

[47]   V. Kolmogorov and R. Zabih. Computing visual correspondence with occlusions via graph cuts. In IEEE International Conference on Computer Vision, volume 2, pages 508–515, Vancouver, Canada, 2006.

[48]   R. Krishnamurthy, B.-B. Chai, H. Tao, and S. Sethuraman. Compression and transmission of depth maps for image-based rendering. In IEEE International Conference on Image Processing, volume 3, pages 828–831, October 2001.

[49]   S. Laveau and O. Faugeras. 3-D scene representation as a collection of images. In International Conference on Pattern Recognition, volume 1, pages 689–691, Jerusalem, Israel, October 1994.

[50]   M. Levoy and P. Hanrahan. Light field rendering. In International Conference on Computer graphics and interactive techniques, (ACM SIGGRAPH), pages 31–42. ACM Press, 1996.

[51]   M. Magnor, P. Ramanathan, and B. Girod. Multi-view coding for image based rendering using 3-D scene geometry. IEEE Transactions on Circuits Systems and Video Technology, 13(11):1092–1106, November 2003.

[52]   M. Maitre and M. N. Do. Joint encoding of the depth image based representation using shape-adaptive wavelets. In IEEE International Conference on Image Processing, volume 1, pages 1768–1771, San Antonio, USA, September 2008.

[53]   W. R. Mark, L. McMillan, and G. Bishop. Post-rendering 3d warping. In Symposium on Interactive 3D graphics, pages 7–16. ACM Press, 1997.

[54]   E. Martinian, A. Behrens, J. Xin, and A. Vetro. View synthesis for multiview video compression. In Picture Coding Symposium, Beijing, China, May 2006.

[55]   W. Matusik and H. Pfister. 3D TV: a scalable system for real-time acquisition, transmission, and autostereoscopic display of dynamic scenes. ACM Transactions on Graphics, 23(3):814–824, 2004.

[56]   L. McMillan. An Image-Based Approach to Three-Dimensional Computer Graphics. PhD thesis, University of North Carolina, Chapel Hill, USA, April 1997.

[57]   P. Merkle, Y. Morvan, A. Smolic, K. Mueller, P. H. de With, and T. Wiegand. The effect of depth compression on multi-view rendering quality. In IEEE 3DTV Conference: The True Vision - Capture, Transmission and Display of 3D Video, pages 245–248, Istanbul, Turkey, May 2008.

[58]   P. Merkle, Y. Morvan, A. Smolic, K. Mueller, P. H. de With, and T. Wiegand. The effects of multiview depth video compression on multiview rendering. Signal Processing: Image Communication, 24(1-2):73–88, January 2009.

[59]   P. Merkle, K. Mueller, A. Smolic, and T. Wiegand. Efficient compression of multi-view video exploiting inter-view dependencies based on H.264/MPEG4-AVC. In IEEE International Conference on Multimedia and Expo, pages 1717–1720, Toronto, Canada, July 2006.

[60]   P. Merkle, A. Smolic, K. Mueller, and T. Wiegand. Comparative study of MVC structures. ISO/IEC JTC1/SC29/WG11 and ITU-T SG16 Q.6, JVT-V132, January 2007.

[61]   P. Merkle, A. Smolic, K. Mueller, and T. Wiegand. Multi-view video plus depth representation and coding. In IEEE International Conference on Image Processing, volume 1, pages 201–204, San Antonio, USA, 2007.

[62]   P. Merkle, A. Smolic, K. Mueller, and T. Wiegand. MVC: Experiments on Coding of Multi-view Video plus Depth. ISO/IEC JTC1/SC29/WG11 and ITU-T SG16 Q.6, JVT-X064, June 2007.

[63]   P. Merkle, A. Smolic, K. Mueller, and T. Wiegand. Efficient prediction structures for multiview video coding. IEEE Transactions on Circuits and Systems for Video Technology, 17(11):1461–1473, November 2007.

[64]   Y. Morvan, P. H. N. de With, and D. Farin. Platelet-based coding of depth maps for the transmission of multiview images. In Proceedings of the SPIE, Stereoscopic Displays and Virtual Reality Systems XIII, volume 6055, page 60550K, San Jose, USA, January 2006.

[65]   Y. Morvan, D. Farin, and P. H. N. de With. Coding depth images with piecewise linear functions for multi-view synthesis. In Proceedings of the European Signal Processing Conference (EUSIPCO), Antalya, Turkey, September 2005.

[66]   Y. Morvan, D. Farin, and P. H. N. de With. Coding of depth-maps using piecewise linear functions. In 26th Symposium on Information Theory in the Benelux, pages 121–128, Brussels, Belgium, 2005.

[67]   Y. Morvan, D. Farin, and P. H. N. de With. Novel coding technique for depth images using quadtree decomposition and plane approximation. In Proceedings of the SPIE, Visual Communications and Image Processing, volume 5960, pages 1187–1194, Beijing, China, July 2005.

[68]   Y. Morvan, D. Farin, and P. H. N. de With. Design considerations for view interpolation in a 3D video coding framework. In 27th Symposium on Information Theory in the Benelux, volume 1, pages 93–100, Noordwijk, The Netherlands, May 2006.

[69]   Y. Morvan, D. Farin, and P. H. N. de With. Depth-image compression based on an R-D optimized quadtree decomposition for the transmission of multiview images. In IEEE International Conference on Image Processing, volume 5, pages V–105 – V–108, San Antonio, USA, September 2007.

[70]   Y. Morvan, D. Farin, and P. H. N. de With. Incorporating depth-image based view-prediction into H.264 for multiview-image coding. In IEEE International Conference on Image Processing, volume I, pages I–205– I–208, San Antonio, USA, September 2007.

[71]   Y. Morvan, D. Farin, and P. H. N. de With. Joint depth/texture bit-allocation for multi-view video compression. In Picture Coding Symposium, Lisboa, Portugal, November 2007.

[72]   Y. Morvan, D. Farin, and P. H. N. de With. Multiview depth-image compression using an extended H.264 encoder. In Lecture Notes in Computer Science: Advanced Concepts for Intelligent Vision Systems, volume 4678, pages 675–686, Delft, The Netherlands, August 2007.

[73]   Y. Morvan, D. Farin, and P. H. N. de With. Predictive coding of depth images across multiple views. In Proceedings of the SPIE, Stereoscopic Displays and Virtual Reality Systems XIV, volume 6490, page 64900P, San Jose, USA, January 2007.

[74]   Y. Morvan, D. Farin, and P. H. N. de With. Design considerations for a 3D-TV video coding architecture. In IEEE International Conference on Consumer Electronics, January 2008.

[75]   Y. Morvan, D. Farin, and P. H. N. de With. System architecture for Free-Viewpoint Video and 3D-TV. IEEE Transactions on Consumer Electronics, 54(2):925–932, 2008.

[76]   J.-R. Ohm. Stereo/multiview video encoding using the mpeg family of standards. In Proceedings of the SPIE, Stereoscopic Displays and Virtual Reality Systems VI, volume 3639, pages 242–253, San Jose, USA, 1999.

[77]   M. M. Oliveira. Relief Texture Mapping. PhD thesis, University of North Carolina, Chapel Hill, USA, March 2000.

[78]   A. Ortega and K. Ramchandran. Rate-distortion methods for image and video compression. IEEE Signal Processing Magazine, 15:23–50, 1998.

[79]   E. L. Pennec and S. Mallat. Sparse geometric image representations with bandelets. IEEE Transactions on Image Processing, 14(4):423–438, 2005.

[80]   G. Peyré and S. Mallat. Discrete bandelets with geometric orthogonal filters. In IEEE International Conference on Image Processing, volume 1, pages I–65–8, Genova, Italy, September 2005.

[81]   P. Prandoni. Optimal Segmentation Techniques for Piecewise Stationary Signals. PhD thesis, Ecole Polytechnique Fédérale de Lausanne, Lausanne, Switzerland, March 1999.

[82]   K. Pulli, M. Cohen, T. Duchamp, H. Hoppe, L. Shapiro, and W. Stuetzle. View-based rendering: Visualizing real objects from scanned range and color data. In Proceedings of the Eighth Eurographics Workshop on Rendering 1997, pages 23–34, 1997.

[83]   A. Redert, R.-P. Berretty, C. Varekamp, O. Willemsen, J. Swillens, and H. Driessen. Philips 3D solutions: From content creation to visualization. In Proceedings of the Third International Symposium on 3D Data Processing, Visualization, and Transmission, pages 429–431, Chapel Hill, USA, 2006.

[84]   J. B. Roerdink and A. Meijster. The watershed transform: Definitions, algorithms and parallelization strategies. FUNDINF: Fundamenta Informatica, 41, 2000.

[85]   D. Scharstein, R. Szeliski, and R. Zabih. A taxonomy and evaluation of dense two-frame stereo correspondence algorithms. In IEEE Workshop on Stereo and Multi-Baseline Vision, pages 131–140, Dec 2001.

[86]   H. Schirmacher. Efficient Aquisition, Representation, and Rendering of Light Fields. PhD thesis, Universit¨at des Saarlandes, June 2003.

[87]   S. M. Seitz and C. R. Dyer. View morphing. In International Conference on Computer graphics and interactive techniques, (ACM SIGGRAPH), pages 21–30. ACM Press, 1996.

[88]   I. Sexton and P. Surman. Stereoscopic and autostereoscopic display systems. IEEE Signal processing magazine, 16(3):85–99, May 1999.

[89]   J. Shade, S. Gortler, L. wei He, and R. Szeliski. Layered depth images. In International Conference on Computer Graphics and Interactive Techniques (ACM SIGGRAPH), pages 231–242. ACM Press, 1998.

[90]   C. Shu, A. Brunton, and M. Fiala. Automatic grid finding in calibration patterns using delaunay triangulation. Technical Report NRC-46497/ERB-1104, National Research Council, Institute for Information Technology, Montreal, Canada, Aug 2003.

[91]   R. Shukla, P. L. Dragotti, M. N. Do, and M. Vetterli. Rate-distortion optimized tree-structured compression algorithms for piecewise polynomial images. IEEE Transactions on Image Processing, 14(3):343–359, 2005.

[92]   H.-Y. Shum and S. B. Kang. A review of image-based rendering techniques. In Proceedings of SPIE, Visual Communications and Image Processing, volume 4067, pages 2–13, June 2000.

[93]   A. Smolic. 3D Video and Free Videopoint Video - Technologies, Applications, and MPEG Standards. IEEE workshop on Content generation and coding for 3D-television, June 2006.

[94]   J. Stolfi. Oriented Projective Geometry. Academic Press, Elsevier, 1991.

[95]   Y. Su, A. Vetro, and A. Smolic. Common test conditions for multiview video coding. ISO/IEC JTC1/SC29/WG11 and ITU SG16 Q.6 JVT-U211, october 2006.

[96]   J. Sun, N.-N. Zheng, and H.-Y. Shum. Stereo matching using belief propagation. IEEE Transactions on Pattern Analysis and Machine Intelligence, 25(7):787–800, 2003.

[97]   H. Tao and H. S. Sawhney. Global matching criterion and color segmentation based stereo. In IEEE Workshop on Applications of Computer Vision, pages 246–253, 2000.

[98]   H. Tao, H. S. Sawhney, and R. Kumar. A global matching framework for stereo computation. In IEEE International Conference on Computer Vision, volume 1, pages 532–539, 2001.

[99]   T. Thorm¨ahlen and H. Broszio. Automatic line-based estimation of radial lens distortion. Integrated Computer-Aided Engineering, 12(2):177–190, 2005.

[100]   D. Tzovaras, N. Grammalidis, and M. G. Strintzis. Disparity field and depth map coding for multiview image sequence. In IEEE International Conference on Image Processing, volume 2, pages 887–890, 1996.

[101]   A. Vetro and F. Bruls. Summary of BoG discussions on FTV. ISO/IEC JTC1/SC29/WG11 and ITU SG16 Q.6 JVT-Y087, October 2007.

[102]   A. Vetro, P. Pandit, H. Kimata, A. Smolic, and Y.-K. Wang. Joint draft 8.0 on multiview video coding. Joint Video Team (JVT) of ISO/IEC MPEG ITU-T VCEG ISO/IEC JTC1/SC29/WG11 and ITU-T SG16 Q.6, July 2008.

[103]   C. Wheatstone. Contributions to the physiology of vision - part the first. on some remarkable, and hitherto unobserved phenomena of binocular vision. Philosophical Transactions, 128:371–394, 1838.

[104]   R. M. Willett and R. D. Nowak. Platelets: a multiscale approach for recovering edges and surfaces in photon-limited medical imaging. IEEE Transactions on Medical Imaging, 22(3):332–350, 2003.

[105]   G. Wolberg. Digital Image Warping. IEEE Computer Society Press, July 1990.

[106]   S. W¨urmlin, E. Lamboray, and M. Gross. 3d video fragments: dynamic point samples for real-time free-viewpoint video. In Computers and Graphics, Special Issue on Coding, Compression and Streaming Techniques for 3D and Multimedia Data, pages 3–14. Elsevier, 2004.

[107]   S. Yea and A. Vetro. CE3: Study on depth issues. ISO/IEC JTC1/SC29/WG11 and ITU SG16 Q.6 JVT-X073, 2007.

[108]   Z. Zhang. A flexible new technique for camera calibration. IEEE Transactions on Pattern Analysis and Machine Intelligence, 22(11):1330–1334, 2000.

[109]   C. L. Zitnick and S. B. Kang. Stereo for image-based rendering using image over-segmentation. International Journal of Computer Vision, 75(1):49–65, October 2007.

[110]   C. L. Zitnick, S. B. Kang, M. Uyttendaele, S. Winder, and R. Szeliski. High-quality video view interpolation using a layered representation. ACM Transactions on Graphics, 23(3):600–608, 2004.

[111]   C. L. Zitnick, S. B. Kang, M. Uyttendaele, S. Winder, and R. Szeliski. Microsoft Research 3D Video Download. http://research.microsoft.com/en-us/um/people/sbkang/3dvideodownload, last visited: January 2009.