LOGIN TO YOUR ACCOUNT

Username
Password
Remember Me
Or use your Academic/Social account:

CREATE AN ACCOUNT

Or use your Academic/Social account:

Congratulations!

You have just completed your registration at OpenAire.

Before you can login to the site, you will need to activate your account. An e-mail will be sent to you with the proper instructions.

Important!

Please note that this site is currently undergoing Beta testing.
Any new content you create is not guaranteed to be present to the final version of the site upon release.

Thank you for your patience,
OpenAire Dev Team.

Close This Message

CREATE AN ACCOUNT

Name:
Username:
Password:
Verify Password:
E-mail:
Verify E-mail:
*All Fields Are Required.
Please Verify You Are Human:
fbtwitterlinkedinvimeoflicker grey 14rssslideshare1
Duckworth, TW
Languages: English
Types: Doctoral thesis
Subjects: media_dig_tech_and_creative_econ
This thesis investigates visual, spatial and temporal qualities of video based reconstruction with respect to telepresence. State of the art was improved and validated through a new parallelisation of an established algorithm; a tool that allows visio-spatial impact of algorithm and camera arrangement to be visualised; and a set of experiments to derive requirements and investigate outcomes. The motivation is to support the exchange of appearance and attention between moving humans through video based reconstruction. A previous research project showed moving humans could faithfully convey attention in virtual environments and appearance through video-conferencing, suggesting that it may be possible to combine the two. Video based 3D reconstruction of humans appeared to be able to achieve both, but it was uncertain whether this could be achieved at sufficient quality. Research began by justifying the approach and setting the requirements. A literature survey and initial experiments indicated that the visual hull provided a form suitable for modeling humans. However, evidence of visual and temporal qualities necessary to support gaze was not found. A state of the art visual hull reconstruction algorithm was parallelised to run on a modern multi-core processor, enabling human reconstruction on a single computer, thus providing stable visual and temporal qualities. A parallelisation scheme theoretically better suited for execution on a single multi-core processor than distributed over a network is proposed. Importantly, the way in which the problem is parallelised has been optimised to reflect the various stages of the process rather than the need to minimise data communication across a network. A utility application has been developed providing a framework for rapidly testing algorithms, validating requirements, and as a platform for conducting experiments. This underpinned a collaborative experiment that showed for the first time that eye gaze could be conveyed to accuracies sufficient for human social interaction. To facilitate the analysis, the utility allowed the impact of camera placement on spatial and visual quality to be investigated.
  • The results below are discovered through our pilot algorithms. Let us know how we are doing!

    • 2 Methodology 14 2.1 Setting . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 15 2.2 Aim and Objectives . . . . . . . . . . . . . . . . . . . . . . . . . 15 4.3.4 Background segmentation . . . . . . . . . . . . . . . . . 76 4.3.5 Camera exposure and colour balance . . . . . . . . . . . 79 4.3.6 Reconstructed model transmission . . . . . . . . . . . . . 80 4.3.7 Experiment E1 - Comparing the latency of an ICVE and video conference . . . . . . . . . . . . . . . . . . . . . . 80 4.4 Temporal consistency of inputs and quality . . . . . . . . . . . . 82 4.5.3 Discussion . . . . . . . . . . . . . . . . . . . . . . . . . 103 4.6 Conclusion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 106 4.6.1 Quality measures and their relationship . . . . . . . . . . 106 7.2 Review of hypotheses . . . . . . . . . . . . . . . . . . . . . . . . 195 7.2.1 Shortcomings of the research . . . . . . . . . . . . . . . . 196 7.3 Conclusion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 198 7.3.1 Future research directions . . . . . . . . . . . . . . . . . 200 7.1.6 Prototyping and evaluation . . . . . . . . . . . . . . . . . 193
    • [1] Je´re´mie Allard, Jean-Se´bastien Franco, Cle´ment Menier, Edmond Boyer, and Bruno Raffin. The GrImage Platform: A Mixed Reality Environment for Interactions. In Proceedings of the Fourth IEEE International Conference on Computer Vision Systems (ICVS '06), New York, January 2006. IEEE Computer Society.
    • [2] Je´re´mie Allard, Cle´ment Menier, Bruno Raffin, Edmond Boyer, and Franc¸ois Faure. Grimage: markerless 3D interactions. Proceedings of the 34th annual conference on Computer Graphics and Interactive Techniques (SIGGRAPH '07), August 2007.
    • [3] Brett Allen, Brian Curless, and Zoran Popovic´. Articulated body deformation from range scan data. In Proceedings of the 29th annual conference on Computer Graphics and Interactive Techniques (SIGGRAPH '02), San Antonio, July 2002.
    • [4] Gene M Amdahl. Validity of the single processor approach to achieving large scale computing capabilities. In Proceedings of the spring joint computer conference (AFIPS '67), Anaheim, April 1967. ACM.
    • [5] Stuart M Anstis, John W Mayhew, and Tania Morley. The perception of where a face or television'portrait'is looking. The American journal of psychology, 82(4):474-489, 1969.
    • [6] Michael Argyle and Mark Cook. Gaze and Mutual Gaze. Cambridge University Press, January 1976.
    • [8] H Baker. Three-dimensional modelling. Proceedings of the 5th International Joint Conference on Artificial Intelligence, pages 649-655, 1977.
    • [9] Bruce Baumgart. A polyhedron representation for computer vision. In Proceedings of the national computer conference and exposition (AFPIS '75), pages 589-596, Anaheim, May 1975.
    • [15] J Butime, L Galo, and I Gutierrez. Application of Computer Vision to 3D Reconstruction: A Survey of Reconstruction Methods. A Survey of Reconstruction Methods. VDM Publishing, 2010.
    • [16] WAS Buxton, A J Sellen, and M C Sheasby. Interfaces for multiparty videoconferences. Video Mediated Communication., pages 385-400, 1997.
    • [17] Joel Carranza, Christian Theobalt, Marcus A Magnor, and Hans-Peter Seidel. Free-viewpoint video of human actors. In Proceedings of the 30th annual conference on Computer Graphics and Interactive Techniques (SIGGRAPH '03), San Diego, July 2003.
    • [18] B Chazelle and H Edelsbrunner. An optimal algorithm for intersecting line segments in the plane. In 29th Annual Symposium on Foundations of Computer Science, pages 590-600, White Plains, October 1988.
    • [19] Milton Chen. Leveraging the asymmetric sensitivity of eye contact for videoconference. In Proceedings of the Conference on Human Factors in Computing Systems (SIGCHI '02), Minneapolis, April 2002.
    • [20] G K M Cheung, T Kanade, J Y Bouguet, and M Holler. A real time system for robust 3D voxel reconstruction of human motions. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2000), pages 714-720, Hilton Head Island, June 2000.
    • [21] G K M Cheung, S Baker, and T Kanade. Visual hull alignment and refinement across time: a 3D reconstruction algorithm combining shape-fromsilhouette with stereo. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR '03), Madison, June 2003.
    • [22] C H Chien and J K Aggarwal. Volume/surface octrees for the representation of three-dimensional objects. Computer Vision, Graphics, and Image Processing, 36(1), November 1986.
    • Machine Vision Applications (MVA '94), pages 147-150, Kanagawa, December 1994. Citeseer.
    • [24] Slo-Li Chu and Chih-Chieh Hsiao. OpenCL: Make Ubiquitous Supercomputing Possible. In IEEE International Conference on High Performance Computing and Communications (HPCC 2010), pages 556-561, Melbourne, September 2010. IEEE.
    • [25] Antonio Criminisi, Jamie Shotton, Andrew Blake, and Philip H S Torr. Gaze manipulation for one-to-one teleconferencing. In Proceedings of the 9th IEEE International Conference on Conputer Vision (ICCV '03), pages 191-198, Nice, October 2003.
    • [26] Carolina Cruz-Neira. Virtual reality based on multiple projection screens: the cave and its applications to computational science and engineering. PhD thesis, University of Illinois, Chicago, September 1995.
    • [27] B Cyganek and J P Siebert. An introduction to 3D computer vision techniques and algorithms. Wiley, 2011.
    • [28] E R Davies. Machine vision: theory, algorithms, practicalities. Elsevier, 2004.
    • [29] Mark de Berg, Otfried Cheong, Marc van Kreveld, and Mark Overmans. Delaunay Triangulations. In Computational Geometry: Algorithms and Applications, pages 191-218. Springer-Verlag, March 2008.
    • [30] Paul E Debevec, Camillo J Taylor, and Jitendra Malik. Modeling and rendering architecture from photographs: a hybrid geometry- and image-based approach. In Proceedings of the 23rd annual conference on Computer graphics and interactive techniques (SIGGRAPH '96), New Orleans, August 1996.
    • [31] B Delaunay. Sur la sphere vide. Bulletin of Academy of Sciences of the USSR, 6:793-800, 1934.
    • [32] T Duckworth and D.J Roberts. Accelerated polyhedral visual hulls using OpenCL. In IEEE Virtual Reality Conference (VR '11), pages 203-204, Singapore, March 2011.
    • [33] Tobias Duckworth and David J Roberts. Camera image synchronisation in multiple camera real-time 3D reconstruction of moving humans. In IEEE/ACM 15th International Symposium on Distributed Simulation and Real Time Applications (DSRT '11), pages 138-144, Salford, September 2011. IEEE Computer Society.
    • [34] Tobias Duckworth and David J Roberts. Parallel processing for real-time 3D reconstruction from video streams. Journal of Real-Time Image Processing, pages 1-19, 2012.
    • [36] CH Esteban and F Schmitt. Multi-stereo 3d object reconstruction. International Symposium on 3D Processing, Visualization, and Transmission (3DPVT '02), pages 159-166, June 2002.
    • [37] OD Faugeras, M Hebert, P Mussi, and JD Boissonnat. Polyhedral approximation of 3-D objects without holes. Computer Vision, Graphics, and Image Processing, 25(2):169-183, 1984.
    • [41] J Franco, C Menier, E Boyer, and B Raffin. A Distributed Approach for Real Time 3D Modeling. In Conference on Computer Vision and Pattern Recognition Workshop (CVPRW '04), pages 31-31, Washington, June 2004.
    • [42] H Fuchs, N Kelshikar, J Mulligan, and K Daniilidis. 3D Tele-Collaboration over internet 2. In International Workshop on Immersive Telepresence (ITP '02), Juan Les Pin, December 2002.
    • [43] Y Furukawa and J Ponce. Accurate, Dense, and Robust Multi-View Stereopsis. In Proceedings of the IEEE Conference on Computer Graphics and Pattern Recognition (CVPR '07), Minneapolis, June 2007.
    • [44] Y Furukawa and J Ponce. Accurate camera calibration from multi-view stereo and bundle adjustment. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR '08), Anchorage, June 2008.
    • [45] Paolo Simone Gasparello, Giuseppe Marino, Filippo Banno`, Franco Tecchia, and Massimo Bergamasco. Real-Time Network Streaming of Dynamic 3D Content with In-frame and Inter-frame Compression. In IEEE/ACM 15th International Symposium on Distributed Simulation and Real Time Applications (DSRT '11), pages 81-87, Salford, September 2011. IEEE.
    • [46] Jim Gemmell, Kentaro Toyama, C Lawrence Zitnick, Thomas Kang, and Steven Seitz. Gaze Awareness for Video-Conferencing: A Software Approach. IEEE Multimedia, 7(4), October 2000.
    • [47] Antonin Guttman. R-trees: a dynamic index structure for spatial searching. In Proceedings of the ACM Conference on Management of Data (SIGMOD '84), pages 47-57, Boston, June 1984.
    • [48] O Hall-Holt and S Rusinkiewicz. Stripe boundary codes for real-time structured-light range scanning of moving objects. In Proceedings of the 8th International Conference on Computer Vision (ICCV '01), pages 359- 366, Vancouver, July 2001. IEEE Comput. Soc.
    • [49] Richard Hartley and Andrew Zisserman. Multiple View Geometry in Computer Vision. Cambridge University Press, 2 edition, September 2003.
    • [50] J Herraiz, S Espana, S Garcia, R Cabido, A Montemayor, M Desco, J Vaquero, and J Udias. GPU acceleration of a fully 3D Iterative Reconstruction Software for PET using CUDA. IEEE Nuclear Science Symposium and Medical Imaging Conference (NSS/MIC '09), pages 4064-4067, October 2009.
    • [51] Mark D Hill and Michael R Marty. Amdahl's Law in the Multicore Era. IEEE Computer, 41(7):33-38, July 2008.
    • [52] A Hilton, D Beresford, T Gentils, R Smith, W Sun, and J Illingworth. Whole-body modelling of people from multiview images to populate virtual worlds. The Visual Computer, 16(7):411-436, November 2000.
    • [53] G Hu, A K Jain, and G Stockman. Shape from light stripe texture. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR '86), pages 412-414, Miami, June 1986.
    • [58] Andrew Jones, Magnus Lang, Graham Fyffe, Xueming Yu, Jay Busch, Ian McDowall, Mark Bolas, and Paul Debevec. Achieving eye contact in a oneto-many 3D video teleconferencing system. ACM Transactions on Graphics, 28(3), August 2009.
    • [59] Roy S Kalawsky. Science of Virtual Reality and Virtual Environments. Addison Wesley Longman Publishing Co., Inc, April 2004.
    • [60] Kibum Kim, John Bolton, Audrey Girouard, Jeremy Cooperstock, and Roel Vertegaal. TeleHuman: effects of 3d perspective on gaze and pose estimation with a life-size cylindrical telepresence pod. In Proceedings of the ACM Conference on Human Factors in Computing Systems (SIGCHI '12), Austin, May 2012.
    • [61] D Knoblauch and F Kuester. VirtualizeMe: interactive model reconstruction from stereo video streams. In Proceedings of the ACM symposium on Virtual Reality Software and Technology (VRST '08), pages 193-196, Bordeaux, October 2008.
    • [62] G Kurillo, R Bajcsy, K Nahrsted, and O Kreylos. Immersive 3D Environment for Remote Collaboration and Training of Physical Activities. In IEEE Virtual Reality Conference (VR '08), pages 269-270, Reno, March 2008.
    • [63] Claudia Kuster, Tiberiu Popa, Jean-Charles Bazin, Craig Gotsman, and Markus Gross. Gaze correction for home video conferencing. Transactions on Graphics (TOG, 31(6), November 2012.
    • [64] KN Kutulakos and SM Seitz. A theory of shape by space carving. International Journal of Computer Vision, 38(3):199-218, 2000.
    • [65] A Ladikos, S Benhimane, and N Navab. Efficient visual hull computation for real-time 3D reconstruction using CUDA. In IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops (CVPRW '08), Anchorage, June 2008.
    • [66] Denis Laurendeau, Nathalie Bertrand, and Re´gis Houde. The mapping of texture on VR polygonal models. In Proceedings of the 2nd international conference on 3-D digital imaging and modeling (3DIM '99), Ottawa, October 1999. IEEE Computer Society.
    • [75] C N Macrae, B M Hood, A B Milne, A C Rowe, and M F Mason. Are You Looking at Me? Eye Gaze and Person Perception. Psychological Science, 13(5):460-464, September 2002.
    • [91] M Okutomi and T Kanade. A Multiple-Baseline Stereo. IEEE Transactions on Pattern Analysis and Machine Intelligence, 15(4):353-363, April 1993.
    • [92] Benjamin Petit, Jean-Denis Lesage, Jean-Se´bastien Franco, Edmond Boyer, and Bruno Raffin. Grimage: 3D modeling for remote collaboration and telepresence. In ACM Symposium on Virtual Reality Software and Technology (VRST '08), pages 299-300, Bordeaux, October 2008. ACM.
    • [93] Benjamin Petit, Jean-Denis Lesage, Cle´ment Menier, Je´re´mie Allard, JeanSe´bastien Franco, Bruno Raffin, Edmond Boyer, and Franc¸ois Faure. Multicamera Real-Time 3D Modeling for Telepresence and Remote Collaboration. International Journal of Digital Multimedia Broadcasting, 2010(2): 150-162, 2010.
    • [94] M Pollefeys, S Sinha, L Guan, and J Franco. Multiview Calibration Synchronization and Dynamic Scene Reconstruction. In Multi-Camera Networks: Principles and applications, pages 29-72. Elsevier, January 2009.
    • [99] V Ramesh, Robert L Glass, and Iris Vessey. Research in computer science: an empirical study. Journal of Systems and Software, 70(1-2):165-176, February 2004.
    • [114] Jonathan Starck, Gordon Collins, Raymond Smith, Adrian Hilton, and John Illingworth. Animated statues. Machine Vision and Applications, 14(4): 248-259, September 2003.
    • [115] G C Stockman, S W Chen, G Hu, and N Shrikhande. Sensing and recognition of rigid objects using structured light. IEEE Control Systems Magazine, 8(3):14-22, June 1988.
    • [123] T Wada, X Wu, S Tokai, and T Matsuyama. Homography based parallel volume intersection: Toward real-time volume reconstruction using active cameras. In Proc of Computer Architectures for Machine Perception (CAMP 2000), pages 331-339, Padova, September 2000.
  • No related research data.
  • No similar publications.

Share - Bookmark

Cite this article