The Future of Filmmaking: AI for Volumetric Capture and Reconstruction

Speaker:  Ilke Demir – Hermosa Beach, CA, United States
Topic(s):  Artificial Intelligence, Machine Learning, Computer Vision, Natural language processing


One picture is worth a thousand words, so what have been told with videos? What about 100 simultaneous videos to reconstruct every frame of life in a 10.000 sq. ft dome? Is it enough to reconstruct and digitize us realistically? Similar to other industries, entertainment industry is also being reshaped by AI, especially towards AR/VR consumption. Before democratization of AI and data, such immersive experiences were lacking an essential element: photorealism. As the amount of data increased, our models got deeper, and the reality became decipherable. 

This talk will introduce recent deep learning advancements in 3D vision, reconstruction, and shape understanding techniques with a focus on generative models to digitize performances and scenes. Then we will shift gears with an overview of such models in 3D, and their progression on voxels, point clouds, meshes, graphs, and other 3D representations. Back to our studio, in addition to a discussion about how to process such large visual data, the challenges of scaling 10x over current capture platforms, and over 200x over state-of-the-art datasets will be presented. The talk will conclude with a sneak peek of upcoming VR/AR productions from the worldÕs largest volumetric capture stage at Intel Studios, as an example of real-world use cases of such AI approaches.

About this Lecture

Number of Slides:  65
Duration:  90 minutes
Languages Available:  English
Last Updated: 

Request this Lecture

To request this particular lecture, please complete this online form.

Request a Tour

To request a tour with this speaker, please complete this online form.

All requests will be sent to ACM headquarters for review.