论文标题
关于神经渲染的艺术状态
State of the Art on Neural Rendering
论文作者
论文摘要
光真逼真的虚拟世界的有效渲染是计算机图形的长期努力。现代图形技术已成功地从手工制作的场景表示中综合了照片真实的图像。但是,自动生成形状,材料,照明和场景的其他方面仍然是一个具有挑战性的问题,如果解决,它将使照片现实的计算机图形学更广泛地访问。同时,计算机视觉和机器学习方面的进展已经引起了一种新的图像合成和编辑方法,即深层生成模型。神经渲染是一个新的迅速新兴领域,将生成机器学习技术与来自计算机图形的物理知识相结合,例如,通过将可区分渲染的集成到网络培训中。借助大量的计算机图形和视觉应用程序,神经渲染有望成为图形社区的新领域,但对这种新兴领域的调查不存在。该最新报告总结了神经渲染的最新趋势和应用。我们专注于将经典计算机图形技术与深层生成模型相结合的方法,以获得可控和光真逼真的输出。从基础计算机图形和机器学习概念的概述开始,我们讨论了神经渲染方法的关键方面。该最新报告的重点是许多重要的用例,用于所描述的算法,例如新型视图综合,语义照片操纵,面部和身体重演,重新构成,免费视频视频,以及为虚拟现实和增强现实的照片创建照片真实的化身。最后,我们在讨论了这种技术的社会含义并研究开放研究问题的情况下进行了讨论。
Efficient rendering of photo-realistic virtual worlds is a long standing effort of computer graphics. Modern graphics techniques have succeeded in synthesizing photo-realistic images from hand-crafted scene representations. However, the automatic generation of shape, materials, lighting, and other aspects of scenes remains a challenging problem that, if solved, would make photo-realistic computer graphics more widely accessible. Concurrently, progress in computer vision and machine learning have given rise to a new approach to image synthesis and editing, namely deep generative models. Neural rendering is a new and rapidly emerging field that combines generative machine learning techniques with physical knowledge from computer graphics, e.g., by the integration of differentiable rendering into network training. With a plethora of applications in computer graphics and vision, neural rendering is poised to become a new area in the graphics community, yet no survey of this emerging field exists. This state-of-the-art report summarizes the recent trends and applications of neural rendering. We focus on approaches that combine classic computer graphics techniques with deep generative models to obtain controllable and photo-realistic outputs. Starting with an overview of the underlying computer graphics and machine learning concepts, we discuss critical aspects of neural rendering approaches. This state-of-the-art report is focused on the many important use cases for the described algorithms such as novel view synthesis, semantic photo manipulation, facial and body reenactment, relighting, free-viewpoint video, and the creation of photo-realistic avatars for virtual and augmented reality telepresence. Finally, we conclude with a discussion of the social implications of such technology and investigate open research problems.