GRM: Large Gaussian Reconstruction Model for Efficient 3D Reconstruction and Generation
arXiv (Cornell University)(2024)
摘要
We introduce GRM, a large-scale reconstructor capable of recovering a 3Dasset from sparse-view images in around 0.1s. GRM is a feed-forwardtransformer-based model that efficiently incorporates multi-view information totranslate the input pixels into pixel-aligned Gaussians, which are unprojectedto create a set of densely distributed 3D Gaussians representing a scene.Together, our transformer architecture and the use of 3D Gaussians unlock ascalable and efficient reconstruction framework. Extensive experimental resultsdemonstrate the superiority of our method over alternatives regarding bothreconstruction quality and efficiency. We also showcase the potential of GRM ingenerative tasks, i.e., text-to-3D and image-to-3D, by integrating it withexisting multi-view diffusion models. Our project website is at:https://justimyhxu.github.io/projects/grm/.
更多查看译文
关键词
3D Reconstruction,Volume Rendering,Rendering,Image Segmentation
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要