GRM: Large Gaussian Reconstruction Model for Efficient 3D Reconstruction and Generation

arXiv (Cornell University)(2024)

引用 0|浏览68
暂无评分
摘要
We introduce GRM, a large-scale reconstructor capable of recovering a 3Dasset from sparse-view images in around 0.1s. GRM is a feed-forwardtransformer-based model that efficiently incorporates multi-view information totranslate the input pixels into pixel-aligned Gaussians, which are unprojectedto create a set of densely distributed 3D Gaussians representing a scene.Together, our transformer architecture and the use of 3D Gaussians unlock ascalable and efficient reconstruction framework. Extensive experimental resultsdemonstrate the superiority of our method over alternatives regarding bothreconstruction quality and efficiency. We also showcase the potential of GRM ingenerative tasks, i.e., text-to-3D and image-to-3D, by integrating it withexisting multi-view diffusion models. Our project website is at:https://justimyhxu.github.io/projects/grm/.
更多
查看译文
关键词
3D Reconstruction,Volume Rendering,Rendering,Image Segmentation
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要