Aes3D: Aesthetic Assessment in 3D Gaussian Splatting
Chuanzhi Xu, Boyu Wei, Haoxian Zhou, Xuanhua Yin, Zihan Deng + 3 more
TLDR
Aes3D introduces the first framework, dataset, and lightweight model for assessing the aesthetic quality of 3D Gaussian Splatting scenes directly from primitives.
Key contributions
- Proposes Aes3D, the first systematic framework for aesthetic assessment of 3D neural rendering scenes.
- Introduces Aesthetic3D, the first dataset with aesthetic annotations for 3DGS scenes, using a novel strategy.
- Presents Aes3DGSNet, a lightweight model that predicts aesthetic scores directly from 3D Gaussian primitives.
- Eliminates multi-view image rendering, significantly reducing computational cost and hardware requirements.
Why it matters
Current 3D scene evaluation overlooks higher-level aesthetics like composition and visual appeal. Aes3D addresses this gap by providing a novel framework, dataset, and efficient model to assess 3DGS scene aesthetics. This enables creators to build more visually compelling immersive media and digital content.
Original Abstract
As 3D Gaussian Splatting (3DGS) gains attention in immersive media and digital content creation, assessing the aesthetics of 3D scenes becomes important in helping creators build more visually compelling 3D content. However, existing evaluation methods for 3D scenes primarily emphasize reconstruction fidelity and perceptual realism, largely overlooking higher-level aesthetic attributes such as composition, harmony, and visual appeal. This limitation comes from two key challenges: (1) the absence of general 3DGS datasets with aesthetic annotations, and (2) the intrinsic nature of 3DGS as a low-level primitive representation, which makes it difficult to capture high-level aesthetic features. To address these challenges, we propose Aes3D, the first systematic framework for assessing the aesthetics of 3D neural rendering scenes. Aes3D includes Aesthetic3D, the first dataset dedicated to 3D scene aesthetic assessment, built on our proposed annotation strategy for 3D scene aesthetics. In addition, we present Aes3DGSNet, a lightweight model that directly predicts scene-level aesthetic scores from 3DGS representations. Notably, our model operates solely on 3D Gaussian primitives, eliminating the need for rendering multi-view images and thus reducing computational cost and hardware requirements. Through aesthetics-supervised learning on multi-view 3DGS scene representations, Aes3DGSNet effectively captures high-level aesthetic cues and accurately regresses aesthetic scores. Experimental results demonstrate that our approach achieves strong performance while maintaining a lightweight design, establishing a new benchmark for 3D scene aesthetic assessment. Code and datasets will be made available in a future version.
📬 Weekly AI Paper Digest
Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.