Design Your Ad: Personalized Advertising Image and Text Generation with Unified Autoregressive Models
Yexing Xu, Wei Feng, Shen Zhang, Haohan Wang, Yuxin Qin + 13 more
TLDR
This paper introduces Uni-AdGen, a unified autoregressive model for personalized image and text ad generation, improving realism and user preference.
Key contributions
- Uni-AdGen: A unified autoregressive model generating personalized ad images and texts.
- Foreground perception and instruction tuning enhance the realism of generated ad content.
- Coarse-to-fine preference module captures user interests for personalized ad generation.
- Introduces PAd1M, a large-scale dataset, and PBS metric for personalized ad evaluation.
Why it matters
Current ad generation methods lack cross-modal perception and rely on average preferences. This paper addresses these issues by jointly generating personalized image-text ads, leading to more realistic and user-preferred advertisements. This can significantly improve e-commerce ad effectiveness.
Original Abstract
Generating realistic and user-preferred advertisements is a key challenge in e-commerce. Existing approaches utilize multiple independent models driven by click-through-rate (CTR) to controllably create attractive image or text advertisements. However, their pipelines lack cross-modal perception and rely on CTR that only reflects average preferences. Therefore, we explore jointly generating personalized image-text advertisements from historical click behaviors. We first design a Unified Advertisement Generative model (Uni-AdGen) that employs a single autoregressive framework to produce both advertising images and texts. By incorporating a foreground perception module and instruction tuning, Uni-AdGen enhances the realism of the generated content. To further personalize advertisements, we equip Uni-AdGen with a coarse-to-fine preference understanding module that effectively captures user interests from noisy multimodal historical behaviors to drive personalized generation. Additionally, we construct the first large-scale Personalized Advertising image-text dataset (PAd1M) and introduce a Product Background Similarity (PBS) metric to facilitate training and evaluation. Extensive experiments show that our method outperforms baselines in general and personalized advertisement generation. Our project is available at https://github.com/JD-GenX/Uni-AdGen.
📬 Weekly AI Paper Digest
Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.