This paper outlines a novel architecture blueprint that use large language models (LLMs) to enhance ad targeting effectiveness through personalized messaging. 

Abstract

Artificial intelligence (AI) is revolutionizing how video service providers engage audiences through digital content delivery. This paper outlines a novel architecture blueprint that use large language models (LLMs) to enhance ad targeting effectiveness through personalized messaging. This approach is structured around four key functional pillars: automated audience segmentation based on core beliefs (Tru-Values); AI-driven tailored message crafting, offering a dual-layer of personalization; a feedback loop to achieve continuous improvements; and the implementation of rigorous safety standards including privacy protections and bias prevention. A simplified implementation of the feedback loop inspired by Reinforcement Learning from AI Feedback (RLAIF) techniques is demonstrated to assess and improve ad campaign effectiveness over time. The objective is to create a personalized viewer experience that resonates deeply with diverse audience segments, thereby generating heightened engagement. Ultimately, the use of generative AI facilitates the creation of multimodal, customized advertising content at scale, paving the way for targeting methods that are more practical across large audiences.

Introduction

In 2017, Netflix demonstrated the impact of personalized promotional messaging by adapting artwork based on user preferences and viewing history (1). The company’s content experts generated multiple images for every title and change them regularly to lure audiences based on their previous viewing history. The approach utilized a sophisticated “online reinforcement learning” strategy, optimizing the balance between exploiting known user preferences and exploring new data for improved recommendations. This dynamic methodology was essential in minimizing the cumulative “regret” (defined as the difference between the expected “payoff” (e.g. engagement) of the algorithm and the payoff of a single fixed strategy for selecting artworks) and enhancing viewer satisfaction over time. Despite its efficiency, this approach epitomized a few key principles under the legacy Personalization 1.0 paradigm, namely the need to choose between a finite set of creatives (while being limited by multimodal content creation’s cost and complexity), the reliance on high-quality (and therefore expensive) human preference labels to optimize algorithmic tuning, and the use of shallow context information (e.g. transient signals like search history) as proxies for viewers’ interests. Today’s advancements in Generative AI necessitate a complete re evaluation of the algorithmic / architectural trade-off and its relevance.