Tracking Conversations: Measuring Content and Identity Exposure on AI Chatbots
Muhammad Jazlan, Ethan Wang, Yash Vekaria, Zubair Shafiq
TLDR
This paper systematically measures web tracking on 20 popular AI chatbots, revealing widespread third-party data sharing of conversation content and user identity.
Key contributions
- Systematically measured web tracking on 20 popular AI chatbots using sensitive prompts.
- Found 17 of 20 chatbots share data with at least one third party.
- Identified 3 chatbots sharing plaintext conversation text with Microsoft Clarity via session replay.
- Revealed 15 chatbots share conversation URLs or IDs with third-party ad/analytics endpoints.
Why it matters
This research highlights significant privacy risks associated with AI chatbot usage, as most services share sensitive conversation content and user identity with third parties. It underscores the urgent need for greater transparency and stronger privacy protections in the rapidly evolving AI chatbot ecosystem.
Original Abstract
AI chatbots are becoming a primary interface for seeking information. As their popularity grows, chatbot providers are starting to deploy advertising and analytics. Despite this, tracking on AI chatbots has not been systematically studied. We present a systematic measurement of web tracking on 20 popular AI chatbots. Under controlled settings using a sensitive prompt, we capture and compare network traffic in normal chats and, where supported, private chats. We search for exposure of two categories of information: content, including prompts, prompt-derived titles, chat URLs, and chat identifiers; and identity, including names, emails, account identifiers, first-party cookies, and explicit IP/User-Agent fields in payloads. We find that 17 of 20 chatbots share information with at least one third party. Three chatbots share plaintext conversation text, including both prompt and response snippets, with Microsoft Clarity through session replay. Fifteen chatbots share conversation URLs or chat identifiers with third-party advertising, analytics, or social endpoints. Several chatbots expose user identity through support widgets, analytics, advertising, and session replay tags; in some cases, hashed emails are shared.
📬 Weekly AI Paper Digest
Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.