Do Privacy Policies Match with the Logs? An Empirical Study of Privacy Disclosure in Android Application Logs
Zhiyuan Chen, Love Jayesh Ahir, Ahmad Suleiman, Kundi Yao, Yiming Tang + 2 more
TLDR
Android app privacy policies rarely align with actual logging practices, leading to widespread sensitive data leakage not disclosed to users.
Key contributions
- Empirically studied 1,000 Android apps and over 86 million log entries.
- Found only 28.5% of policies mention logging, with 27.7% of those being vague.
- Revealed 67.6% of apps leak sensitive information not disclosed in their policies.
- Showed only 4% of apps consistently align their privacy policies with actual logged data.
Why it matters
This paper reveals a significant disconnect between Android app privacy policies and actual data logging, exposing widespread sensitive data leakage. It highlights that current policies are often vague or incomplete, failing to inform users adequately. This research underscores the urgent need for improved transparency and enforcement in mobile app privacy.
Original Abstract
Privacy policies are intended to inform users about how software systems collect and handle data, yet they often remain vague or incomplete. This paper presents an empirical study of patterns in log-related statements within privacy policies and their alignment with privacy disclosures observed in Android application logs. We analyzed 1,000 Android apps across multiple categories, generating 86,836,964 log entries. Our findings reveal that while most applications (88.0%) provide privacy policies, only 28.5% explicitly mention logging practices. Among those that reference logging, most clearly describe what information is logged; however, 27.7% of log-related statements remain overly simplistic or vague, offering limited insight into actual data collection. We further observed widespread privacy leakages in application logs, with 67.6% of apps leaking sensitive information not mentioned in their policies. Alarmingly, only 4% of applications demonstrated consistent alignment between declared policy contents and actual logged data. These findings highlight that current privacy policies provide incomplete or ambiguous descriptions of logging practices, which frequently do not align with actual logging behaviors.
📬 Weekly AI Paper Digest
Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.