AI InfrastructureJune 16, 20264 min read

Attribute Inference from Interactive Targeted Ads, arXiv benchmark

Bayesian and supervised attacks reach about 0.64 AUC after 160 campaigns; disclosure policy most reduces the released inference signal.

The BrieftideJune 16, 2026

TL;DR

01Bayesian and supervised attacks reach about 0.64 AUC after 160 campaigns; disclosure policy most reduces the released inference signal.
02The work treats the channel created by user interactions with targeted ads as a noisy oracle for attribute inference and evaluates multiple attack strategies and disclosure controls.
03The paper separates four components of an interactive ad pipeline: targeting predicates, exposure, interaction, and disclosure.

Peihao Li submitted a paper titled "Attribute Inference from Interactive Targeted Ads" to arXiv on 13 Jun 2026, presenting a formal model and a reproducible benchmark that measures how much sensitive information advertisers can infer from interactive ad campaigns. The work treats the channel created by user interactions with targeted ads as a noisy oracle for attribute inference and evaluates multiple attack strategies and disclosure controls.

What the paper does

The paper separates four components of an interactive ad pipeline: targeting predicates, exposure, interaction, and disclosure. That decomposition isolates the gap between eligibility and actual delivery, and between an interaction and whether an advertiser sees it. Li builds a simulator that produces ground truth labels, event traces, disclosed observations, and evaluation metrics. The synthetic populations used for the benchmark are calibrated with public data and each carries known sensitive labels. A generated campaign semantics layer supplies topic variants and response priors which drive user reactions in the simulator.

The evaluation compares Bayesian, supervised, positive and unlabeled, and adaptive attacks under common campaign and disclosure definitions. The final evaluation uses four topic variants, seven simulator seeds, and two interaction settings to measure how repeated campaigns and disclosure choices affect inference signal.

Key findings from the benchmark

Repeated campaigns with identity exposure produce measurable but bounded inference signal in the simulated setting. At 160 campaigns, Bayesian and supervised attacks reach about 0.64 AUC in the main setting and about 0.65 AUC in the higher interaction setting. Disclosure policy is identified as the strongest control: aggregate reporting removes the evaluated oracle input tied to users, while type filtering and randomized disclosure reduce the released signal.

The paper delivers three deliverables: a formal model that distinguishes eligibility, exposure, interaction, and disclosure; a simulator and dataset generation pipeline calibrated to public data; and an evaluation of attack algorithms and disclosure defenses. The code for the benchmark is available at the paper's https URL.

Why it matters

The model shows that interactive ad campaigns can create a channel through which advertisers receive observations tied to individuals rather than only aggregates. That channel can be exploited by several inference strategies; the benchmark quantifies the strength of that signal under controlled, repeatable conditions. The results make clear that disclosure choices materially change the attacker’s input: moving from per-user disclosures to aggregate reporting removes the oracle signal evaluated in this study, and simpler mitigations such as type filtering and randomized disclosure reduce but do not necessarily eliminate signal.

What to watch

Look for other researchers to reuse the provided benchmark and simulator (the paper includes code at its https URL) to reproduce or extend the experiments, and for follow-up work that measures how real ad platform disclosure policies map to the paper’s disclosure definitions. Also watch whether future evaluations vary campaign counts, interaction models, or disclosure mechanisms beyond the four topic variants and two interaction settings reported here.

AUC at 160 repeated campaigns

Item
Bayesian attacks at 160 campaigns	64	65
Supervised attacks at 160 campaigns	64	65

Written by The Brieftide · Source: arXiv

The Brieftide Daily · 06:00

Briefs like this one, in your inbox every morning.

FreeOne email a dayEvery claim sourcedUnsubscribe in one click

Continue reading

Cognitive Debt: AI leverage and systemic fragility model

Shuchen Meng's formal theory explains how substitutive AI builds 'cognitive debt', compounds leverage in calm periods.

The BrieftideDAILY BRIEF

Germany approves DE-AISI, an AI security institute based on UK

The National Security Council authorised a German AI Security Institute to test advanced models.

The BrieftideDAILY BRIEF

China's 2 trillion yuan AI buildout needs 80% domestic chips

Beijing plans roughly 2 trillion yuan over five years to knit data centers into a national network and require at least 80 percent domestic.

The BrieftideDAILY BRIEF

Apple Siri AI at WWDC 2026: built with Google and Nvidia

Apple unveiled Siri AI at WWDC 2026, using Apple Foundation Models refined with Google technology and Nvidia-powered Private Cloud Compute.