Topic hub

Quotations and Sourcing

How reporters, platforms, and AI systems attribute quotes, sources, and authorship, and standards for proper citation.

12 briefs

Latest in Quotations and Sourcing

Simon WillisonNEWSLETTER

Julia Evans on writing: 'Write for one person' (quote)

Simon Willison posted the quotation on 15th June 2026: Evans said she writes for a single imagined reader, often 'me, but 3 years ago'.

The Brieftide Daily

Briefs on Quotations and Sourcing, in your inbox.

Plus everything else from the frontier, edited down to a two-minute read each morning.

 

About Quotations and Sourcing

source attribution is the set of practices, technical signals, legal rules, and editorial norms used to show where words, ideas, and content come from. That spans reporters asking permission for direct quotes, platforms labeling who created a post, and AI systems tracing training examples or generated outputs back to sources. It sits at the intersection of journalistic ethics, platform design, copyright law, and new provenance tooling.

What this beat covers

This beat follows how sources are documented and presented. Coverage includes reporter workflows for confirming and attributing quotes, policies for including or redacting identifiers, licensing agreements that let commercial models use publisher content, and technical standards for embedding authorship metadata in feeds and APIs. It also tracks how automated systems surface provenance for AI outputs and the protocols used to mark machine generated text or translated excerpts.

Practices range from attribution lines and inline citations to machine-readable metadata such as author, timestamp, canonical URL, and unique IDs. Important questions include when permission is required for a direct quote, how much context must be preserved, and what counts as fair use. The beat also monitors correction protocols and how disputes about misquotation or misattribution are resolved.

Key tensions and sub-areas

  • Journalistic sourcing versus platform scale. Reporters rely on relationships, notes, and on-the-record sourcing. Platforms prioritize content flow and moderation, which can obscure provenance or strip context.

  • Copyright and licensing for AI. Publishers and data owners negotiate terms for models trained on their content. Those deals affect whether AI can reproduce verbatim quotes and how attribution must appear.

  • Technical metadata and feed practices. Syndication formats such as Atom and RSS, canonical links, and content-disposition headers determine whether quotes retain origin information when republished or scraped.

  • Privacy, safety, and consent. Attribution can risk exposing private sources or victims. Redaction and anonymization are sometimes necessary, which creates trade-offs with transparency and verifiability.

  • Automated provenance and fact-checking. New tools aim to tag generated text with source pointers or confidence scores. Those systems raise accuracy and accountability questions when they fail or are gamed.

What to watch

  • Licensing deals and model access rules that change how quotes can be used by AI.
  • Adoption of standardized metadata for bylines, timestamps, and canonical sources in feeds and APIs.
  • Legal rulings on attribution, quoting, and liability for misattributed or fabricated quotes.
  • Tooling that embeds provenance in generated content and how platforms surface it to users.
Quotations and Sourcing Concept Map
Quotations and SourcingJournalistic sourcingPlatform moderation and licensingAI training and provenanceLegal and ethical risksTechnical metadata and feeds

More briefs in Quotations and Sourcing

  1. Anthropic run-rate disputed: Simon Willison rebuts Karen KwokSimon Willison
  2. Daniel Jalkut quotes: How reporters request permissionSimon Willison
  3. Kyle Ferrana: Atom feed best practices for publishersSimon Willison
  4. OpenAI licenses Grupo Folha and Grupo UOL for ChatGPTOpenAI
  5. Language and cognition, MIT's Olivia Honeycutt study on framingMIT News · AI

Explore related topics