Multimodal AIJune 17, 20265 min read

Using Cognitive Models to Train LMs: 26.5% Belief-Error Cut

Equation-to-Behavior Prompting and Equation-to-Behavior RL map cognitive equations onto language models.

The BrieftideJune 17, 2026

TL;DR

01Equation-to-Behavior Prompting and Equation-to-Behavior RL map cognitive equations onto language models.
02Griffiths and Peter Henderson submitted a paper to arXiv on 16 June 2026 that proposes using formal cognitive models to improve how language models simulate human decision-making.
03They mapped equation-based cognitive models onto language model behavior, then tested prompting and RL on persuasion games tied to legal decision-making.

Zirui Cheng, Zeyu Shen, Thomas L. Griffiths and Peter Henderson submitted a paper to arXiv on 16 June 2026 that proposes using formal cognitive models to improve how language models simulate human decision-making. The authors introduce Equation-to-Behavior Prompting to steer large language models toward mathematical specifications and Equation-to-Behavior RL to train smaller models to follow those rules, and they evaluate both on persuasion games based on legal decision-making.

What did the authors do?

They mapped equation-based cognitive models onto language model behavior, then tested prompting and RL on persuasion games tied to legal decision-making. The paper proposes Equation-to-Behavior Prompting for guiding large models to match cognitive models and Equation-to-Behavior RL to train small models to adhere to mathematical rules, and it evaluates these methods in simulated persuasion settings.

The authors ground their approach in cognitive science and economics, noting that people update beliefs differently: some act like Bayesians while others show biases such as motivated reasoning. They target specific, mathematical decision rules, including Bayesian updating, affine distortion, motivated updating and Grether's alpha-beta model, as the equations that simulations should reproduce.

How did the models perform?

Large models, with the Equation-to-Behavior Prompting approach, can approximate the equation-based specifications listed above, while small models fail to do so; applying Equation-to-Behavior RL to small models reduced belief error by 26.5% in out-of-distribution parameterizations. This 26.5% reduction is the paper's concrete measure of improved adherence to the target equations under RL training.

Beyond that single metric, the authors show that simulations built from these cognitive-model-aligned agents create more diverse training environments. Training small models to consider different kinds of decision-makers improved average belief change by 2.5% to 12% compared with training only on Bayesian agents, and those gains held even when the target of persuasion was GPT-5-mini.

Why does this matter?

The paper addresses a gap in current simulation practices: creators of large language models often use simulated humans for safety evaluations and training but do not cover the full breadth of human decision-making. By translating mathematical models of decision-making into prompts and enforcement via RL, the methods promise a clearer, testable way to inject known behavioral variation into simulated agents. That makes training and evaluation environments more representative of the variety of human reasoning patterns documented in cognitive science and economics.

What to watch

Whether Equation-to-Behavior Prompting and Equation-to-Behavior RL generalize beyond the paper's legal persuasion games and to more complicated mathematical models of human decision-making is the next signal to monitor. The authors note that the approach could enable research into more complex decision models; follow-on work applying these methods to different domains or to additional formal models will show how broadly the technique scales.

Paper details: arXiv:2606.17657, submitted 16 June 2026, authors Zirui Cheng, Zeyu Shen, Thomas L. Griffiths and Peter Henderson. The paper links its concrete results to specific equation-based specifications and reports a 26.5% belief-error reduction and a 2.5% to 12% improvement in average belief change under diverse training regimes.

Model methods and reported outcomes from arXiv:2606.17657

Item
Large models with Equation-to-Behavior Prompting	Can approximate equation-based specifications	Bayesian updating, affine distortion, motivated updating, Grether's α-β model
Small models with prompting only	Fail to approximate the equation-based specifications
Small models + Equation-to-Behavior RL	Reduce belief error by 26.5% in out-of-distribution parameterizations
Small models trained on diverse decision-makers	Improve average belief change by 2.5%–12% over Bayesian-only training	Gains hold even when persuading GPT-5-mini

Written by The Brieftide · Source: arXiv

The Brieftide Daily · 06:00

Briefs like this one, in your inbox every morning.

FreeOne email a dayEvery claim sourcedUnsubscribe in one click

Continue reading

LLMs: gpt-4o, gpt-4.1-mini and claude-sonnet-4.6 study

Analysis of 21,000 multi-turn conversations finds human-like behaviors vary by model and user and can be modulated by system prompts.

The BrieftideDAILY BRIEF

ThinkDeception: Progressive RL framework for multimodal deception

ThinkDeception on arXiv uses MLLMs, a step-by-step multimodal Chain of Thought dataset and a four-tier progressive RL trainer for.

The BrieftideDAILY BRIEF

Visual-Seeker: visual-native multimodal search surpasses rivals

Zhengbo Zhang and 12 co-authors submitted Visual-Seeker on 13 Jun 2026.

The BrieftideDAILY BRIEF

Gemma 4 12B: unified, encoder-free multimodal model for laptops

Google DeepMind’s 12B model brings encoder-free vision and native audio to laptops, runs on 16GB memory and is released under Apache 2.0.