Topic hub

Retrieval-Augmented Models

Covers techniques that combine retrieval and models, including long-term conversational memory, personalization, robustness, and modular model composition.

100 briefsUpdated Jul 5, 2026

Retrieval-Augmented Models · Page 4

Anthropic and Stanford: why larger LLMs learn rare skillsThe BrieftideJun 7
LLM Research Papers 2026 (Jan–May): Curated list and trendsThe BrieftideJun 6
ChatGPT Dreaming V3: OpenAI memory rollout in US, wider launchThe BrieftideJun 4
Meta AI support bot allowed Instagram account takeoversThe BrieftideJun 1
Mellum2 by JetBrains: 12B MoE model with 2.5B active paramsThe BrieftideJun 1
Reachy Mini goes fully local: run speech-to-speech stackThe BrieftideMay 27
Google Search: Gemini 3.5 Flash, AI agents and smart Search boxThe BrieftideMay 19
Ettin reranker family: six CrossEncoders, MTEB retrievalThe BrieftideMay 19
Gemini Omni Flash launch: Google DeepMind's multimodal video AIThe BrieftideMay 17
Gemma 4 and new LLM designs: KV sharing, PLE, compressed attentionThe BrieftideMay 16
Granite Embedding Multilingual R2: 97M tops sub-100M MTEB scoresThe BrieftideMay 14
NVIDIA Nemotron 3 Nano Omni launch: long-context multimodal AIThe BrieftideApr 28
DeepSeek-V4: million-token context release from Hugging FaceThe BrieftideApr 24
GRASP: Gradient Planning for World Models at Long HorizonsThe BrieftideApr 20
Gemini Robotics ER 1.6 release: improved embodied reasoningThe BrieftideApr 13
Sentence Transformers launches multimodal embeddings and rerankersThe BrieftideApr 9
Attention variants in LLMs: MHA, GQA, MLA, sparse & hybridThe BrieftideMar 22
Gemini 3.1 Pro release: DeepMind's model for complex tasksThe BrieftideFeb 19
DeepSeek V3: 671B model, MLA and MoE architectural choices, 2025The BrieftideJul 19
KV Cache in LLMs: How Sebastian Raschka Implements ItThe BrieftideJun 17
SecAlign and StruQ: Berkeley AI defenses cut prompt-injectionThe BrieftideApr 11

Explore related topics

Coding Agents291 AI Infrastructure287 AI Safety240 Benchmarks & Evals113 Enterprise AI Adoption32 Computational Biology18 Augmented Reality Hardware11 Commercial Space Industry8