Topic hub

Retrieval-Augmented Models

Covers techniques that combine retrieval with models, including long-term conversational memory, personalization, robustness, and modular model composition.

100 briefs

Retrieval-Augmented Models · Page 4

  1. Anthropic and Stanford: why larger LLMs learn rare skills · The Brieftide
  2. LLM Research Papers 2026 (Jan–May): Curated list and trends · The Brieftide
  3. ChatGPT Dreaming V3: OpenAI memory rollout in US, wider launch · The Brieftide
  4. Meta AI support bot allowed Instagram account takeovers · The Brieftide
  5. Mellum2 by JetBrains: 12B MoE model with 2.5B active params · The Brieftide
  6. Reachy Mini goes fully local: run speech-to-speech stack · The Brieftide
  7. Google Search: Gemini 3.5 Flash, AI agents and smart Search box · The Brieftide
  8. Ettin reranker family: six CrossEncoders, MTEB retrieval · The Brieftide
  9. Gemini Omni Flash launch: Google DeepMind's multimodal video AI · The Brieftide
  10. Gemma 4 and new LLM designs: KV sharing, PLE, compressed attention · The Brieftide
  11. Granite Embedding Multilingual R2: 97M tops sub-100M MTEB scores · The Brieftide
  12. NVIDIA Nemotron 3 Nano Omni launch: long-context multimodal AI · The Brieftide
  13. DeepSeek-V4: million-token context release from Hugging Face · The Brieftide
  14. GRASP: Gradient Planning for World Models at Long Horizons · The Brieftide
  15. Gemini Robotics ER 1.6 release: improved embodied reasoning · The Brieftide
  16. Sentence Transformers launches multimodal embeddings and rerankers · The Brieftide
  17. Attention variants in LLMs: MHA, GQA, MLA, sparse & hybrid · The Brieftide
  18. Gemini 3.1 Pro release: DeepMind's model for complex tasks · The Brieftide
  19. DeepSeek V3: 671B model, MLA and MoE architectural choices, 2025 · The Brieftide
  20. KV Cache in LLMs: How Sebastian Raschka Implements It · The Brieftide
  21. SecAlign and StruQ: Berkeley AI defenses cut prompt-injection · The Brieftide
