Topic hub

Model Compression

Covers methods to shrink and speed AI models, including pruning, quantization, distillation, MoE compression, and train and inference alignment.

74 briefs

Model Compression · Page 2

  1. Large Language Models: Small Initialization Improves ReasoningThe Brieftide
  2. Amazon SageMaker AI container caching: up to 2x faster scalingThe Brieftide
  3. AI Engram: geometric memory traces in deep networks, ICML oralThe Brieftide
  4. BioNeMo Recipes: LoRA fine-tunes ESM2-3B and Evo2-1B on RTX 6000The Brieftide
  5. Count Anything: Tsinghua model, CLOC dataset and benchmarksThe Brieftide
  6. Satya Nadella warns on token-maxing and maps developer futureThe Brieftide
  7. Microsoft SkillOpt boosts GPT-5.5 by about 23 points on tasksThe Brieftide
  8. DiffusionGemma: 4x faster text generation, 26B MoEThe Brieftide
  9. Cohere North Mini Code: 30B Mixture-of-Experts launchThe Brieftide
  10. Gemma 4 12B: unified, encoder-free multimodal model for laptopsThe Brieftide
  11. Microsoft Research Lens: detailed captions beat raw scaleThe Brieftide
  12. Anthropic and Stanford: why larger LLMs learn rare skillsThe Brieftide
  13. Sakana AI launches RSI Lab, outlines four-phase roadmapThe Brieftide
  14. DPO for OCR: cuts text degeneration by 59.4% on DharmaOCRThe Brieftide
  15. Mellum2 by JetBrains: 12B MoE model with 2.5B active paramsThe Brieftide
  16. Profiling in PyTorch: Beginner's Guide to torch.profilerThe Brieftide
  17. Nemotron-Labs Diffusion: 6.4× self-speculation speed on 8B modelsThe Brieftide
  18. DharmaOCR benchmark: 3B specialized model beats frontier APIsThe Brieftide
  19. OlmoEarth v1.1 release: Up to 3× cheaper satellite AIThe Brieftide
  20. Gemma 4 and new LLM designs: KV sharing, PLE, compressed attentionThe Brieftide
  21. GridSFM release: Microsoft's model solves AC‑OPF in millisecondsThe Brieftide
  22. Serverless GPUs add Opus 4.7 Fast and Qwen Image 2.0 supportThe Brieftide
  23. OpenAI Codex with GPT-5.5 used to ship production systemsThe Brieftide
  24. Parameter Golf challenge draws 1,000+ participants, 2,000+The Brieftide

Explore related topics