The Salt - Curated AI

The Salt - Curated AI

Home
Notes
AI Notebooks
AI Repositories
Related Articles
deep dive
Archive
About
When Quantization Improves Reinforcement Learning
The Weekly Salt #91
7 hrs ago • 
Benjamin Marie
3
Could GRPO Be an "Off-Policy" Algorithm?
The Weekly Salt #90
Oct 9 • 
Benjamin Marie
8
Pre-Training Updates: NVFP4 and Thinking Augmented
The Weekly Salt #89
Oct 1 • 
Benjamin Marie
2

September 2025

How Poor SFT Data Overwrites Learned Knowledge
The Weekly Salt #87
Sep 24 • 
Benjamin Marie
5
Jet-Nemotron: Searching for the Best Attention Architecture
DeltaNet + Hardware-aware Search
Sep 23 • 
Benjamin Marie
2
MMBERT as a Drop-in Successor to XLM-R
The Weekly Salt #86
Sep 17 • 
Benjamin Marie
1
LLMs Hallucinate and That's a Benchmarking Problem
The Weekly Salt #85
Sep 10 • 
Benjamin Marie
4
What Breaks When You Quantize for Translation? A Deep Dive Across 55 Languages
Evaluating LLM translation under quantization with COMET, BLEU, GGUF models, and more
Sep 8 • 
Benjamin Marie
1
2
Joint Prediction of Token Order and Next Token
The Weekly Salt #84
Sep 3 • 
Benjamin Marie
2

August 2025

Dual Preference Optimization with DuPO
The Weekly Salt #83
Aug 27 • 
Benjamin Marie
3
Easier MoE Training and GFPO, a New Alternative to GRPO Against "Length Inflation"
The Weekly Salt #82
Aug 20 • 
Benjamin Marie
2
Better MoE Compression and Understanding of Massive Activations
The Weekly Salt #81
Aug 13 • 
Benjamin Marie
5
© 2025 Benjamin Marie
Privacy ∙ Terms ∙ Collection notice
Start your SubstackGet the app
Substack is the home for great culture