The Salt - Curated AI
NSA, RLDP, and MaPPO: Techniques for Efficient, Private, and Aligned LLMs
The Weekly Salt #80
Aug 5 • Benjamin Marie
July 2025
Rethinking LLM Reliability: Calibration, Compression, and the Hidden Costs of Inference-Time Scaling
The Weekly Salt #79
Jul 30 • Benjamin Marie
Is RLVR Just a Conservative Reweighting Process?
The Weekly Salt #78
Jul 23 • Benjamin Marie
Enabling Reasoning with Simple KV Cache Steering
The Weekly Salt #77
Jul 16 • Benjamin Marie
Supervised Fine-Tuning Improves LLM Reasoning at the Cost of Other Skills
The Weekly Salt #76
Jul 9 • Benjamin Marie
Tower+: Translation and General-Purpose Multilingual Models
The Weekly Salt #75
Jul 2 • Benjamin Marie
June 2025
RLPR: RLVR without Verifiers
The Weekly Salt #74
Jun 25 • Benjamin Marie
Reward Correct CoT for Better Reasoning Models
The Weekly Salt #73
Jun 19 • Benjamin Marie
Magistral: Advancing Reasoning with Efficient GRPO Training
No More KL Penalty, No Need for a Reference Model
Jun 12 • Benjamin Marie
Better Data Recipes for Pre-training LLMs and Training Reasoning Models
The Weekly Salt #72
Jun 11 • Benjamin Marie
Reasoning Models Are More Prone to "Hallucination"
The Weekly Salt #71
Jun 4 • Benjamin Marie
May 2025
End-to-End FP4 Training for LLMs with Blackwell GPUs
The Weekly Salt #70
May 28 • Benjamin Marie