The Salt - Curated AI
Subscribe
Sign in
Home
Notes
AI Notebooks
AI Repositories
Related Articles
Archive
About
Latest
Top
Discussions
LLMs Already Know How to Reason
The Weekly Salt #55
Feb 12
•
Benjamin Marie
3
Share this post
The Salt - Curated AI
LLMs Already Know How to Reason
Copy link
Facebook
Email
Notes
More
Accurate LLM Training with FP4 Quantization, Coming Soon?
The Weekly Salt #54
Feb 4
•
Benjamin Marie
5
Share this post
The Salt - Curated AI
Accurate LLM Training with FP4 Quantization, Coming Soon?
Copy link
Facebook
Email
Notes
More
Online DPO with a Reward Model
Better than offline DPO, cheaper than reinforcement learning
Feb 4
•
Benjamin Marie
5
Share this post
The Salt - Curated AI
Online DPO with a Reward Model
Copy link
Facebook
Email
Notes
More
January 2025
More Papers on Deeper and Efficient "Thinking" for LLMs
The Weekly Salt #53
Jan 29
•
Benjamin Marie
2
Share this post
The Salt - Curated AI
More Papers on Deeper and Efficient "Thinking" for LLMs
Copy link
Facebook
Email
Notes
More
Smaller KV Cache with Tensor Product Attention
The Weekly Salt #52
Jan 21
•
Benjamin Marie
3
Share this post
The Salt - Curated AI
Smaller KV Cache with Tensor Product Attention
Copy link
Facebook
Email
Notes
More
Multiagent Finetuning and Better RLHF
The Weekly Salt #51
Jan 15
•
Benjamin Marie
4
Share this post
The Salt - Curated AI
Multiagent Finetuning and Better RLHF
Copy link
Facebook
Email
Notes
More
The Bottlenecks of State Space Models
The Weekly Salt #50
Jan 7
•
Benjamin Marie
2
Share this post
The Salt - Curated AI
The Bottlenecks of State Space Models
Copy link
Facebook
Email
Notes
More
December 2024
Chain-of-Thought with a Token Budget
The Weekly Salt #49
Dec 31, 2024
•
Benjamin Marie
3
Share this post
The Salt - Curated AI
Chain-of-Thought with a Token Budget
Copy link
Facebook
Email
Notes
More
3
TÜLU 3: The Post-Training Recipe
SFT + DPO + RLVR
Dec 19, 2024
•
Benjamin Marie
5
Share this post
The Salt - Curated AI
TÜLU 3: The Post-Training Recipe
Copy link
Facebook
Email
Notes
More
Chain-of-Thought in a Continuous Latent Space for More Efficient Reasoning
The Weekly Salt #48
Dec 18, 2024
•
Benjamin Marie
9
Share this post
The Salt - Curated AI
Chain-of-Thought in a Continuous Latent Space for More Efficient Reasoning
Copy link
Facebook
Email
Notes
More
"Reverse Thinking" for Better LLM Reasoning
The Weekly Salt #47
Dec 10, 2024
•
Benjamin Marie
4
Share this post
The Salt - Curated AI
"Reverse Thinking" for Better LLM Reasoning
Copy link
Facebook
Email
Notes
More
TÜLU 3's High-Quality Synthetic Datasets for Post-Training LLMs
Made by GPT-4o
Dec 5, 2024
•
Benjamin Marie
3
Share this post
The Salt - Curated AI
TÜLU 3's High-Quality Synthetic Datasets for Post-Training LLMs
Copy link
Facebook
Email
Notes
More
Share
Copy link
Facebook
Email
Notes
More
This site requires JavaScript to run correctly. Please
turn on JavaScript
or unblock scripts