The Salt - Curated AI
Vocabulary Parallelism for More Efficient LLMs
The Weekly Salt #44
Nov 19
•
Benjamin Marie
Memory-Efficient Inference with 4-bit Activations for 1-bit LLMs
The Weekly Salt #43
Nov 12
•
Benjamin Marie
Go Zero-Shot for Cheaper LLM Evaluations
Unless you use a generative benchmark
Nov 6
•
Benjamin Marie
Your Prompts Are Not Safe with Mixture of Experts Models
The Weekly Salt #42
Nov 5
•
Benjamin Marie
October 2024
The Effective Context Length of LLMs and Semi-Supervised Fine-Tuning
The Weekly Salt #41
Oct 29
•
Benjamin Marie
Mixture-of-Experts: Mixture-of-Head Attention and Embedding Model
The Weekly Salt #40
Oct 22
•
Benjamin Marie
Cancelling Attention Noise with Differential Transformer
The Weekly Salt #39
Oct 15
•
Benjamin Marie
Evaluating AdEMAMix: A New Optimizer for Faster, More Efficient LLM Training
But with hyperparameter values not easy to find!
Oct 9
•
Benjamin Marie
Cross Capabilities of LLMs and Contextual Document Embeddings
The Weekly Salt #38
Oct 8
•
Benjamin Marie
LLMs Can Follow Instructions Without Instruction Tuning
The Weekly Salt #37
Oct 1
•
Benjamin Marie
September 2024
Qwen2-VL: How Does It Work?
One of the best VLMs for image captioning, visual question answering, optical character recognition (OCR), and multimodal chat.
Sep 25
•
Benjamin Marie
SCoRe: Teach LLMs to Self-Correct
The Weekly Salt #36
Sep 24
•
Benjamin Marie