Discussion about this post

Remixa

Thanks, BENJAMIN.

I've been wondering about one thing, though, since their experiments seem to be based mostly on completion tasks. In theory, could these context-length extension methods (LongRecipe, YaRN, and others) also apply to instruction fine-tuning, rather than only to completion tasks?
