Gaperon Collection Our French-English LLM suite (including Base and SFT models. All checkpoints are also included. • 16 items • Updated 23 days ago • 16
Gaperon: A Peppered English-French Generative Language Model Suite Paper • 2510.25771 • Published Oct 29, 2025 • 15
Q-Filters: Leveraging QK Geometry for Efficient KV Cache Compression Paper • 2503.02812 • Published Mar 4, 2025 • 10
Headless Language Models: Learning without Predicting with Contrastive Weight Tying Paper • 2309.08351 • Published Sep 15, 2023 • 3