bartowski/allura-forge_Llama-3.3-8B-Instruct-GGUF Text Generation • 8B • Updated 24 days ago • 13.1k • 23
Cerebras REAP Collection Sparse MoE models compressed using the REAP (Router-weighted Expert Activation Pruning) method • 22 items • Updated 10 days ago • 87