Mitigating Catastrophic Forgetting in Target Language Adaptation of LLMs via Source-Shielded Updates Paper • 2512.04844 • Published 22 days ago • 4
Deconstructing Attention: Investigating Design Principles for Effective Language Modeling Paper • 2510.11602 • Published Oct 13 • 14
Fine-Tuning on Noisy Instructions: Effects on Generalization and Performance Paper • 2510.03528 • Published Oct 3 • 17