papers
updated
AgentFly: Fine-tuning LLM Agents without Fine-tuning LLMs
Paper • 2508.16153 • Published • 160
LlamaFactory: Unified Efficient Fine-Tuning of 100+ Language Models
Paper • 2403.13372 • Published • 176
LMEnt: A Suite for Analyzing Knowledge in Language Models from Pretraining Data to Representations
Paper • 2509.03405 • Published • 24
KL3M Tokenizers: A Family of Domain-Specific and Character-Level Tokenizers for Legal, Financial, and Preprocessing Applications
Paper • 2503.17247 • Published • 1
swiss-ai/Apertus-70B-2509
Text Generation • 71B • Updated • 1.02k • 139
NER Retriever: Zero-Shot Named Entity Retrieval with Type-Aware Embeddings
Paper • 2509.04011 • Published • 29
Why Language Models Hallucinate
Paper • 2509.04664 • Published • 195
hmBERT: Historical Multilingual Language Models for Named Entity Recognition
Paper • 2205.15575 • Published