yulan-team/YuLan-Mini-Before-Annealing
Tags: Safetensors, yulanmini, optimizer_states, custom_code
arXiv: 2412.17743
License: mit
#2: Question about architecture (opened about 1 year ago by QuantPanda; 2 comments)