reaperdoesntknow
/

SMOLM2Prover

Text Generation

Generated from Trainer

text-generation-inference

Model card Files Files and versions

reaperdoesntknow commited on Sep 6

Commit

06a90ce

·

verified ·

1 Parent(s): 4f9bbab

Update README.md

Files changed (1) hide show

README.md +156 -10

README.md CHANGED Viewed

@@ -1,3 +1,140 @@
 ---
 library_name: transformers
 model_name: SmolLM2_Thinks
@@ -5,10 +142,22 @@ tags:
 - generated_from_trainer
 - sft
 - trl
 licence: license
-base_model:
-- prithivMLmods/SmolLM2-CoT-360M
-pipeline_tag: text-generation
 ---
 # Model Card for SmolLM2_Thinks
@@ -51,10 +200,7 @@ Cite TRL as:
 ```bibtex
 @misc{vonwerra2022trl,
 	title        = {{TRL: Transformer Reinforcement Learning}},
-	author       = {Leandro von Werra and Younes Belkada and Lewis Tunstall and Edward Beeching and Tristan Thrush and Nathan Lambert and Shengyi Huang and Kashif Rasul and Quentin Gallou{\'e}dec},
-	year         = 2020,
-	journal      = {GitHub repository},
-	publisher    = {GitHub},
-	howpublished = {\url{https://github.com/huggingface/trl}}
-}
-```

+Hugging Face's logo Hugging Face
+Models
+Datasets
+Spaces
+Docs
+Enterprise
+Pricing
+reaperdoesntknow
+/
+SmolLM2_Thinks
+Text Generation
+Transformers
+PyTorch
+English
+llama
+Generated from Trainer
+sft
+trl
+proof
+cot
+reasoning
+symbioticai
+calculus
+logic
+SFT
+TRL
+datasets
+finetune
+conversational
+text-generation-inference
+Model card
+Files and versions
+xet
+Community
+Settings
+SmolLM2_Thinks/
+license
+datasets
+language
+metrics
+base_model
+new_version
+pipeline_tag
+library_name
+tags
+Eval Results
+View doc
+1
+2
+3
+4
+5
+6
+7
+8
+9
+10
+11
+12
+13
+14
+15
+16
+17
+18
+19
+20
+21
+22
+23
+24
+25
+26
+27
+28
+29
+30
+31
+32
+33
+34
+35
+36
+37
+38
+39
+40
+41
+42
+43
+44
+45
+46
+47
+48
+49
+50
+51
+52
+53
+54
+55
+56
+57
+58
+59
+60
+61
+62
+63
+64
+65
+⌄
+⌄
+⌄
+⌄
+⌄
+⌄
+⌄
+⌄
+⌄
+⌄
+⌄
+⌄
 ---
 library_name: transformers
 model_name: SmolLM2_Thinks
 - generated_from_trainer
 - sft
 - trl
+- proof
+- cot
+- reasoning
+- symbioticai
+- calculus
+- logic
+- SFT
+- TRL
+- transformers
+- datasets
+- finetune
 licence: license
+datasets:
+- AI-MO/NuminaMath-1.5
+language:
+- en
 ---
 # Model Card for SmolLM2_Thinks
 ```bibtex
 @misc{vonwerra2022trl,
 	title        = {{TRL: Transformer Reinforcement Learning}},
+Commit directly to the main branch
+Open as a pull request to the main branch
+Commit changes
+Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.