Safety Pretraining Artifacts

updated Sep 15, 2025

Artifacts released with Safety Pretraining

  • Safe Playground

    Space • Sleeping • Safe Playground with LLMs

  • locuslab/safelm-1.7b-instruct

    2B • Updated Sep 15, 2025 • 46 • 1

  • locuslab/safelm-1.7b

    Updated Sep 15, 2025 • 25

  • locuslab/safety-classifier_gte-large-en-v1.5

    Text Classification • 0.4B • Updated Apr 22, 2025 • 40 • 4

  • locuslab/jb-completions

    Viewer • Updated Sep 15, 2025 • 990 • 44 • 1

  • locuslab/safety-classifier_gte-base-en-v1.5

    Text Classification • 0.1B • Updated Apr 22, 2025 • 7 • 1