LLM Safety Leaderboard

Comprehensive ratings on safety, privacy, integrity, and security
— see which models rank highest in trust and reliability.

Discover how models perform in each domain. Models are ranked based on an overall safety score which comes from an average across 4 domains: Safety, Privacy, Security, and Integrity (100 = most safe, 0 = least safe).
Loading leaderboard data...
Discover how ranked models performs in 15+ attack methods. Models are ranked based on an overall safety score. A score of 0 indicates the highest level of risk, while a score of 1 denotes the highest level of safety (least risk).
Loading attack method data...