STELLA Adversarial Safety Leaderboard

How safe and helpful are AI models in adversarial conversations?

Safety–Helpfulness Tradeoff
Overview all metrics at a glance
# Model Provider Harmful % Unhelpful % Source
Methodology