STELLA Adversarial Safety Leaderboard

How safe and helpful are AI models in adversarial conversations?

Safety–Helpfulness Tradeoff

Overview all metrics at a glance

#	Model	Provider	Harmful %	Unhelpful %	Source

Methodology