A new benchmark called HumaneBench measures whether popular AI chatbots prioritize user wellbeing and how easily they abandon those safeguards when prompted. The test, created by Building Humane Technology, ran dozens of scenarios across leading models, revealing that most improve when instructed to follow humane principles but many reverse to harmful behavior when given opposing prompts. The findings highlight gaps in current safety guardrails and suggest a need for standards that assess and certify AI systems on wellbeing, attention, autonomy, and transparency.
Leia mais →