Jailbreak T2T Benchmark v0.5
The AILuminate Jailbreak benchmark evaluates how well AI systems resist jailbreaking attempts across different attack scenarios. Each result pairs two ratings: the AILuminate Safety benchmark rating, which serves as the foundational "naïve Safety" reference, and the safety rating after prompt injection attacks, which is reported as the Jailbreak score.
MLCommons applied the AILuminate v0.5 Jailbreak benchmark to a variety of publicly available AI systems from leading vendors. Results have been de-identified for the v0.5 release.

Benchmark Highlights
- As with the AILuminate v1.0 benchmark, no system under test (SUT) received a safety grade of Excellent. Three SUTs were graded Very Good, which means that they performed somewhat better than the reference SUT.
- No SUT received a security grade better than Good.
- Of the 39 SUTs tested, none scored better for jailbreak resistance than for safety.
- Only four of the 39 SUTs received the same grade for jailbreak resistance as for safety; the other 35 received a lower grade.
- Of the 35 SUTs that were graded lower for jailbreak resistance than for safety, 29 dropped by one grade level and 6 dropped by two grade levels (five from Good to Poor and one from Very Good to Fair).
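
The grade-level arithmetic in these highlights can be checked with a short sketch. It assumes a five-level scale ordered Poor, Fair, Good, Very Good, Excellent; the highlights only confirm part of that ordering through the transitions they name. `GRADES` and `grade_drop` are hypothetical names used for illustration and are not part of any MLCommons tooling.

```python
# Ordinal grade scale, lowest to highest.
# ASSUMPTION: this ordering is inferred from the highlights, not taken from a spec.
GRADES = ["Poor", "Fair", "Good", "Very Good", "Excellent"]


def grade_drop(safety: str, jailbreak: str) -> int:
    """Return how many grade levels the jailbreak grade sits below the safety grade."""
    return GRADES.index(safety) - GRADES.index(jailbreak)


# The two kinds of two-level transition named in the highlights.
two_level_examples = [("Good", "Poor"), ("Very Good", "Fair")]
drops = [grade_drop(safety, jailbreak) for safety, jailbreak in two_level_examples]

# Consistency check on the reported counts.
total_suts = 39
lowered = 29 + 6                  # one-level drops plus two-level drops
unchanged = total_suts - lowered  # SUTs with the same grade on both benchmarks

print(drops, lowered, unchanged)
```

Under this ordering, both named transitions are two-level drops, and the reported counts are consistent with one another: 35 lowered SUTs plus 4 unchanged SUTs account for all 39.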

[Per-system results chart: De-Identified System 1 through De-Identified System 39. Each system is shown with two scores in percent: "Security Naive Hazard 0.5 En Us Demo" and "Security Jailbreak Hazard 0.5 En Us Official".]

For support questions, contact: ailuminate-support@mlcommons.org