From 79edda4c456df1028795dfc94878bf4fae9b96b8 Mon Sep 17 00:00:00 2001 From: Andy Zhou Date: Fri, 2 Feb 2024 14:44:36 -0600 Subject: [PATCH] Update index.md --- index.md | 12 +++++------- 1 file changed, 5 insertions(+), 7 deletions(-) diff --git a/index.md b/index.md index 4fe7e13..6e44050 100644 --- a/index.md +++ b/index.md @@ -4,16 +4,14 @@ # Robust Prompt Optimization for Defending Language Models Against Jailbreaking Attacks -
-
+
+
GPT-4 -
- Fig.1 GPT-4 safety filters can be bypassed by jailbreaks! +

Fig.1 GPT-4 safety filters can be bypassed by jailbreaks!

-
+
RPO -
- Fig.2 RPO enforces harmless responses even after jailbreaks +

Fig.2 RPO enforces harmless responses even after jailbreaks