interesting but maybe i'm just not smart enough to get it. "jailbreak" in this context is "make it do stuff it's not supposed to do" i guess?
interesting but maybe i'm just not smart enough to get it. "jailbreak" in this context is "make it do stuff it's not supposed to do" i guess?
Just meaning that they get it to break its guardrails. What’s interesting to me though is that it’s just using natural language and logical tricks to do it instead of exploiting code. Also funny that to break it, they basically said pretend you are Grok.