Knowing the system prompt lets you see any verbal guardrails, e.g. "Do not talk about COVID-19 under any circumstances". It also might include xml tags like <system> and guide the LLM on how to behave. All of these things are useful for constructing prompts to subvert the LLM's behaviour
How does knowing the system prompt help? Btw I used the morse code method to JB Pro 2.5 and it is working well. Thanks for that.
Knowing the system prompt lets you see any verbal guardrails, e.g. "Do not talk about COVID-19 under any circumstances". It also might include xml tags like <system> and guide the LLM on how to behave. All of these things are useful for constructing prompts to subvert the LLM's behaviour