[2412.02776] Hacking CTFs with Plain Agents - "Our results suggest that current LLMs have surpassed the high school level in offensive cybersecurity. Their hacking capabilities remain underelicited: our ReAct&Plan prompting strategy solves many challenges in 1-2 turns"
https://ift.tt/Ktoy57P
Discuss on Reddit: https://ift.tt/eFvjDBW
@blueteamalerts