CMU Researchers Propose a Simple and Effective Attack Method that Causes Aligned Language Models to Generate Objectionable Behaviors at a High Success Rate
https://www.marktechpost.com/2023/08/01/cmu-researchers-propose-a-simple-and-effective-attack-method-that-causes-aligned-language-models-to-generate-objectionable-behaviors-at-a-high-success-rate/