· 1 min read

Summon a demon and bind it

I participated in this survey on red-teaming AIs: Summon a Demon and Bind It

“Engaging in the deliberate generation of abnormal outputs from large language models (LLMs) by attacking them is a novel human activity”

_1725219763310510258-F_E2Nv4aEAAUI7E.png

View original