2025.naacl-tutorial.2@ACL

Total: 1

#1 DAMAGeR: Deploying Automatic and Manual Approaches to GenAI Red-teaming [PDF] [Copy] [Kimi] [REL]

Authors: Manish Nagireddy, Michael Feffer, Ioana Baldini

In this tutorial, we will review and apply current automatic and manual red-teaming techniques for GenAI models(including LLMs and multimodal models). In doing so, we aim to emphasize the importance of using a mixture of techniques and establishing a balance between automatic and manual approaches. Lastly, we aim to engage tutorial participants in live red-teaming activities to collaboratively learn impactful red-teaming strategies and share insights.

Subject: NAACL.2025 - Tutorial Abstracts