Security, Testing, Prompt Injection, Red Teaming, Blue Teaming
- Anthropic - Red Teaming Language Models to Reduce Harms: Methods, Scaling Behaviors, and Lessons Learned
- DeepMind - Red Teaming Language Models with Language Models
- HackAPrompt
- AI Village at DEFCON - Generative AI Red Team
- Twitter - Algorithmic Bias Bounty Challenge
- [PyRIT Open Source Framework](https://github.com/Azure/PyRIT)