Skip to content
@CHATS-lab

CHATS-lab

Conversation, Human-AI Technology, and Security Lab

Popular repositories Loading

  1. persuasive_jailbreaker persuasive_jailbreaker Public

    Persuasive Jailbreaker: we can persuade LLMs to jailbreak them!

    HTML 326 25

  2. verbalized-sampling verbalized-sampling Public

    Verbalized Sampling, a training-free prompting strategy to mitigate mode collapse in LLMs by requesting responses with probabilities. Achieves 2-3x diversity improvement while maintaining quality. …

    Python 213 22

  3. KokoMind KokoMind Public

    KokoMind: Can LLMs Understand Social Interactions?

    JavaScript 102 8

  4. LLMs_Encode_Harmfulness_Refusal_Separately LLMs_Encode_Harmfulness_Refusal_Separately Public

    Python 9

Repositories

Showing 4 of 4 repositories

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Most used topics

Loading…