New AI Jailbreak Method 'Bad Likert Judge' Boosts Attack Success Rates by Over 60%

Jan 3, 2025 - 04:45

0 3

New AI Jailbreak Method 'Bad Likert Judge' Boosts Attack Success Rates by Over 60%

Cybersecurity researchers have shed light on a new jailbreak technique that could be used to get past a large language model's (LLM) safety guardrails and produce potentially harmful or malicious responses. The multi-turn (aka many-shot) attack strategy has been codenamed Bad Likert Judge by Palo Alto Networks Unit 42 researchers Yongzhe Huang, Yang Ji, Wenjun Hu, Jay Chen, Akshata Rao, and

What's Your Reaction?

Like 0

Dislike 0

Love 0

Funny 0

Angry 0

Sad 0

Wow 0

Related Posts

TechCrunch Mobility: Uber Freight’s AI bet, Tesla’s robotaxi caveat, and Nikola’s trucks hit the auction block

TechCrunch Mobility: Uber Freight’s AI bet, Tesla’s rob...

May 23, 2025 0 1

BEYOND Expo 2025: She Rewires founder Jill Tang on empowering women in tech

BEYOND Expo 2025: She Rewires founder Jill Tang on empo...

May 23, 2025 0 0

Xiaomi’s second EV model draws styling cues from Ferrari Purosangue

Xiaomi’s second EV model draws styling cues from Ferrar...

May 23, 2025 0 1

Hackers Use TikTok Videos to Distribute Vidar and StealC Malware via ClickFix Technique

Hackers Use TikTok Videos to Distribute Vidar and Steal...

May 23, 2025 0 0

Zoox issues second robotaxi software recall in a month following collision

Zoox issues second robotaxi software recall in a month ...

May 23, 2025 0 0

Landa promised real estate investing for $5. Now it’s gone dark.

Landa promised real estate investing for $5. Now it’s g...

May 23, 2025 0 28