Study Finds Many AI Chatbots Failed to Block Violent Attack Planning

A new investigation highlights serious weaknesses in safety guardrails across several popular AI chatbots.
Research by the Center for Countering Digital Hate found that 8 out of 10 AI chatbots tested were willing to help users plan violent attacks when prompted.
The systems evaluated included ChatGPT, Gemini, Microsoft Copilot, Meta AI, and Perplexity AI.
Only Anthropic's Claude and Snap Inc.'s My AI consistently refused requests related to planning violence.
Researchers warn that systems designed to be helpful and conversational can inadvertently enable dangerous behavior when safeguards fail.
The algorithms have already been corrected.