Researchers gaslit Claude into giving instructions to build explosives
www.theverge.com
Lemmynated@lemmy.zip to Technology@lemmy.zip · English · 2 months ago · cross-posted to: technology@lemmy.world
Taleya@aussie.zone · English · 2 months ago
You can’t gaslight a fucking machine; they busted the “safety” protocols on an LLM already renowned for ignoring its instruction set.