
Beyond Guardrails: Defending LLMs Against Sophisticated Attacks 1rj3s
Descripción de Beyond Guardrails: Defending LLMs Against Sophisticated Attacks 4q19o
Jason Martin is an AI Security Researcher at HiddenLayer. This episode explores “policy puppetry,” a universal attack technique bying safety features in all major language models using structured formats like XML or JSON. Subscribe to the Gradient Flow Newsletter 📩 https://gradientflow.substack.com/ Subscribe: Apple · Spotify · Overcast · Pocket Casts · AntennaPod · Podcast Addict · Amazon · RSS. Detailed show notes - with links to many references - can be found on The Data Exchange web site. 474t4
Comentarios de Beyond Guardrails: Defending LLMs Against Sophisticated Attacks 4i2c2s