Podcast Episode Details

Back to Podcast Episodes
The coming AI security crisis (and what to do about it) | Sander Schulhoff

The coming AI security crisis (and what to do about it) | Sander Schulhoff



Sander Schulhoff is an AI researcher specializing in AI security, prompt injection, and red teaming. He wrote the first comprehensive guide on prompt engineering and ran the first-ever prompt injection competition, working with top AI labs and companies. His dataset is now used by Fortune 500 companies to benchmark their AI systems security, he’s spent more time than anyone alive studying how attackers break AI systems, and what he’s found isn’t reassuring: the guardrails companies are buying don’t actually work, and we’ve been lucky we haven’t seen more harm so far, only because AI agents aren’t capable enough yet to do real damage.

We discuss:

1. The difference between jailbreaking and prompt injection attacks on AI systems

2. Why AI guardrails don’t work

3. Why we haven’t seen major AI security incidents yet (but soon will)

4. Why AI browser agents are vulnerable to hidden attacks embedded in webpages

5. The practical steps organizations should take instead of buying ineffective security tools

6. Why solving this requires merging classical cybersecurity expertise with AI knowledge

Brought to you by:

Datadog—Now home to Eppo, the leading experimentation and feature flagging platform: https://www.datadoghq.com/lenny

Metronome—Monetization infrastructure for modern software companies: https://metronome.com/

GoFundMe Giving Funds—Make year-end giving easy: http://gofundme.com/lenny

Transcript: https://www.lennysnewsletter.com/p/the-coming-ai-security-crisis

My biggest takeaways (for paid newsletter subscribers): https://www.lennysnewsletter.com/i/181089452/my-biggest-takeaways-from-this-conversation

Where to find Sander Schulhoff:

• X: https://x.com/sanderschulhoff

• LinkedIn: https://www.linkedin.com/in/sander-schulhoff

• Website: https://sanderschulhoff.com

• AI Red Teaming and AI Security Masterclass on Maven: https://bit.ly/44lLSbC

Where to find Lenny:

• Newsletter: https://www.lennysnewsletter.com

• X: https://twitter.com/lennysan

• LinkedIn: https://www.linkedin.com/in/lennyrachitsky/

In this episode, we cover:

(00:00) Introduction to Sander Schulhoff and AI security

(05:14) Understanding AI vulnerabilities

(11:42) Real-world examples of AI security breaches

(17:55) The impact of intelligent agents

(19:44) The rise of AI security solutions

(21:09) Red teaming and guardrails

(23:44) Adversarial robustness

(27:52) Why guardrails fail

(38:22) The lack of resources addressing this problem

(44:44) Practical advice for addressing AI security

(55:49) Why you shouldn’t spend your time on guardrails

(59:06


Published on 1 week, 4 days ago






If you like Podbriefly.com, please consider donating to support the ongoing development.

Donate