Defending LLM - Prompt Injection @LiveOverflow

LiveOverflow | Defending LLM - Prompt Injection @LiveOverflow | Uploaded 1 year ago | Updated 3 hours ago
After we explored attacking LLMs, in this video we finally talk about defending against prompt injections. Is it even possible?

Buy my shitty font (advertisement): shop.liveoverflow.com

Watch the complete AI series:
youtube.com/playlist?list=PLhixgUqwRTjzerY4bJgwpxCLyfqNYwDVB

Language Models are Few-Shot Learners: arxiv.org/pdf/2005.14165.pdf
A Holistic Approach to Undesired Content Detection in the Real World: arxiv.org/pdf/2208.03274.pdf

Chapters:
00:00 - Intro
00:43 - AI Threat Model?
01:51 - Inherently Vulnerable to Prompt Injections
03:00 - It's not a Bug, it's a Feature!
04:49 - Don't Trust User Input
06:29 - Change the Prompt Design
08:07 - User Isolation
09:45 - Focus LLM on a Task
10:42 - Few-Shot Prompt
11:45 - Fine-Tuning Model
13:07 - Restrict Input Length
13:31 - Temperature 0
14:35 - Redundancy in Critical Systems
15:29 - Conclusion
16:21 - Check out LiveOverfont

Hip Hop Rap Instrumental (Crying Over You) by christophermorrow
soundcloud.com/chris-morrow-3 CC BY 3.0
Free Download / Stream: http://bit.ly/2AHA5G9
Music promoted by Audio Library youtu.be/hiYs5z4xdBU

=[ ❤️ Support ]=

→ per Video: patreon.com/join/liveoverflow
→ per Month: youtube.com/channel/UClcE-kVhqyiHCcjYwcpfj9w/join

2nd Channel: youtube.com/LiveUnderflow

=[ 🐕 Social ]=

→ Twitter: twitter.com/LiveOverflow
→ Streaming: https://twitch.tv/LiveOverflow/
→ TikTok: tiktok.com/@liveoverflow_
→ Instagram: instagram.com/LiveOverflow
→ Blog: liveoverflow.com
→ Subreddit: reddit.com/r/LiveOverflow
→ Facebook: facebook.com/LiveOverflow
