LiveOverflow | Defending LLM - Prompt Injection @LiveOverflow | Uploaded 1 year ago | Updated 3 hours ago
After we explored attacking LLMs, in this video we finally talk about defending against prompt injections. Is it even possible?
Buy my shitty font (advertisement): shop.liveoverflow.com
Watch the complete AI series:
youtube.com/playlist?list=PLhixgUqwRTjzerY4bJgwpxCLyfqNYwDVB
Language Models are Few-Shot Learners: arxiv.org/pdf/2005.14165.pdf
A Holistic Approach to Undesired Content Detection in the Real World: arxiv.org/pdf/2208.03274.pdf
Chapters:
00:00 - Intro
00:43 - AI Threat Model?
01:51 - Inherently Vulnerable to Prompt Injections
03:00 - It's not a Bug, it's a Feature!
04:49 - Don't Trust User Input
06:29 - Change the Prompt Design
08:07 - User Isolation
09:45 - Focus LLM on a Task
10:42 - Few-Shot Prompt
11:45 - Fine-Tuning Model
13:07 - Restrict Input Length
13:31 - Temperature 0
14:35 - Redundancy in Critical Systems
15:29 - Conclusion
16:21 - Checkout LiveOverfont
Hip Hop Rap Instrumental (Crying Over You) by christophermorrow
soundcloud.com/chris-morrow-3 CC BY 3.0
Free Download / Stream: http://bit.ly/2AHA5G9
Music promoted by Audio Library youtu.be/hiYs5z4xdBU
=[ ❤️ Support ]=
→ per Video: patreon.com/join/liveoverflow
→ per Month: youtube.com/channel/UClcE-kVhqyiHCcjYwcpfj9w/join
2nd Channel: youtube.com/LiveUnderflow
=[ 🐕 Social ]=
→ Twitter: twitter.com/LiveOverflow
→ Streaming: https://twitch.tvLiveOverflow/
→ TikTok: tiktok.com/@liveoverflow_
→ Instagram: instagram.com/LiveOverflow
→ Blog: liveoverflow.com
→ Subreddit: reddit.com/r/LiveOverflow
→ Facebook: facebook.com/LiveOverflow
After we explored attacking LLMs, in this video we finally talk about defending against prompt injections. Is it even possible?
Buy my shitty font (advertisement): shop.liveoverflow.com
Watch the complete AI series:
youtube.com/playlist?list=PLhixgUqwRTjzerY4bJgwpxCLyfqNYwDVB
Language Models are Few-Shot Learners: arxiv.org/pdf/2005.14165.pdf
A Holistic Approach to Undesired Content Detection in the Real World: arxiv.org/pdf/2208.03274.pdf
Chapters:
00:00 - Intro
00:43 - AI Threat Model?
01:51 - Inherently Vulnerable to Prompt Injections
03:00 - It's not a Bug, it's a Feature!
04:49 - Don't Trust User Input
06:29 - Change the Prompt Design
08:07 - User Isolation
09:45 - Focus LLM on a Task
10:42 - Few-Shot Prompt
11:45 - Fine-Tuning Model
13:07 - Restrict Input Length
13:31 - Temperature 0
14:35 - Redundancy in Critical Systems
15:29 - Conclusion
16:21 - Checkout LiveOverfont
Hip Hop Rap Instrumental (Crying Over You) by christophermorrow
soundcloud.com/chris-morrow-3 CC BY 3.0
Free Download / Stream: http://bit.ly/2AHA5G9
Music promoted by Audio Library youtu.be/hiYs5z4xdBU
=[ ❤️ Support ]=
→ per Video: patreon.com/join/liveoverflow
→ per Month: youtube.com/channel/UClcE-kVhqyiHCcjYwcpfj9w/join
2nd Channel: youtube.com/LiveUnderflow
=[ 🐕 Social ]=
→ Twitter: twitter.com/LiveOverflow
→ Streaming: https://twitch.tvLiveOverflow/
→ TikTok: tiktok.com/@liveoverflow_
→ Instagram: instagram.com/LiveOverflow
→ Blog: liveoverflow.com
→ Subreddit: reddit.com/r/LiveOverflow
→ Facebook: facebook.com/LiveOverflow