Stealing Part of a Production LLM | API protects LLMs no more

preview_player
Показать описание
How it is possible to steal part of LLMs protected behind an API? 🥷 We explain both papers that made a breakthrough on this, one from Carlini et al. (Google), and the other one from Finlayson et al. (USC), see references below.

Thanks to our Patrons who support us in Tier 2, 3, 4: 🙏
Dres. Trost GbR, Siltax, Vignesh Valliappan, Michael, Sunny Dhiana, Andy Ma

Outline:
00:00 Stealing LLMs from behind API’s!?
01:54 AssemblyAI (Sponsor)
02:59 Two papers, same thing
04:03 Core observation
07:05 Recover Hidden Dimensionality
08:54 gpt-3.5-turbo
10:30 Full Layer Extraction
10:58 Extract all logits
14:35 Defenses
15:40 Cost of attack
16:22 Further impact
17:40 API response stochasticity

▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀
🔥 Optionally, pay us a coffee to help with our Coffee Bean production! ☕
Join this channel to get access to perks:
▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀

🔗 Links:

#AICoffeeBreak #MsCoffeeBean #MachineLearning #AI #research​

Video editing: Nils Trost
Scientific advising by Mara Popescu
Рекомендации по теме
Комментарии
Автор

Excellent! White paper from LLM monitoring, simply excellent :) ... off to the reading library. thanks muchly.

MyrLin
Автор

Thanks! Concentrate on your PhD writing it is important! We can wait for the next video.

jmirodg
Автор

Wow. I Lived. I have a long way to go to understand it all, but I held on!

HunterMayer
Автор

Wow. . . I'm glad I've found your channel very informative, I'm still trying to wrap my head around those articles, love your accent and your smile, looking forward to more videos, best of luck in your thesis, cheers from Sydney.

ricowallaby
Автор

Can't say I really understand most of what they have done but it seems totally wild and real creative how they managed to work it out. Good luck with the thesis :D

robthomas
Автор

Good luck defending your thesis, your accent is one of the reasons i watch.

praxis
Автор

Good luck with the paper, remember to take some time for yourself afterwards! I look forward to your next contribution.

HunterMayer
Автор

Really cool video. Good luck with your thesis! ^^

IndoorAdventurer
Автор

WOW! 😅 I watched the whole video! I think my brain was somewhat cramping, but I grasped the concept. Good job and good luck with your thesis!!!! I subscribed but I’ll wait for my brain to heal a bit before I jump into another great video 🙃🙏👍

bernieapodaca
Автор

For what it’s worth, I completely understand your accent (it’s not very heavy in English), and wow I was completely overthinking it with my guesses on the paper on Patreon. 😅

I’m glad I can now steal all the logits if I wanted! 😜

Another fantastic paper breakdown as always!

MaJetiGizzle
Автор

As "d" is the hidden embedding dimension, is it guaranteed somehow that the logits and embeddings themselves lie in d-dimensional space? Or they probably lie in lower dimensional sub-space?

drummatick
Автор

So much linear algebra is involved in LLMs

AmCanTech
Автор

I really like your content. Thank you so much for making it!

jackpisso
Автор

Awesome. Good video editing snd pacing too.

williamchamberlain
Автор

What software and video editing tools do you use for creating this great content ?

AbdallahAbdallah
Автор

The intellect of a woman has always melted my heart. I think I just fell in love with Letitia. That is the most beautiful lecture I have ever enjoyed. ❤

gregsLyrics
Автор

Hey amazing content, quick note 2nd paper is from University of southern California (My uni) so just pointing that out!

visheshmittal
Автор

So cool. Thanks mate! I wonder how other architectures would fare against this attack.

Also, super good luck on your thesis mate!!! I would love to see what you've been researching!

BooleanDisorder
Автор

Perfect way to start a week with one of your videos. Great work!

cosmic_reef_
Автор

Good luck with your thesis! If you want try to do a summary video of your thesis after you submit it and before your defense, I'd watch it. Could be good practice for the defense too. I'm sure you know more about the topic than anyone else in the world...new research is cool.

nathanbanks