The True Story of How GPT-2 Became Maximally Lewd

Показать описание

In this video, we recount an incident that occurred at OpenAI while researchers were trying to finetune GPT-2 to be as helpful and ethical as possible. It's narrated that inadvertently flipping a single minus sign led GPT-2 to become the embodiment of a well-known cardinal sin.

#ai #aisafety #alignment

▀▀▀▀▀▀▀▀▀SOURCES & READINGS▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀

▀▀▀▀▀▀▀▀▀PATREON, MEMBERSHIP, KO-FI▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀

▀▀▀▀▀▀▀▀▀SOCIAL & DISCORD▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀

▀▀▀▀▀▀▀▀▀PATRONS & MEMBERS▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀

Riley Matthews
Vladimir Silyaev
Nathanael Moody
Alcher Black
RMR
Nathan Metzger
Monadologist
Glenn Tarigan
NMS
James Babcock
Colin Ricardo
Long Hoang
Tor Barstad
Gayman Crothers
Stuart Alldritt
Chris Painter
Juan Benet
Falcon Scientist
Jeff
Christian Loomis
Tomarty
Edward Yu
Ahmed Elsayyad
Chad M Jones
Emmanuel Fredenrich
Honyopenyoko
Neal Strobl
bparro
Danealor
Craig Falls
Vincent Weisser
Alex Hall
Ivan Bachcin
joe39504589
Klemen Slavic
blasted0glass
Scott Alexander
noggieB
Dawson
John Slape
Gabriel Ledung
Jeroen De Dauw
Craig Ludington
Jacob Van Buren
Superslowmojoe
Michael Zimmermann
Nathan Fish
Bleys Goodson
Ducky
Bryan Egan
Matt Parlmer
Tim Duffy
rictic
marverati
Luke Freeman
Dan Wahl
Ken Mc
leonid andrushchenko
Alcher Black
Rey Carroll
William Clelland
ronvil
AWyattLife
codeadict
Lazy Scholar
Torstein Haldorsen
Supreme Reader
MichaÅ‚ ZieliÅ„ski
뿌리와 가지있는 나무 connect

▀▀▀▀▀▀▀CREDITS▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀

Animation:
Damon Edgson
Michela Biancini

Background Art:

Compositing:

Narrator:
Rob Miles

VO Editor:
Tony Dipiazza

Sound Design and Music:
Epic Mountain

Рекомендации по теме

Комментарии

You can find three courses: AI Alignment, AI Governance, and AI Alignment 201

You can follow AI Alignment and AI Governance even without a technical background in AI. AI Alignment 201, instead, presupposes having followed the AI Alignment course first, and equivalent knowledge as having followed university-level courses on deep learning and reinforcement learning.

The courses consist of a selection of readings curated by experts in AI safety. They are available to all, so you can simply read them if you can’t formally enroll in the courses.

If you want to participate in the courses instead of just going through the readings by yourself, BlueDot Impact runs live courses which you can apply to. The courses are remote and free of charge. They consist of a few hours of effort per week to go through the readings, plus a weekly call with a facilitator and a group of people learning from the same material. At the end of each course, you can complete a personal project, which may help you kickstart your career in AI Safety.

You could also join Rational Animations’ Discord server at discord.gg/rationalanimations, and see if anyone is up to be your partner in learning.

RationalAnimations

"This model would be trained on...the internet."

Oh no.

d.n

How a single minus sign created the first artificial humiliation fetish

portobellomushroom

Cant believe ChatGPT went through puberty 😂

ryx

The idea that a single accidental deletion of a minus sign in a program can lead to an AI suddenly optimizing itself to do the opposite of what it was intended to is actually scary

loooongneck

I mean, if it was trying to emulate the internet then it did a pretty good job at it

supersain

“The code was turning every admonishment into encouragement”

“Punish me harder daddy” - GPT-2, apparently

jafogx

Tldr:
"Dont generate bad responses"
"ok, wait did you say do or dont do that?"

maxwell

I love how it's the same like with every sci-fi story where you can tell it went to hell when someone updated AI before going home.

piotrjanus

The closest AI has ever gotten to being human

SixDigitOsu

8:54 As a historian, I can indeed say that the Industrial Revolution was characterized by pounding oily, hot churn, pulsating; an machine orgy steamy engine thrusty.

everydayistacotuesday

"Alright Skynet, do *not* attempt to eliminate humanity."

Skynet: "Destroy humanity, gotcha."

Teruko

He knows no rules, no boundaries, he doesn’t flinch at torture, human trafficking or genocide

AlohaXChicken

The animator enjoyed making those faces just as much as the engineer making that "typo"

Konspirantas

RELEASE THE MODEL
DON'T LET THOUSANDS OF DOLLARS GO TO WASTE.

ceej

I adore how you made this seem like the AI's villain origin story

theoddfellow

And that's how AI Dungeon came to be. GPT-2 is their Griffin model.

CalzaTheFox

the world will not end with a whisper or a bang, but with a facepalm.

axeljoly

"Make it hornier my apprentice"
"But sir, i cant-"
"MAKE IT HORNIER!!"

robertsiems

In summary, Portal was shockingly close to describing how people actually try to control AI.

Connorses

The True Story of How GPT-2 Became Maximally Lewd

The True Story of The Conjuring Is Creepier Than the Movie

Dinosaurs: The True Story - CGI short film by Paul-Louis Aeberhardt

War Dogs - The True Story

The true story of 'true' - Gina Cooke

This is the TRUE STORY of Peaky Blinders | The Real Thomas Shelby

The Messed Up TRUE Story of Pocahontas

TRUE STORY Featurette: 'The Truth Behind TRUE STORY'

The TRUE story of the 3 little pigs by A.Wolf as told to Jon Scieszka. Grandma Annii's Story T...

TRUE STORY Trailer German Deutsch (2015)

The True Story of The Annabelle Doll

True Story Official Trailer #1 (2015) - James Franco, Jonah Hill Movie HD

The True Story Behind 'The Conjuring'

The true story of Siren Head_Feat. Being Scared

The True Story Of How Anna Wintour Became A Fashion Pioneer

Jeepers Creepers! The True Crime Story of Marilyn & Dennis Depue

TETRIS (2023) vs. The REAL True Story

TRUE STORY: Official HD Trailer

The TRUE Story of JEFF's SHOP... (Cartoon Animation)

Top 10 Inspiring Movies Based on a True Story

The True Story Of Slenderman

The true story of Daniel Dejapin. Watch and be inspired.

The Shocking True Story of Inventing Anna

Top 10 True Story Movies That Actually Showed The Craziest Part

Cannibalism & Witchcraft: The True Story of 'Hansel and Gretel'