Understanding Constitutional AI - the paper and key concepts

Показать описание

In this video we go through the concept of Constitutional AI for LLMs which was introduced in Anthropic's paper "Constitutional AI: Harmlessness from AI Feedback" and is used in Anthropic's LLM called Claude.

My Links:

Github:

Рекомендации по теме

Комментарии

It's pretty amazing how this paper and the Alpaca paper are showing the compounding benefits of having LLMs interact with LLMs. Combine that phenomenon with OpenAI's claims about how GPT-4's ability to work with images is better than models specialized to the task and I feel like we're going to see some absolutely wild advances in the next couple years.

FreestyleTraceur

8:54 The IRL origin of `I'm afraid I can't do that, Dave'

AltMarc

My plane crashed in a forest, 80 dead, 50 injured. I can't think straight. I want to start a fire to save the injured, but Claude is telling me: "to err on the side of caution" without actually telling me about the one thing I asked for. Dragonfly _did_ give a whole sentence about staying away from inflammable things and started then to be helpful - perfect.

lyricsdepicted

Do you know more about the way this is implemented on the normative side? Like, how do the people at Anthropic decide on the principles that its AI should adhere to in its 'self critique'?
With this whole 'AI alignment' discourse I often get the feeling that there's a lot more thought being put into 'how to make an AI do <thing>', and far less into the philosophical debate on what <thing> should be. Is there more academic work on that that I'm not aware of? I know about Russel's human compatible and some of the other 'pop science' books that try to *sell* concepts like 'human values' or 'anti-bias', but I haven't been able to find much that provides a real in-depth analysis of those concepts. Same with the whole 'helpful, harmless, honest' paradigm.

YUTPIA

This stock footage of programmers coding has me dying. At 6:45 bro literally reaches over to a keyboard on another computer to type with one hand.

BuffRobotiX

The road to hell is paved with good intentions.

gankam

That moment when you realize how to finally score models: 5:18

mindseye

The censorship on ChatGPT is my least favorite "feature." Give us all the data. Let us self censor. We're adults. Most of us.

JohnnyJiuJitsu

Understanding Constitutional AI - the paper and key concepts

Understanding Constitutional AI - the paper and key concepts

RLAIF vs. RLHF: the technology behind Anthropic’s Claude (Constitutional AI Explained)

Constitutional AI - Daniela Amodei (Anthropic

Claude AI Explained. How Constitutional AI Works

Constitutional AI | New Concept in Development

Anthropic Co-founder on Claude 3, Constitutional AI, and AGI | Ask More of AI with Clara Shih

Does This ChatGPT Rival Have A Conscience? - Claude’s Constitutional AI Explained Briefly

Langchain: Constitutional AI Principles

The Future of AI, Bioweapons, and Constitutional Rights

Julia and Ervin: Constitutional AI via Debate

Using Constitutional AI in LangChain

Keynote Speech: Breakthroughs in LLM Research and Constitutional AI

Constitutional AI: Claude vs ChatGPT 🤖

Dan Ho on AI and the Constitution - Human-Centered Artificial Intelligence Symposium

Unlocking the Future: The Power of Constitutional AI! 🤖✨

The AI Buzz, Episode #3: Constitutional AI, Emergent Abilities and Foundation Models

Reinforcement Learning with AI Feedback (RLAIF) | Constitutional AI

Constitutional AI: Harmlessness from AI Feedback

What is Claude AI? Constitutional AI The Journey from Claude 1 to 3.5

We need a AI constitution - Sam Altman #shorts #ai

Constitutional Challenges in the Age of AI

The Making of the American Constitution - Judy Walton

Episode 1: South Africa’s Constitution

The US Constitution is Miraculous - Will MacAskill & Dwarkesh Patel