Towards Reliable Use of Large Language Models: Better Detection, Consistency, and Instruction-Tuning

Christopher D. Manning (Stanford University)
Large Language Models and Transformers

While large pre-trained language models (LLMs) have enabled impressive results on a wide variety of tasks, even the largest existing models will answer inconsistently or head off in weird directions. For companies to be able to gain the benefits of these models in production use, it is now necessary to build an extensive tool ecosystem around the LLM engine, just as cars have seat belts, dash warning lights, and anti-lock brakes. In this talk, I will show recent work on three such tools. (1) ConCoRD: a lightweight method for improving LLM consistency through the use of off-the-shelf Natural Language Inference models. (2) DetectGPT: a method to better detect LLM-generated text by looking at the curvature of the model's probability function. (3) Direct Preference Optimization (DPO): a new way of learning to steer LLMs from human preference data without needing to learn a reward model. Joint work with Eric Mitchell, Chelsea Finn, and many other Stanford coauthors.
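
To make the first idea concrete: ConCoRD asks an off-the-shelf NLI model whether pairs of a model's candidate answers contradict each other, then selects the jointly most consistent set (the paper formulates this as weighted MaxSAT). Below is a minimal Python sketch of that idea, not the paper's implementation; the checkpoint name, the penalty weight, and the brute-force search are illustrative assumptions.

# Toy sketch of ConCoRD-style consistency re-ranking (the real method solves
# a weighted MaxSAT problem; this brute-force version only conveys the idea).
from itertools import product
from transformers import pipeline

# Off-the-shelf NLI model; this particular checkpoint is an illustrative choice.
nli = pipeline("text-classification", model="roberta-large-mnli")

def contradiction_prob(a: str, b: str) -> float:
    """Probability that statement b contradicts statement a, per the NLI model."""
    scores = nli({"text": a, "text_pair": b}, top_k=None)
    return next(s["score"] for s in scores if s["label"] == "CONTRADICTION")

def most_consistent(candidates, base_scores, penalty=1.0):
    """Pick one answer per question, trading the base model's confidence
    against NLI-detected contradictions among the chosen answers.
    candidates: list of per-question answer lists.
    base_scores: base_scores[q][answer] = base model's confidence in answer.
    Exhaustive search over combinations, so suitable only for tiny examples."""
    best, best_score = None, float("-inf")
    for picks in product(*candidates):
        score = sum(base_scores[q][a] for q, a in enumerate(picks))
        for i in range(len(picks)):
            for j in range(i + 1, len(picks)):
                score -= penalty * contradiction_prob(picks[i], picks[j])
        if score > best_score:
            best, best_score = picks, score
    return best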
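
DetectGPT's curvature test can also be stated compactly: text sampled from a model tends to lie near a local maximum of that model's log-probability, so perturbed rewrites of it score noticeably lower, while human-written text shows no such consistent drop. A hedged sketch, assuming GPT-2 as the scoring model and leaving the perturbation function abstract (the paper uses T5 mask filling):

# Sketch of the DetectGPT curvature score: the source text's log-probability
# minus the mean log-probability of perturbed rewrites. Model-written text
# tends to yield a clearly positive score; human text hovers near zero.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

tok = AutoTokenizer.from_pretrained("gpt2")  # illustrative scoring model
model = AutoModelForCausalLM.from_pretrained("gpt2").eval()

def avg_log_likelihood(text: str) -> float:
    """Mean per-token log-probability of text under the scoring model."""
    ids = tok(text, return_tensors="pt").input_ids
    with torch.no_grad():
        out = model(ids, labels=ids)
    return -out.loss.item()  # loss is mean negative log-likelihood, so negate

def detectgpt_score(text: str, perturb, n: int = 20) -> float:
    """perturb is any function returning a semantically similar rewrite;
    higher scores suggest model-written text."""
    perturbed = [avg_log_likelihood(perturb(text)) for _ in range(n)]
    return avg_log_likelihood(text) - sum(perturbed) / n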
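
Finally, DPO replaces the usual RLHF pipeline (fit a reward model, then run PPO) with a single classification-style loss on preference pairs, using a frozen reference model in place of an explicit reward model. A minimal PyTorch sketch of the published loss; the function name and beta value are illustrative:

# Minimal sketch of the DPO objective. Each argument is the summed
# log-probability of a whole completion under the policy or the frozen
# reference model; y_w is the preferred completion, y_l the dispreferred one.
import torch.nn.functional as F

def dpo_loss(policy_logp_w, policy_logp_l, ref_logp_w, ref_logp_l, beta=0.1):
    """-log sigmoid(beta * [log pi(y_w)/ref(y_w) - log pi(y_l)/ref(y_l)])."""
    chosen = beta * (policy_logp_w - ref_logp_w)
    rejected = beta * (policy_logp_l - ref_logp_l)
    return -F.logsigmoid(chosen - rejected).mean()

Because the reference log-probabilities are constants, minimizing this loss directly increases the relative likelihood of preferred completions, which is the work a learned reward model plus PPO would otherwise do.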
Comments

stuffzoom:
Still need to learn more about the current state of the field, but hearing Chris Manning talk is just impressive: everything he says seems so obvious that it makes me think folks were just hacking around without really thinking. What a brilliant guy (and a brilliant team)!

But then again... it's one community, and everyone builds on what the folks before them found out.
smnt:
Love that Scott Aaronson is in the crowd asking questions.
AM-qxbq:
Great talk, even though the content is surprising given the title.
SantoshGupta-jnwn:
I wonder what was wrong with the HF TRL PPO implementation.