[Own work] On Measuring Faithfulness or Self-consistency of Natural Language Explanations

preview_player
Показать описание
Excited to share my ACL 2024 presentation on my almost-last PhD paper about LLM self-explanations! 🎓📚
Are you joining ACL 2024 in Bangkok? Ping me—let's chat!

(follow-up paper for vision and language models):

Thanks to our Patrons who support us in Tier 2, 3, 4: 🙏
Dres. Trost GbR, Siltax, Vignesh Valliappan, Michael, Sunny Dhiana, Andy Ma

▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀
🔥 Optionally, pay us a coffee to help with our Coffee Bean production! ☕
Join this channel to get access to perks:
▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀

🔗 Links:

#AICoffeeBreak #MsCoffeeBean #MachineLearning #AI #research​ #ACL2024NLP #PhDLife

Video editing: Nils Trost
Music 🎵 : Bella Bella Beat - Nana Kwabena
Рекомендации по теме
Комментарии
Автор

Congrats Doctor!! :-) Looking forward for your future work!

MikeAirforce
Автор

Thanks for sharing your work! Always great so see what you're up to!

DerPylz
Автор

Cool 😎 your explanation was very understandable

serta
Автор

Congrats Dr. Letitia!!!! Wow, YOU :-D :-) P.S. We missed you!!

beatrixcarroll
Автор

This sounds very useful! LLM users tend to assume that just because it writes like a human, that it can introspect and reason about its thought processes, which of course not a given. But it’s great to see progress on measuring this ability (or at least self-consistency) so that newer models can be more ergonomic.

alexkubiesa
Автор

Yay, new video!
Thanks for letting me pass yesterday lol

fingerstyledojo
Автор

Congrats on the PhD! This is really valuable work! I'm currently trying to squeeze out as much reasoning capabilities as I can out of small LLMs (7-15B) for my company's product, and I'd love a longer video or recorded talk going into details of your findings, any patterns you've found that contribute to improving or reducing self-consistency, or any insights on which existing models or training corpora result in better self consistency and reasoning capabilities. If you have any pointers, I'd appreciate it!

MaxShawabkeh
Автор

1 minute ago for non members ... good to see ya

nitinss
Автор

So you came up with a method, didn't have time to explain the method to us, and didn't show us that it works. Great.

If you still have time before Bangkok I would suggest rerecording and focusing on the implementation and interpretation of results rather than the context and wordy descriptions.

anluifb