All posts

Graph Language Models EXPLAINED in 5 Minutes! [Author explanation 🔴 at ACL 2024]

How OpenAI made o1 'think' – Here is what we think and already know about o1 reinforcement learning

I am a Strange Dataset: Metalinguistic Tests for Language Models – Paper Explained [🔴 at ACL 2024]

Transformer LLMs are Turing Complete after all!?

Mission: Impossible language models – Paper Explained [ACL 2024 recording]

Discrete Diffusion Modeling by Estimating the Ratios of the Data Distribution – Paper Explained

My PhD Journey in AI / ML (while doing YouTube on the side)

[Own work] On Measuring Faithfulness or Self-consistency of Natural Language Explanations

Supercharging RAG with Generative Feedback Loops from Weaviate

GaLore EXPLAINED: Memory-Efficient LLM Training by Gradient Low-Rank Projection

Shapley Values Explained | Interpretability for AI models, even LLMs!

Stealing Part of a Production LLM | API protects LLMs no more

Genie explained 🧞 Generative Interactive Environments paper explained

MAMBA and State Space Models explained | SSM explained

Sparse LLMs at inference: 6x faster transformers! | DEJAVU paper explained

Transformers explained | The architecture behind LLMs

Direct Preference Optimization: Your Language Model is Secretly a Reward Model | DPO paper explained

LLM hallucinations discover new math solutions!? | FunSearch explained

DALL-E 3 is better at following Text Prompts! Here is why. — DALL-E 3 explained

Adversarial Attacks and Defenses. The Dimpled Manifold Hypothesis. David Stutz from DeepMind #HLF23

What is LoRA? Low-Rank Adaptation for finetuning LLMs EXPLAINED

Are ChatBots their own death? | Training on Generated Data Makes Models Forget – Paper explained

The first law on AI regulation | The EU AI Act

Say that 3 times in a row. 😅