Все публикации

Graph Language Models

Graph Language Models EXPLAINED in 5 Minutes! [Author explanation 🔴 at ACL 2024]

How OpenAI made

How OpenAI made o1 'think' – Here is what we think and already know about o1 reinforcement learning

I am a

I am a Strange Dataset: Metalinguistic Tests for Language Models – Paper Explained [🔴 at ACL 2024]

Transformer LLMs are

Transformer LLMs are Turing Complete after all !?

Mission: Impossible language

Mission: Impossible language models – Paper Explained [ACL 2024 recording]

Discrete Diffusion Modeling

Discrete Diffusion Modeling by Estimating the Ratios of the Data Distribution – Paper Explained

My PhD Journey

My PhD Journey in AI / ML (while doing YouTube on the side)

[Own work] On

[Own work] On Measuring Faithfulness or Self-consistency of Natural Language Explanations

Supercharging RAG with

Supercharging RAG with Generative Feedback Loops from Weaviate

GaLore EXPLAINED: Memory-Efficient

GaLore EXPLAINED: Memory-Efficient LLM Training by Gradient Low-Rank Projection

Shapley Values Explained

Shapley Values Explained | Interpretability for AI models, even LLMs!

Stealing Part of

Stealing Part of a Production LLM | API protects LLMs no more

Genie explained 🧞

Genie explained 🧞 Generative Interactive Environments paper explained

MAMBA and State

MAMBA and State Space Models explained | SSM explained

Sparse LLMs at

Sparse LLMs at inference: 6x faster transformers! | DEJAVU paper explained

Transformers explained |

Transformers explained | The architecture behind LLMs

Direct Preference Optimization:

Direct Preference Optimization: Your Language Model is Secretly a Reward Model | DPO paper explained

LLM hallucinations discover

LLM hallucinations discover new math solutions!? | FunSearch explained

DALL-E 3 is

DALL-E 3 is better at following Text Prompts! Here is why. — DALL-E 3 explained

Adversarial Attacks and

Adversarial Attacks and Defenses. The Dimpled Manifold Hypothesis. David Stutz from DeepMind #HLF23

What is LoRA?

What is LoRA? Low-Rank Adaptation for finetuning LLMs EXPLAINED

Are ChatBots their

Are ChatBots their own death? | Training on Generated Data Makes Models Forget – Paper explained

The first law

The first law on AI regulation | The EU AI Act

Say that 3

Say that 3 times in a row. 😅