ImageGPT (Generative Pre-training from Pixels)

This video explores the exciting new 6.8-billion-parameter ImageGPT model! The researchers show that larger and better generative models learn better representations for tasks like ImageNet classification!

Thanks for watching! Please Subscribe!

Paper Links:
Comments
connor-shorten:

2:18 Auto-Regressive modeling of Pixels
4:18 Denoising Autoencoders: AR and BERT
5:40 GPT Architecture, No CNN Prior!
7:00 6.8 BILLION parameters!! Comparison with SimCLR, CPC, BigBiGAN
8:24 Generative Models and Representation Learning for Vision
10:30 Fine-Tuning with Linear Probes
11:50 Working around Quadratic Complexity of Self-Attention
12:50 Context Reduction
13:52 Results and Ablations
18:50 Promise of Longer Context Transformers and Visual Representation Learning
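
For anyone following the 2:18 and 10:30 chapters, here is a minimal sketch of the two core ideas: train a decoder-only transformer to predict the next pixel token of a flattened image, then evaluate the frozen features with a linear probe. Everything here (the tiny `PixelTransformer`, its sizes, the fake data) is an illustrative stand-in, not the paper's actual 6.8-billion-parameter model or training setup.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

# Hypothetical, tiny stand-in for ImageGPT: a decoder-only transformer
# trained to predict the next pixel token of a flattened image sequence.
class PixelTransformer(nn.Module):
    def __init__(self, vocab_size=512, d_model=128, n_layers=4, max_len=1024):
        super().__init__()
        self.tok_emb = nn.Embedding(vocab_size, d_model)
        self.pos_emb = nn.Embedding(max_len, d_model)
        layer = nn.TransformerEncoderLayer(d_model, nhead=4, batch_first=True)
        self.blocks = nn.TransformerEncoder(layer, num_layers=n_layers)
        self.head = nn.Linear(d_model, vocab_size)

    def forward(self, tokens, return_features=False):
        B, L = tokens.shape
        pos = torch.arange(L, device=tokens.device)
        h = self.tok_emb(tokens) + self.pos_emb(pos)
        # Causal mask: position i may only attend to positions <= i.
        mask = torch.triu(
            torch.full((L, L), float("-inf"), device=tokens.device), diagonal=1
        )
        h = self.blocks(h, mask=mask)
        if return_features:
            return h.mean(dim=1)  # pooled features for the probe
        return self.head(h)       # next-token logits

model = PixelTransformer()
opt = torch.optim.Adam(model.parameters(), lr=3e-4)
tokens = torch.randint(0, 512, (8, 1024))  # fake batch of flattened images

# Autoregressive objective: predict token t from all tokens before it.
logits = model(tokens[:, :-1])
loss = F.cross_entropy(logits.reshape(-1, 512), tokens[:, 1:].reshape(-1))
loss.backward()
opt.step()

# Linear probe: the backbone stays frozen; only a linear classifier is
# trained on its features. (The paper probes specific layers; pooling the
# last layer here is purely for brevity.)
probe = nn.Linear(128, 1000)  # e.g. 1000 ImageNet classes
with torch.no_grad():
    feats = model(tokens, return_features=True)
probe_logits = probe(feats)   # feed these to cross_entropy with real labels
```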
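And on the 11:50 and 12:50 chapters: self-attention cost grows quadratically with sequence length, and a 32×32 RGB image already flattens to 3,072 positions if every channel value gets its own token. The paper reduces context by clustering (R, G, B) values into a 512-color palette with k-means, so each pixel becomes a single token. A rough sketch of that quantization, with scikit-learn's KMeans standing in for the paper's exact clustering setup:

```python
import numpy as np
from sklearn.cluster import KMeans

# Fit a 512-color palette on a sample of training pixels, then map each
# pixel to its nearest palette index. One token per pixel instead of three
# channel values cuts the sequence length the transformer must attend over.
rng = np.random.default_rng(0)
images = rng.integers(0, 256, size=(100, 32, 32, 3))  # fake 32x32 RGB images
pixels = images.reshape(-1, 3).astype(np.float32)
sample = pixels[rng.choice(len(pixels), 20_000, replace=False)]

palette = KMeans(n_clusters=512, n_init=10, random_state=0).fit(sample)

# Quantize one image: (32, 32, 3) -> (1024,) palette indices in [0, 512).
tokens = palette.predict(images[0].reshape(-1, 3).astype(np.float32))
print(tokens.shape)  # (1024,)
```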

herp_derpingson:

Yannic Kilcher sent me here. Good channel. Subbed!

citiblocsMaster:

That ImageGPT result is crazy. It seems that you can replace inductive biases (translation invariance via convolutions) with just more data and compute.

Schematical:

Awesome stuff. Have to watch it a couple times to wrap my head around it.

geekionizado:

😩 Too awesome, I can't even process it

quadhd:

Can you use plain English please? It still sounds complex for beginners.
