BigBird Research Ep. 4 - Where Does BigBird Help?

Weekly Research Group, April 29th, 2021

So far, I’ve struggled to get BigBird to outperform the original BERT (using the simple strategy of truncating the input to BERT’s 512-token limit). This week, the group helped me figure out how to craft a code example that best demonstrates where BigBird is most likely to be useful.
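For concreteness, here is a minimal sketch of that truncation baseline, assuming the Hugging Face `transformers` library and the standard `bert-base-uncased` checkpoint; the example texts and label count are placeholders, not my actual dataset:

```python
# Minimal truncation baseline for BERT, assuming Hugging Face `transformers`;
# the corpus and num_labels below are placeholders.
from transformers import BertTokenizerFast, BertForSequenceClassification

tokenizer = BertTokenizerFast.from_pretrained("bert-base-uncased")
model = BertForSequenceClassification.from_pretrained(
    "bert-base-uncased", num_labels=2
)

texts = ["A document far longer than 512 tokens ..."]  # placeholder corpus

# Everything past BERT's 512-token limit is simply discarded. Long-document
# models like BigBird exist precisely to avoid throwing away this tail.
inputs = tokenizer(
    texts, truncation=True, max_length=512, padding=True, return_tensors="pt"
)
outputs = model(**inputs)
```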

In the process, we touched on:
- The authors’ recommendation to use sparse attention only for sequences longer than 1,024 tokens (see the sketch after this list).
- Why BigBird is valuable for Question Answering.
- Possible strategies for addressing GPU memory concerns with BigBird (also touched on in the sketch below).
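To make the first and third points concrete, here is a hedged sketch using the Hugging Face BigBird implementation. The `attention_type` values ("block_sparse" and "original_full") and `gradient_checkpointing_enable()` are real `transformers` APIs, but the length-based switch and the choice of gradient checkpointing as the memory strategy are my illustration, not necessarily what the group settled on:

```python
# Sketch: pick BigBird's attention pattern based on input length, assuming
# the Hugging Face `transformers` BigBird implementation and the public
# "google/bigbird-roberta-base" checkpoint.
from transformers import BigBirdForQuestionAnswering

seq_len = 4096  # illustrative input length

# Per the recommendation above, sparse attention only pays off beyond
# ~1,024 tokens; for shorter inputs, fall back to full attention.
attention_type = "block_sparse" if seq_len > 1024 else "original_full"

model = BigBirdForQuestionAnswering.from_pretrained(
    "google/bigbird-roberta-base",
    attention_type=attention_type,
)

# One common way to ease GPU memory pressure: recompute activations in the
# backward pass instead of storing them, trading compute for memory.
model.gradient_checkpointing_enable()
```

The appeal for Question Answering is the same switch viewed from the task side: QA contexts routinely run to thousands of tokens, where truncation would cut off the passage containing the answer, so block-sparse attention lets the model see the whole document at a manageable cost.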

Outside of BigBird, we also talked about how to use a classifier to help label a large unlabeled dataset, and strategies for detecting the author of a piece of text.
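On the labeling point, one standard pattern is pseudo-labeling: run a trained classifier over the unlabeled pool and keep only its high-confidence predictions. A minimal sketch, assuming the `model` and `tokenizer` from the baseline snippet above have already been fine-tuned; the 0.95 threshold is an arbitrary illustration:

```python
import torch

unlabeled_texts = ["an unlabeled document ..."]  # placeholder pool

model.eval()
with torch.no_grad():
    enc = tokenizer(
        unlabeled_texts, truncation=True, max_length=512,
        padding=True, return_tensors="pt"
    )
    probs = torch.softmax(model(**enc).logits, dim=-1)

confidence, pseudo_labels = probs.max(dim=-1)

# Keep only confident predictions as training labels; uncertain examples
# stay in the unlabeled pool (or go to a human annotator).
for text, label, conf in zip(
    unlabeled_texts, pseudo_labels.tolist(), confidence.tolist()
):
    if conf > 0.95:
        print(label, text[:60])
```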

I’ll be implementing the group’s suggestions this week, and we’ll see how it went at the next session!
Comments

I love what you guys are doing. It will be great to compare BigBird vs. chunking. I’m working on a dataset where most examples are longer than 512 tokens, so I really need to know how the two approaches perform on it.

ifeanyindukwe

Hey Chris, really loved the video. Could you please share some resources on learning distributed training in PyTorch? As someone just getting started, distributed training is really intimidating. Perhaps you could also make a YouTube video explaining all the details.

stephennfernandes