Common Pitfalls to Avoid in Object Detection Datasets - Object Detection Challenges & Solutions

Learn about best practices for creating high-quality datasets for Object Detection. “Data is the new oil”: unrefined, unpolished data will only produce a “GIGO” (Garbage In, Garbage Out) system!

Many Deep Learning practitioners overlook data quality and keep iterating on the model instead of improving their data. Here we discuss how to analyze your dataset and the common pitfalls in creating one. We also talk about how checking your data gives you insight into its quality, along with tips on how to improve the data and, eventually, the model's performance.
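As a rough illustration of the kind of dataset check discussed here, the sketch below counts bounding boxes per class and reports how unbalanced the classes are. This is not the code used in the video. The annotation layout (a dict mapping an image name to a list of `(class_name, x, y, w, h)` tuples), the function names `class_counts` and `imbalance_ratio`, and the sample data are all assumptions made for the example.

```python
from collections import Counter


def class_counts(annotations):
    """Count bounding boxes per class.

    `annotations` is an assumed layout: image name -> list of
    (class_name, x, y, w, h) tuples. Adapt the loader to your own format.
    """
    counts = Counter()
    for boxes in annotations.values():
        for box in boxes:
            counts[box[0]] += 1
    return counts


def imbalance_ratio(counts):
    """Ratio between the most and the least frequent class."""
    return max(counts.values()) / min(counts.values())


# Hypothetical toy annotations, only to show the calls.
annotations = {
    "img_001.jpg": [("car", 10, 20, 50, 40), ("car", 80, 30, 45, 35)],
    "img_002.jpg": [("car", 5, 5, 60, 50), ("person", 100, 40, 20, 60)],
    "img_003.jpg": [],  # an image with no labels is worth inspecting by eye
}

counts = class_counts(annotations)
unlabeled = [name for name, boxes in annotations.items() if not boxes]

print(counts.most_common())
print("imbalance ratio:", imbalance_ratio(counts))
print("images without labels:", unlabeled)
```

A high ratio hints at class imbalance, and the list of unlabeled images is a cheap starting point when looking for missing labels.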

We take an example of a freely available public dataset to discuss the various issues that you may encounter while solving an Object Detection problem.

⭐️ Time Stamps ⭐️
0:00-0:22: Motivation
0:22-1:15: The Dataset
1:15-3:03: Analyzing the Dataset
3:03-4:29: Tip: Visualize the Dataset
4:29-6:14: Understanding the Classes
6:14-7:54: Pitfall: Oversampling Frames from a Video
7:54-11:36: Data Variance vs Data Size
11:36-11:57: Tip: Compare Training and Validation Set
11:57-14:35: Training Validation Overlap
14:35-16:01: Tip: Check Data Statistics
16:01-17:01: Pitfall: Class Imbalance
17:01-20:33: Visualize Data Annotations
20:33-21:34: Pitfall: Misclassified or Incorrect Labels
21:34-27:03: Pitfall: Missing / Wrong Labels
27:03-29:22: Pitfall: Inconsistent Labels
29:22-31:11: Summary
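
Two of the pitfalls listed above, oversampling frames from a video and overlap between the training and validation sets, can be guarded against when the split is made. The sketch below is one possible approach and not the method shown in the video: it keeps only every N-th frame and assigns whole videos to either training or validation, so frames from one video never land in both. The `(video_id, frame_index, path)` record layout, the function name `split_by_video`, and the default values are assumptions for illustration.

```python
import random
from collections import defaultdict


def split_by_video(frames, val_fraction=0.2, keep_every=10, seed=0):
    """Subsample frames and split train/val at the video level.

    `frames` is an assumed layout: a list of (video_id, frame_index, path).
    """
    by_video = defaultdict(list)
    for video_id, frame_index, path in frames:
        # Keep every `keep_every`-th frame to avoid near-identical images.
        if frame_index % keep_every == 0:
            by_video[video_id].append(path)

    video_ids = sorted(by_video)
    random.Random(seed).shuffle(video_ids)  # reproducible shuffle

    # Hold out whole videos (at least one) for validation.
    n_val = max(1, round(len(video_ids) * val_fraction))
    val_ids = video_ids[:n_val]
    train_ids = video_ids[n_val:]

    train = [p for v in train_ids for p in by_video[v]]
    val = [p for v in val_ids for p in by_video[v]]
    return train, val


# Hypothetical data: 5 videos with 30 frames each.
frames = [
    (f"video_{v}", i, f"video_{v}/{i:04d}.jpg")
    for v in range(5)
    for i in range(30)
]
train, val = split_by_video(frames)
print(len(train), "training frames,", len(val), "validation frames")
```

With very few videos this leaves little or nothing for training, so the fraction and the sampling step need to be tuned to the dataset at hand.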

🤖 Learn from the experts on AI: Computer Vision and AI Courses
You have an opportunity to join the 5300+ (and counting) researchers, engineers, and students who have benefited from these courses and take your knowledge of computer vision, AI, and deep learning to the next level. 🤖

🔖Hashtags🔖
#AI #machinelearning #objectdetection #deeplearning #computervision #datasets #pitfalls #objecttracking #dataset #bestpractice
Comments
Author

Thanks, Sir, for this informative video. The content of this video is pure gold.

I have been doing Exploratory Data Analysis and overlays for a while, and many times people think it is a waste of time to go to such a granular level to visually examine the data.

Now I have this video of yours to prove my point. 😊

Thanks Sir. 🙏

vineetsharma
Author

Great video! While the background music isn't loud, to my ears, it is a little intrusive and not needed.

cyberhard
Author

It was greatly helpful. Glad that you uploaded it!

arjoai
Author

I like this video! It answered a lot of questions I had as a beginner. Thank you so much!

One question. This video is mainly about bounding box annotation. What about key-point annotation? I am going to annotate mice in a cage, which means the objects are highly occluded. But I would like to use key-point annotation to detect their behaviour. What do you think would be the best way to annotate consistently?

afjamo
Author

Excellent video. I realized I made multiple mistakes during the first iteration of my training. I am currently focusing on creating a better, more representative dataset.

atmadeeparya
Author

Hi! Thank you for this video, it's great. What software do you use to label? Thanks

masterkraft
Author

Thank you for the video! How should a dataset be prepared for long or short objects passing on a conveyor belt?

zy.r.
Author

In medical images, I used augmentation. Do you think that augmentation pollutes the sets? I used 3 augmentations per image, and I had some frames from the same video, so I am going to change that (plus, those were augmented).

seanolivieri
Author

Hi OpenCV, can you share the data stats code that is used in this example?

iramarshad
Author

I have one very urgent question. I have managed to run YOLO and train it on my local GPU. But all the tutorials only show how to create a custom dataset with 1 or 2 classes. How would I add my custom dataset to an existing one like COCO? Can you help?

QuarktaschemitSenf
Author

Come on, man! We know this data can only make someone a "YouTube data scientist". You need a minimum of 20,000-40,000 images per label to build a model with 70+ accuracy THAT YOU CAN SELL! This data is only good for impressing your gf :)

FirstNameLastName-fveu
Author

Discover the magic of AI-powered art creation in our new Mastering AI Art Generation Course. Learn how to create stunning AI-generated images. Get expert guidance, insider tips & tricks for creating beautiful art using cutting-edge generative AI technology.

LearnOpenCV