Image Captioning using CNN and RNN | Image Captioning using deep learning

preview_player
Показать описание
In this video, I have explained how to perform Image Captioning using CNN-RNN Architectures.

By combining these two architectures, we can create a powerful model for image captioning.
There are two main components: the CNN encoder and the RNN decoder.
Dataset used for this tutorial is COCO dataset.

#nlp #computervision #deeplearning
Рекомендации по теме
Комментарии
Автор

Amazing video. Such an incredible way of explaining.Thank you so much ma'am

arnavthakur
Автор

Superb content ma’am. Your videos are amazing. Thank you very much 👍👍✌️✌️👏👏👏

Sunil-ezhx
Автор

I have a question : Does we feed the feature vector only at timestamp 0, not the <start> token
Then, from timestamp-1, we start feeding actual GT that is normally <start> token at timestamp -1
And does at test time we only feed feature vector also at timestap-0 so it generate <start> token at timestamp-0
We do not need to feed the start token in the test phase

LeeYeon-ex
Автор

Sir, I have a doubt,
During the validation, what if the decoder outputs an output sequence shorter than the true output sequence. How is the loss calculated in such cases. will categorical cross entropy work in such cases.
ANY ONE CAN ANSWER THIS THANKS

HARIS-qn
Автор

Amazing once again. Would love to see video captioning now !!!

angelospapadopoulos
Автор

Hi Aarohi,
I truly appreciate the videos you create; they cover very useful topics and are easy to understand thanks to your explanations 😃👍

I have a question related to a project I've been considering:
I possess a substantial collection of images and video clips from my garden, and I'd like to organize them in a way that allows me to track various elements such as trees, flowers, and other permanent features.
My goal is to build a database with keywords for each image and video to keep everything organized. This will make it easier to filter and retrieve specific items(files) for various projects later on.
By creating a model trained on labeled images, I hope to automatically assign the appropriate keywords to new additions in my collection. Is this possible?
It would be fantastic if you could create some tutorials around this concept. 🙏

alexbosneanu
Автор

Mam first upon thanks for your all valuable videos. Mam, please upload one video on image classification using YOLOv5 on a custom dataset in Google Colab. I am facing issues please upload a video on this topic.

nakulmali
Автор

Does we have to pad all caption to make the length of all captions same so that LSTM can processes in batches?
Kindly if you can explain

malikfahadsarwar
Автор

Hi Arohi
Could you please share the dataset link. I am confused with the labels with the images. It is not clearly mentioned in the video.

sanathspai
Автор

great! can you make similar to this using in an android app?

pinocchio
Автор

Hello mam, , , Can you please make a video to Install and setup CUDA ....
I have tried 10 But its not working....
Please help me with this mam 😢😢😢😢

vikramvikky