PyTorch RNN example (Recurrent Neural Network)

In this video we go through how to code a simple RNN, GRU, and LSTM example. The focus is on the architecture itself rather than the data, and we use the simple MNIST dataset for this example.
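
As a rough illustration of what the video covers, here is a minimal sketch of such a model (the class name, hyperparameters, and variable names below are my own assumptions, not taken verbatim from the video): each 28x28 MNIST image is read as a 28-step sequence of 28-feature rows, and the outputs of all time steps are flattened into a final linear classifier.

```python
import torch
import torch.nn as nn

class SimpleRNN(nn.Module):
    """RNN classifier that reads each MNIST image row by row."""
    def __init__(self, input_size=28, hidden_size=256, num_layers=2,
                 sequence_length=28, num_classes=10):
        super().__init__()
        self.num_layers = num_layers
        self.hidden_size = hidden_size
        self.rnn = nn.RNN(input_size, hidden_size, num_layers, batch_first=True)
        # Every time step is fed to the classifier, hence hidden_size * sequence_length
        self.fc = nn.Linear(hidden_size * sequence_length, num_classes)

    def forward(self, x):
        # x: (batch, sequence_length, input_size)
        h0 = torch.zeros(self.num_layers, x.size(0), self.hidden_size)  # initial hidden state
        out, _ = self.rnn(x, h0)            # (batch, sequence_length, hidden_size)
        out = out.reshape(out.size(0), -1)  # flatten all time steps
        return self.fc(out)

model = SimpleRNN()
fake_batch = torch.randn(64, 28, 28)        # 64 MNIST-like images
print(model(fake_batch).shape)              # torch.Size([64, 10])
```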

Comments

I have been struggling with my master's degree. Your tutorials really help me a lot. What distinguishes your tutorials from others is that they are very practical and hands-on. I have learned the basic theory of deep learning, but implementing it is the key! Thanks for your hard work. God bless you!

ChizkiyahuOhayon

Really enjoy how you leave the theory for other videos and get right to the hands-on part, thank you!

vaisuliafu

Never thought of doing image-related processing with RNNs xD
Nice tutorial, thanks. I like this playlist for its clear explanations of the code, and yeah, the intro is my favourite <3

arsiveparkour

May I ask how you would define your input size and sequence length if you had word embeddings of num_instances by num_features?

bestest
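
One way to read the question above in code terms (the names and sizes below are purely illustrative): for word embeddings of shape num_instances by num_features, the embedding dimension plays the role of input_size and the number of tokens plays the role of the sequence length.

```python
import torch
import torch.nn as nn

num_instances, num_features = 12, 300        # 12 tokens, 300-d embeddings (made-up numbers)
embeddings = torch.randn(num_instances, num_features)

rnn = nn.RNN(input_size=num_features, hidden_size=128, batch_first=True)
out, _ = rnn(embeddings.unsqueeze(0))        # batch of 1 -> (1, num_instances, num_features)
print(out.shape)                             # (1, 12, 128): sequence_length = num_instances
```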

Thanks for the tutorial. What I think regarding the LSTM having better performance when only taking the last time step's output is that the LSTM then has the chance to develop and accumulate a good decision, since this is a classification problem (i.e. many-to-one). That is because the last output is conditioned on ALL the previous states. When the intermediate states are also fed into the FC layer, the accumulated learning is somehow "partially" phased out by the immature decisions represented in the earlier hidden states, if I may say so :)

awadelrahman
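
For readers wondering what "only taking the last time step" looks like next to the flatten-everything variant, here is a small sketch (the sizes and variable names are illustrative, not the video's exact code):

```python
import torch
import torch.nn as nn

batch, seq_len, input_size, hidden_size, num_classes = 64, 28, 28, 256, 10
lstm = nn.LSTM(input_size, hidden_size, batch_first=True)
x = torch.randn(batch, seq_len, input_size)
out, _ = lstm(x)                                # out: (batch, seq_len, hidden_size)

# Variant 1: feed every time step to the classifier
fc_all = nn.Linear(hidden_size * seq_len, num_classes)
logits_all = fc_all(out.reshape(batch, -1))

# Variant 2: use only the last time step, which has already "seen" the whole sequence
fc_last = nn.Linear(hidden_size, num_classes)
logits_last = fc_last(out[:, -1, :])

print(logits_all.shape, logits_last.shape)      # both torch.Size([64, 10])
```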

Hello, very nice tutorial. I have a question: I know that RNNs can take variable-length sequences, but when it comes to mini-batches we have to pad them to the same length. Why? Why can't we have variable-length sequences within a mini-batch?

nikhilkumar
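
On the padding question: a mini-batch is a single rectangular tensor, so sequences of different lengths cannot sit in it directly; the usual workaround is to pad to a common length and, optionally, tell the RNN the true lengths via pack_padded_sequence so the padded positions are skipped. A minimal sketch with made-up sequences:

```python
import torch
import torch.nn as nn
from torch.nn.utils.rnn import pad_sequence, pack_padded_sequence

# Three made-up sequences of different lengths, 5 features per time step
seqs = [torch.randn(length, 5) for length in (7, 4, 2)]
lengths = torch.tensor([7, 4, 2])

padded = pad_sequence(seqs, batch_first=True)              # (3, 7, 5), zero-padded
packed = pack_padded_sequence(padded, lengths, batch_first=True)

rnn = nn.RNN(input_size=5, hidden_size=8, batch_first=True)
out, h_n = rnn(packed)   # out is a PackedSequence; padded positions are ignored internally
```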

Thanks! Why don't you need to specify sequence_length in the architecture itself? Is there a specific form the input to the model has to take so that it detects the sequence length on its own?

anas.k
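
Regarding sequence_length not appearing in the constructor: nn.RNN simply loops over however many time steps the input tensor contains, so the length is taken from the input itself; only a classifier head that flattens all time steps (as discussed in other comments here) ties the model to one fixed length. A quick illustration:

```python
import torch
import torch.nn as nn

rnn = nn.RNN(input_size=28, hidden_size=64, batch_first=True)

for seq_len in (10, 28, 100):
    x = torch.randn(4, seq_len, 28)      # (batch, seq_len, input_size)
    out, _ = rnn(x)
    print(out.shape)                     # (4, seq_len, 64) -- the length comes from the input
```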

Nice tutorial! However, I have a question. Do you know of any references where all time steps are combined for the classification at the end? I've not seen that before, and I'm wondering what the point is. Shouldn't the last time step's output be the best predictor anyway?

patloeber

Does it perform the same if you put a sequence of rows or a sequence of columns as the input?

zrmsraggot
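
For anyone who wants to test the rows-vs-columns question themselves, switching between the two interpretations is just a transpose of the image tensor (illustrative shapes below):

```python
import torch

images = torch.randn(64, 28, 28)           # (batch, height, width), MNIST-like

rows_as_sequence = images                   # 28 time steps, each a row of 28 pixels
cols_as_sequence = images.transpose(1, 2)   # 28 time steps, each a column of 28 pixels
print(rows_as_sequence.shape, cols_as_sequence.shape)  # both torch.Size([64, 28, 28])
```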

Thanks for the explanation. I have tried a BiLSTM on the SALAMI dataset for detecting boundaries, but the F1 score decreases after 20 epochs. Could you please elaborate on how I might fix this overfitting issue while using the same model?

aneekaazmat

Hi Aladdin. The video is to the point and awesome through the implementation part. I think you could have added a quick hacky intro to RNN/GRU/LSTM as well; otherwise I really liked this one.

ashishjohnsonburself

Why do you take the product of hidden_size and sequence_length as the input size for nn.Linear() at 6:00?

orjihvy

Thanks Aladdin. Great tutorial.
By the way, I was trying to test my model on individual samples. I realized that it does not matter whether the shape of my individual image is (1, 28, 28) or (28, 28); my model accepts it and gives me correct results. Why would that be? Shouldn't the model reject an image of shape (28, 28), since it expects the shape (batch, seq_len, features)?

somyekathait

I really like the paper walkthrough tutorials. I am a loyal supporter. I expect you'll deliver more cool stuff.

donkkey

A slight heads-up for people trying this out themselves (eagle-eyed observers, never mind):
With a learning rate of 0.005, the vanilla RNN does not learn at all and you will end up not converging (abysmal accuracy). Use a smaller learning rate for RNNs (0.001); you can keep the default 0.005 for the GRU and LSTM implementations to replicate the results. Great video nevertheless!

praladprasad
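
A small sketch of where that learning rate would go, assuming an Adam optimizer and cross-entropy loss (the learning-rate values are the ones from the comment above, not independently verified):

```python
import torch.nn as nn
import torch.optim as optim

model = nn.RNN(input_size=28, hidden_size=256, num_layers=2, batch_first=True)  # stand-in model
criterion = nn.CrossEntropyLoss()
# Per the comment above: 0.001 for the vanilla RNN; 0.005 reportedly works for GRU/LSTM
optimizer = optim.Adam(model.parameters(), lr=0.001)
```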

Hi Aladdin,

Thank you so much for your amazing tutorial videos.
I was wondering about only using the last hidden state in the LSTM: should the code be `self.fc = nn.Linear(sequence_length, num_classes)` rather than `self.fc = nn.Linear(hidden_size, num_classes)`?

Best,
Yu

yuqi

Can I ask why there is a torch.zeros in the forward method (i.e. the reason for it)? If you have any resource to share, that would be great.

joxa
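
On the torch.zeros question: it builds the initial hidden state h0, i.e. the "memory" the RNN starts from at time step 0, with one zero vector per layer per sample. In PyTorch it can also be omitted, in which case the hidden state defaults to zeros anyway. A sketch (sizes are illustrative):

```python
import torch
import torch.nn as nn

num_layers, hidden_size = 2, 64
rnn = nn.RNN(input_size=28, hidden_size=hidden_size, num_layers=num_layers, batch_first=True)

x = torch.randn(16, 28, 28)                            # (batch, seq_len, input_size)
h0 = torch.zeros(num_layers, x.size(0), hidden_size)   # explicit zero start state
out_a, _ = rnn(x, h0)
out_b, _ = rnn(x)                                      # h0 defaults to zeros anyway
print(torch.allclose(out_a, out_b))                    # True
```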

What I'm always missing is a few inference examples with the final model and the code to do so.

holthuizenoemoet
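
Since inference with the finished model comes up here, a hedged sketch of what it could look like; the tiny stand-in classifier below only mimics the interface (input of shape (batch, 28, 28), logits of shape (batch, 10)) and should be replaced by the actual trained model:

```python
import torch
import torch.nn as nn

class TinyRNNClassifier(nn.Module):
    """Stand-in for the trained model; swap in your own trained instance."""
    def __init__(self):
        super().__init__()
        self.rnn = nn.RNN(28, 64, batch_first=True)
        self.fc = nn.Linear(64, 10)

    def forward(self, x):
        out, _ = self.rnn(x)
        return self.fc(out[:, -1, :])      # classify from the last time step

model = TinyRNNClassifier()
model.eval()                               # inference mode (affects dropout/batchnorm)
image = torch.randn(28, 28)                # one MNIST-like image (28 rows x 28 columns)
with torch.no_grad():
    logits = model(image.unsqueeze(0))     # add the batch dimension -> (1, 28, 28)
    predicted_digit = logits.argmax(dim=1).item()
print(predicted_digit)
```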

As per the PyTorch documentation, the shape of the output of the nn.RNN cell is (seq_length, batch_size, hidden_size), so the reshaping operation should be out.reshape(out.shape[1], -1).

tapaskumarroy
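
A note on the comment above: the (seq_length, batch_size, hidden_size) layout is the default only when batch_first=False; if the module is built with batch_first=True (which the (batch, seq_len, features) inputs discussed elsewhere in these comments suggest), the output comes back as (batch_size, seq_length, hidden_size) and out.reshape(out.shape[0], -1) is the right flatten. A quick check:

```python
import torch
import torch.nn as nn

x = torch.randn(32, 28, 10)  # intended as (batch=32, seq_len=28, features=10)

out_default, _ = nn.RNN(10, 16)(x.transpose(0, 1))        # default layout: (seq_len, batch, hidden)
out_batch_first, _ = nn.RNN(10, 16, batch_first=True)(x)  # batch_first layout: (batch, seq_len, hidden)

print(out_default.shape)      # torch.Size([28, 32, 16])
print(out_batch_first.shape)  # torch.Size([32, 28, 16])
```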

Hey, I have been following your tutorial series and I have a doubt! Why are we getting such overfitted results after training for just 2 epochs, even though we're using an RNN, which isn't well suited to image data?

harjyotbagga