L19/5 Truncated Backprop through Time

preview_player
Показать описание
Dive into Deep Learning
UC Berkeley, STAT 157

Slides are at
The book is at

Truncated Backprop
Рекомендации по теме
Комментарии
Автор

In slide for 'Latent state gradient'(video time 10:44), why does he use different index for `x` and `h` in function `f` which is used for hidden state ?(j for x, j-1 for h) In previous slide, he used same index(t-1), h_t = f(h_t-1, x_t-1, w). What am I missing here?

박지호-zk