Introduction to Deep Learning (I2DL 2023) - 5. Scaling Optimization

preview_player
Показать описание

Introduction to Deep Learning (I2DL) - Lecture 5
TUM Summer Semester 2023
Prof. Niessner
Рекомендации по теме
Комментарии
Автор

It was noted that all samples are included in a distinct batch without replacement. Is it safe to assume that the final batch will just be a residual batch of final datapoints and have cardinality less than all the other batches?

As unless your total number of samples is divisible by a power of 2 (and a power of 2 appropriate for your GPU), it wouldn't be possible to have batches of equal cardinality?

daniellevy