A Profit-Maximizing Reinforcement Learning-Based AI System in Python

preview_player
Показать описание
Dr. Soper provides a complete example of a profit-maximizing artificial intelligence system in Python that relies on Thompson Sampling-based reinforcement learning to decide which advertising campaign to show to a company's customers. Each part of the Python source code and the corresponding results are discussed in detail to ensure understanding.

Рекомендации по теме
Комментарии
Автор

Best content ever on reinforcement learning

saeedseyedhossein
Автор

Thank you. Short videos with sufficient explanation.

FatemehBagherpour-vu
Автор

great example and explanation. Thanks!

fabianlopez
Автор

Another super video. I am new to this topic. Don't me asking where is the data related to Advertising Campaigns? Thanks.

EustaquioSantimano
Автор

The choice of the b parameter for the Beta distribution was very insightful. Any additional resource you recommend for that topic?

seydoudia
Автор

In practice, true probabilities of campaign success are not known.what changes to the code should we make in that case for the information the mobile company does not have?

tomjoseph
Автор

I'm no expert at this stuff, but I suspect that a better way of choosing which campaign to try _might_ be to use a Gamma distribution (allows score-like values in [0-infinity) ) rather than a Beta distribution contorted to handle values that aren't counts or probabilities. The Gamma dist is very closely related to the Beta, hence why I suspect it might be the one to try.

robharwood
visit shbcf.ru