Why do we use 'e' in the Sigmoid?

preview_player
Показать описание
Why do we use the mathematical constant "e" in the sigmoid function?

Рекомендации по теме
Комментарии
Автор

Thank you for making us love math even more.

halibrahim
Автор

thankyou for bringing this intuitive video, I just had this thought yesterday.
Please keep uploading videos like this it makes my intuitive more strong and closer to statistics.

pawanbhatt
Автор

Holy shit! I wish I could watch this video 6 years ago when I just got into machine learning. You did a great job! Thank you so much!

haojiang
Автор

Thanks! I really appreciate this bits of useful, subtle and insightful ideas about common objects in data science

ramirolopezvazquez
Автор

Makes sense. dy/dx = y (1- y ) if k=e. Great video!

SuperMtheory
Автор

are operations with 'e' are more expensive then with 2 or 3?

uusserrrreesssuuu
Автор

Nice explanation. Clarifies everything

apoorvatiwari
Автор

Good one !
Video edit a picture in picture effect of showing this in desmos. That would help as an interactive visual aide.

behrampatel
Автор

The best in the game for this kind of conteht

ChocolateMilkCultLeader
Автор

Actually there is a better reasoning but I am still not sure about it... Sigmoid is derived through the linear regression on log odds of the two classes... So mx+c = ln(p/(1-p)) which gives p = 1/(1+e^-(mx+c))

jasdeepsinghgrover
Автор

Huh, so I guess this is like a tradeoff of annoyances where using e upfront is just less annoying than discovering ln(k) much later.

JeremiahLam-sd
Автор

Really good explanation. Keep it up :)

giorda
Автор

This is a nice explanation, however one question is left open for me: We interpret the result of the sigmoid as probability. So sigmoid(x) results in some probability of something to be classified as some category. Let's assume the standard sigmoid(x) results in a value of 0.7. When I change sigmoid to use some other number k instead of e, this probability would change. Let's say it would now be 0.9 instead of 0.7. This appears to me as semantically completely different from 0.7. So I would conclude that with respect to the interpretation as probability, it is not arbitrary to choose e oder some other number k.

masster_yoda
Автор

sorry, you didn't explain anything.

nononnomonohjghdgdshrsrhsjgd