Distributed Machine Learning with Apache Spark / PySpark MLlib

preview_player
Показать описание


Don't forget to subscribe if you enjoyed the video :D
Рекомендации по теме
Комментарии
Автор

Thank you for the video on mllib, I haven't watch much yet, but looks promising.
The machine learning stuff starts about 12:00 - the beginning is a warm up to PySpark. Chapters/timestamps would have been helpful for a 40 min video (with each chapter being a different stage in the process or function).

nickie
Автор

Great tutorial, Greg - really appreciate how you distilled such a comprehensive overview into a single video. Would you consider doing a video showing how to create a complete ML pipeline -- i.e., using output from Imputer(), StringIndexer(), OneHotEncoderEstimator(), VectorAssembler(), and VectorIndexer() -- for a dataset with multiple categorical and numerical features?

erint.
Автор

Thx Greg ! It's a very good tutorial from pyspark ! comprehensive with a lot of examples

davtg
Автор

Thank you for this tutorial on PySpark !

maximinmaster
Автор

Good information Greg! Thanks for sharing.

drjabirrahman
Автор

Thank you
I just found one thing is confusing
which is that you did the standard scaling AFTER the merging into one column
shouldn't have you done it for each column before the merging?

ammaralhawashem
Автор

Thanks. That was pretty comprehensive.

Value_Pilgrim
Автор

I wish I had seen this when I took Econ 424(ml) at uw😂

shiminglu
Автор

White Christopher Hernandez Kenneth Martinez Thomas

LiamAlixsons-ob