Apache Spark / PySpark Tutorial: Basics In 15 Mins

preview_player
Показать описание


Subscribe if you enjoyed the video!

Best Courses for Analytics:
---------------------------------------------------------------------------------------------------------

Best Courses for Programming:
---------------------------------------------------------------------------------------------------------

Best Courses for Machine Learning:
---------------------------------------------------------------------------------------------------------

Best Courses for Statistics:
---------------------------------------------------------------------------------------------------------

Best Courses for Big Data:
---------------------------------------------------------------------------------------------------------

More Courses:
---------------------------------------------------------------------------------------------------------

Рекомендации по теме
Комментарии
Автор

We use spark for our data pipeline at work -- we have tables with 10+ billion records, and our applications end up moving trillions upon trillions of records of data per month. Unfathomable numbers that spark is capable of. Great video!

mongooon
Автор

I'm a freelance data scientist and I really thankful to find this video, Gregg. Can't expect more! Thank you so much. Good luck with everything. 🙏

Nedwin
Автор

Your explanation is clear and the examples are practical and useful for beginners. Thanks a lot and keep it up!

AnVinhNguyen
Автор

Thank you for sharing to the world. I'm currently a supply chain analyst and aspiring supply chain data scientist 🙏

joshuabradshaw
Автор

I'm just getting into DataBricks and PySpark and this introductory tutorial was a great starter.

andersborum
Автор

Awesome video. I love using spark at work

ashleyb
Автор

You are awesome. Just delivering the right videos. Subscribed a few days back already but hit notifications on for you rn. Cause I wanna watch all your videos

mtamjidhossain
Автор

Greg, thank you so much. I am new to PySpark, and your video is very good in explanation and you did those simple example and I am able to follow you and write in my own Python Notebook to try it out. Will watch your DataFrame basics video next.

dominicaleung
Автор

Thanks for sharing, appreciate the quick run down on this stuff

ericcarmichael
Автор

No words man! Simply loved it. Appreciate your efforts.

parvathirajan.n
Автор

Just the type of samples we need to begin with. Meaningful content. thnx.

victorroy
Автор

Hi Greg! Great video, do you have one that explains how you convert spark to dfs and vice versa? We pull millions of rows from csvs and looking to do transformations before dropping into a db.

Also, how does the distributed computing work on a singular computer? Just distributes it across the cpu cores?

SpecialGreg
Автор

you are a great teacher... keep doing what you do my man

EclipsyChannel
Автор

Thanks Greg for the wonderful explanation !!

BHANUCHAUDHARY-ebul
Автор

This is an awesome video. I wonder, however, whether you could explain why the end results shows numbers with 12 characters. Didn't you set of numbers only go up to a million, which has 6 digits?

You also referred to your hour-long PySpark course. Would you be able to link to it in the show notes, please? Thanks!

andrewhancock
Автор

sc command is not working on my Colab as it's working on this vide... can anyone help?

Vlapstone
Автор

You are awesome, thanks for sharing your knowledge with the world

hsoley
Автор

@greg, plz share the link of 1 hr video.. I am unable to find it

Автор

Which big data tools one must learn for beginners and from where to learn( please provide some resources)

yashmodi
Автор

It look like using numpy, pandas what is the difference between this and pyspark.

keerthanamurugesan-xemr