Pyspark Vs Pandas - Benchmark Testing in Python - Memory ran out!!!!!

preview_player
Показать описание
Which library is better to handle #bigdata ? #python #pyspark or #pandas ?
For #datascience ,#dataanalytics and #machinelearning handling huge datasets is day to day job of every engineer.
This video shows a comparative study between Pandas and Pyspark.

Here is the link of the benchmark testing script:
Рекомендации по теме