PySpark Dataframes Tutorial | Introduction to PySpark Dataframes API | PySpark Training | Edureka

preview_player
Показать описание
This Edureka video will provide you with a comprehensive and detailed knowledge of Dataframes, and how to use Dataframes in PySpark. Below are the topics covered in the video:

1. Need for Dataframes
2. What are Dataframes
3. Dataframes Features
4. Sources of Dataframes
5. Hands On - Pyspark Dataframes

Subscribe to our channel to get video updates. Hit the subscribe button above.

--------------------------------------------

About the Course

Edureka’s PySpark Certification Training is designed to provide you the knowledge and skills that are required to become a successful Spark Developer using Python and prepare you for the Cloudera Hadoop and Spark Developer Certification Exam (CCA175). Throughout the PySpark Training, you will get an in-depth knowledge of Apache Spark and the Spark Ecosystem, which includes Spark RDD, Spark SQL, Spark MLlib and Spark Streaming. You will also get comprehensive knowledge of Python Programming language, HDFS, Sqoop, Flume, Spark GraphX and Messaging System such as Kafka.

----------------------------------------------

Spark Certification Training is designed by industry experts to make you a Certified Spark Developer. The PySpark Course offers:
Overview of Big Data & Hadoop including HDFS (Hadoop Distributed File System), YARN (Yet Another Resource Negotiator)
Comprehensive knowledge of various tools that fall in Spark Ecosystem like Spark SQL, Spark MlLib, Sqoop, Kafka, Flume and Spark Streaming
The capability to ingest data in HDFS using Sqoop & Flume, and analyze those large datasets stored in the HDFS
The power of handling real-time data feeds through a publish-subscribe messaging system like Kafka
The exposure to many real-life industry-based projects which will be executed using Edureka’s CloudLab
Projects which are diverse in nature covering banking, telecommunication, social media, and government domains
Rigorous involvement of an SME throughout the Spark Training to learn industry standards and best practices
---------------------------------------------------

Who should go for this course?

The market for Big Data Analytics is growing tremendously across the world and such strong growth pattern followed by market demand is a great opportunity for all IT Professionals. Here are a few Professional IT groups, who are continuously enjoying the benefits and perks of moving into Big Data domain.

Developers and Architects
BI /ETL/DW Professionals
Senior IT Professionals
Mainframe Professionals
Freshers
Big Data Architects, Engineers and Developers
Data Scientists and Analytics Professionals

-------------------------------------------------------

There are no such prerequisites for Edureka’s PySpark Training Course. However, prior knowledge of Python Programming and SQL will be helpful but is not at all mandatory.

--------------------------------------------------------

Рекомендации по теме
Комментарии
Автор

Thanks you very much... excellent video and highly recommended for beginner ..

sweetjam
Автор

Super... I am very thankful for this video

shivanidubey
Автор

Superb video thankyou, as a beginner this is perfect.

matthewfeeley
Автор

Hello @edureka, it is a great playlist. Can you please share the codes in all videos of the playlist?

yogeshtekwani
Автор

hi..nice and informative video..please let me know from where i can get the datasets for both use case. Thanks

rishianand
Автор

hey in filter function which you have applied in (11:37- time in vedio) column right, my column name has space in between, so when I am writing filter function it is showing incorrect syntax, whereas those columns with no spaces are just responding fine. Can you tell me what to do in this case.

ankushmishra