Spark Interview Question | Scenario Based |DataFrameReader - Handle Corrupt Record | LearntoSpark

preview_player
Показать описание
In this video, we will learn how to handle the corrupted records in our dataset using Apache Spark. We will also learn about the mode available in option while reading a file as dataframe using Spark DataFrameReader API. Hope it will be useful for your interview preparation.

Blog link to learn more on Spark:

Linkedin profile:

FB page:

Github:
Рекомендации по теме
Комментарии
Автор

Explanation in Simple words.. Really good to follow.

madhubalagnanamoorthy
Автор

Thanks for sharing your knowledge. May you be blessed with more of it.

faizalimuhammadzoda
Автор

underrated content! Thanks for providing such info!

billcates
Автор

thanks Azar ...it is help ing for invterwive prospect

moulalichebolu
Автор

Hi videos are very helpful and it would be great if you can share the sample files also for practice

areeshkumar
Автор

Hi, what if i want the corrupted data for checking which records are going for reject…
how to get the corrupted records into another dataframe ?

nareshgiduthuri
Автор

But In real time it we need to capture or save malformed records for analysis. So wrote those malformed ones in textfile. Use either option and capture the malformed ones.

umamahesh
Автор

Really useful.Can you please make complete playlist for pyspark tutorial, covering all topics.

varungupta
Автор

How it identifies those records are invalid ? Because we gave schema for all 3 columns with StringType, also we setted nullable = True, in that case those records are valid only right ? Kindly help me to understand this...

prabuchandrakumar
Автор

Is there a way to store the malformed rows in a separate file. Or count the number of records dropped in the dropmalformed mode?

ASHISH
Автор

Could you please explain why u renamed the column to c1, c2 And why it is not advisable for header true

amitpadhi
Автор

How will one fine all available options for the given api method/option of spark ??

SpiritOfIndiaaa
Автор

Do you have Facebook page as well?
Please provide the link

ManishKumar-sreu
Автор

How to capture bad records in separate file for analysis

MultiSuresh
Автор

How to avoid bad email format in dataframe, i want only email correct format data..?

kollurusrinivasarao