Displaying duplicate records in PySpark | Using GroupBy | Realtime Scenario

Hi Friends,

In this video, I explain the code to display duplicate records from a DataFrame in PySpark.

Please subscribe to my channel for more interesting learnings.
Comments
Author

Hi Sravana, awesome!! Thank you for posting this!!

sudippandit
Author

Ma'am, all your videos are very informative; I would even say they are the best videos for making concepts clear. I have one suggestion: if you explain a concept with the help of another concept that you have already covered in a video, please mention that video's link in the description.
Thank you, keep it up.

yogeshkale
Author

Hi sis, I have a problem: I need to replace every 5th occurrence of '|' with a new

shiyamprasath
Author

Superb. Can you please make a video on how to carry a column value from the DataFrame of the previous ADF run over to the DataFrame of the next ADF run?

tirumalrc
Author

Is there any way to work on RDDs instead of DataFrames?

sreerajnr