Compare Two excel or csv files using python Pandas library.

preview_player
Показать описание
Hello everyone, this is my first video on YouTube😄. This video demonstrates how to compare two excel files using python pandas library. Please let me know in the comments section if I can do this in a better way.
Also I am using Apple new MacBook Air with Apple own M1 silicon😁. If you have any question regarding it, let me know in the comment section.

Github link for code-
Рекомендации по теме
Комментарии
Автор

Hi Gulshan, the content was really good, keep up doing the good work, just if u can explain each line of code with all the operators functionalities, then it would better for newbies like me . Thank you for your efforts 👍😌

vijyalaxmi
Автор

this was a great content man, you should continue to do so. Thank you

Sece
Автор

Very well explained, and not too long. Thank you!

MelvinBagby
Автор

Awesome content and helpful too😁👍. Please keeping posting such good content. Waiting for your posts.😁👍

khushitayal
Автор

Hi gulshan this content is very much helpful

srikishoreka
Автор

thank you! finally what I'm looking for :):)

joxxen
Автор

Great tutorial.. thank you so much brother

ajaynt
Автор

Hello Gulshan just i have to check if two excel sheets are matching or not.can you help me out

Gamer-txgy
Автор

I'm trying the same code but for some reason when I execute the first "df_join" it is not creating a new .xlsx file with both files joined. Any guess??

Mathias-cqzo
Автор

And i have a doubt if a sheet contain a file path with name and sheet contain only name then how we should remove that file path and check both the sheet ?

srikishoreka
Автор

The columns are in not same the order, it will works ?

boxersony
Автор

Hi, in spark scala, I want to compare CSV file using test cases, but I don't know how to compare those, is it based on number of columns

uppalaadarsh
Автор

Hello, this is awesome. Works like a charm 😮 Is there any way so we can get the same elements from two files? Not the difference but elements/values that are the same. I tried modifying on my own but I can’t make it 😅

Again, great video!

piotrjanuszek
Автор

Hi, im getting AttributeError: 'numpy.ndarray' object has no attribute 'merge' when i run the code you have shared. Im using python 3.10. Please help how to run the same in python 3.10

aditishekhar
Автор

I need to simply join the two extracted DB values and get the result in a report with all the values and the mistmatched values should be highlighted. Can u make a video for it

sampathkumar
Автор

How can find the matched records from both files, with specified multiple columns

shrinivasulunandyala
Автор

I want to append data from different CSV(total 12 ) files into one file .. the coloums are same in all the files .. But the data is different .. how do I do that?
It's just a 1 year data and I want to add in the same sheet

ujjwalwadera
Автор

Hi thanks for this video, i have one scenario like two different XL files and having different values and some are same so i want both compared result in one new result file and both files having different rows and columns can you please make a video

manuprakashmp
Автор

You are trying to merge on object and datetime64[ns] columns. If you wish to proceed you should use pd.concat . getting this error.
Since I don't know which column will have what kind of datatype in both the sheets how can I change datatype for all and perform merge operation?

sagarkadam
Автор

Jeez I cannot thank you enough. I am going to see if I can figure out my problem on my own, but if you can assist I would be so grateful!
I have this as the dataset for two different CSV when I do your DF_Join command. Thank you so much - does exactly as intended and matches data from one day on same line as data from next day. any indescrepencies show that data did not appear in one of two days.

Symbol, Type, Strike Price, Exp Date /// Symbol, Type, Strike Price, Exp Date
MS, Call, 85, 4/21/2023 /// MS, Call, 85, 4/21/2023

HOWEVER, Mr. Kumar, I had to edit both CSV to remove a certain column. This column is Volume. How can I make it so itll show the volume but not try and match that specific colum? If I include volume (because this is the only column that always differs no matter the row (1, 2, 3, 4) or column (A, B, C, D)), then it just separates everything and is not helpful. Again, I am forever grateful for this video and I will try to figure it out on my own. I believe i have to adjust this lambda x:x but i am not sure.

zackszigeti
join shbcf.ru