How to Remove Duplicate Rows with Power Query in Excel

preview_player
Показать описание
I demonstrate how to efficiently remove duplicate data in Excel using Power Query. I start the tutorial by explaining why traditional methods, like conditional formatting, might not always be effective for finding duplicates, specifically when there is no unique primary key.

I then discuss how to use the 'Remove Duplicates' feature under the Data tab, highlighting its benefits and limitations. For a more dynamic solution, I guide you through the application of Power Query to delete duplicate records.

This approach allows the file to continuously update itself even when new data is introduced, thus automatically maintaining the removal of duplicates. This video is for both beginners looking for a basic guide to check for duplicates in Excel and experienced users seeking an advanced and automated way to maintain data integrity.

Chapters:
00:00 Introduction
00:32 Review of Removing Duplicates
01:15 Using Conditional Formatting to Find Duplicates
01:58 Removing Duplicates Using Data Tab
02:51 Introduction to Power Query
03:15 Removing Duplicates Using Power Query
04:10 Demonstration of Power Query Efficiency
05:07 Conclusion

Check out my Excel and Teams courses:

And make sure you subscribe to my channel!

-- EQUIPMENT USED ---------------------------------

-- SOFTWARE USED ---------------------------------

DISCLAIMER: Links included in this description might be affiliate links. If you purchase a product or service with the links I provide, I may receive a small commission. There is no additional charge to you! Thank you for supporting my channel, so I can continue to provide you with free content each week!

#excel #powerquery #chrismenard
Рекомендации по теме
Комментарии
Автор

Liked and subscribed! A bit off topic here, but I saw you did some videos using the MAX function and was wondering if it was possible to tell Excel to find the max or latest date in a table and return a corresponding cell value that goes with that date in that is located in a column to the right of it. I feel like it should be possible but I haven’t come across any videos on it this far.

ngee
Автор

Hi chris
What is short cut key to select all columns if we want to check based on all columns?

gandhisunil
Автор

Thanks for the video, I would have avoided removing duplicates on customer number only thoughl; that is fine if a customer has only ever made one purchase, but what if the customer has made multiple purchases over several months? So easy to produce reports with incorrect data. Far better to use a transaction number grouped with all rows selected, produce a table per transaction number remove duplicates from the nested tables relying on more than just the one column.

roywilson