Tiny Tutorial 2: Loading and Plotting Data in Python with Matplotlib & Pandas

preview_player
Показать описание


So why use Python instead of Excel? Well, Excel may be fine IF you don’t need to do anything to the data in the spreadsheet and IF the canned plots and charts that are available meet your needs...

BUT in the real world, more often than not, you need to clean up your data or do some analysis before plotting it, or you may want to customize your plot in ways Excel doesn't support. Although it may seem easy, doing ad hoc analysis in Excel is often a manual, awkward, and error prone process. In addition, it forces you to create new versions of your edited data file and enters you into the painful world of version control, documenting what changes you made to the data, and so on. You may have experienced this before -- what started out as a quick and painless plotting process can morph into a long and painful ordeal.

What’s nice about using Python is is that you can do all of the work right in one place - in addition to plotting, you can load, clean, wrangle, analyze, and plot your data seamlessly using powerful tools like the Pandas package that were designed for that purpose instead of trying to create dozens of worksheets to do the analysis by hand. What's more, when you do the work in Python or Pandas, you don't have to touch the original data and so you don't have to create a bunch of slightly different versions of it along the way or record notes on the process you followed. Pandas lets you work with the raw data, overlaying the transformations and operations on top of it and creating a complete record of the process you went through to get to your final result. It supports repeatable analysis, with no unwanted, undocumented changes to your data -- and better yet, no copies of the data file to deal with.

**Learn more about Enthought's Python course offerings here:
Рекомендации по теме
Комментарии
Автор

Great animation. Pithy presentation. Impressive. Thumbs up.

mdesm