Python Pandas #1; Introduction to Pandas

Hello everyone, and welcome to this tutorial series on Pandas.
This video is a short introduction to Pandas.
Pandas is a package built for Python.
It is one of the most powerful and flexible open source
data analysis and manipulation tools.
It is mostly used for working with relational or labeled data.
Relational data is basically data in tabular format, with rows and columns.
The most common example is data that can be stored in an Excel spreadsheet.
Pandas has two main data structures.
One is the Series and the other is the DataFrame.
A Series is a one-dimensional array with custom indexes.
We will try to understand what custom indexes are, maybe in the upcoming videos. But for now,
think of a Series as a data structure that can accommodate a one-dimensional homogeneous array. Homogeneous means all the data inside a Series
should be of the same type.
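
As a rough illustration of the description above, here is a minimal sketch of a Series with a custom index. The labels and values are made up for the example and are not from the video.

```python
import pandas as pd

# A Series with custom (label-based) indexes instead of the default 0, 1, 2.
# The fruit names and prices are hypothetical sample data.
prices = pd.Series([1.5, 2.0, 0.75], index=["apple", "banana", "cherry"])

print(prices["banana"])  # look up a value by its label
print(prices.dtype)      # the whole Series shares one data type
```

Because every value here is a floating point number, the Series reports a single dtype for all of its data.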
The DataFrame is the most popular data structure in Pandas.
A DataFrame can accommodate relational or tabular data.
For example, the data inside an Excel sheet can be stored in a DataFrame.
So let's look at some of the features of Pandas.
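
To make the spreadsheet comparison concrete, here is a small sketch that builds a DataFrame from a dictionary of columns. The column names and rows are hypothetical sample data.

```python
import pandas as pd

# Spreadsheet-like data: each key is a column name, each list holds that column's rows.
# All names and values below are invented for illustration.
people = pd.DataFrame({
    "name": ["Asha", "Ben", "Chen"],
    "age": [31, 25, 40],
    "city": ["Pune", "Leeds", "Wuhan"],
})

print(people.shape)   # (number of rows, number of columns)
print(people["age"])  # selecting one column gives back a Series
```

Note that a single column of a DataFrame is itself a Series, which ties the two data structures together.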
-- Easy handling of missing data
There are built-in functions in Pandas that make it easy to work with data that has missing values.
-- Columns can be added to or deleted from a DataFrame easily with the built-in functions.
-- Intelligent label-based slicing,
fancy indexing, and subsetting
of large data sets
-- Functions which help with merging and joining of different data sets.
-- Robust IO tools for loading data
from flat files (CSV and delimited),
Excel files, databases,
and saving or loading data
from the ultrafast HDF5 format
-- Time series-specific functionalities such as
date range generation and frequency
conversion, moving window statistics,
moving window linear regressions,
date shifting and lagging.
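
Several of the features listed above can be tried out in one short sketch. The CSV text, column names and store table below are assumptions made up for the example, and the CSV is read from an in-memory string rather than a real file.

```python
import io
import pandas as pd

# IO tools: read CSV text (an in-memory stand-in for a file on disk).
# The second row deliberately has an empty "sales" cell.
csv_text = "day,store,sales\n2024-01-01,A,10\n2024-01-02,A,\n2024-01-03,A,30\n"
df = pd.read_csv(io.StringIO(csv_text), parse_dates=["day"])

# Missing data: the empty cell is read as NaN; count it, then fill it with 0.
n_missing = int(df["sales"].isna().sum())
filled = df["sales"].fillna(0)

# Adding and deleting columns.
df["sales_filled"] = filled
df = df.drop(columns=["sales"])

# Label-based slicing: index by date, then slice by label (both ends included).
by_day = df.set_index("day")
first_two = by_day.loc["2024-01-01":"2024-01-02"]

# Merging: join with a second hypothetical table on the shared "store" column.
stores = pd.DataFrame({"store": ["A"], "region": ["North"]})
merged = df.merge(stores, on="store", how="left")

# Time series: date range generation, a moving window mean, and shifting (lagging).
dates = pd.date_range("2024-01-01", periods=3, freq="D")
rolling_mean = by_day["sales_filled"].rolling(window=2).mean()
lagged = by_day["sales_filled"].shift(1)

print(merged)
print(rolling_mean)
```

Excel and HDF5 files are left out of the sketch because reading them needs extra optional packages to be installed alongside Pandas.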