# Description

Pandas is a package for data manipulation and analysis in Python. The name Pandas is derived from the econometrics term Panel Data. Pandas incorporates two additional data structures into Python, namely Pandas Series and Pandas DataFrame

# Series

A Pandas series is a one-dimensional array-like object that can hold many data types, such as numbers or strings.

# We import Pandas as pd into Python
import pandas as pd

# We create a Pandas Series that stores a grocery list
groceries = pd.Series(data = [30, 6, 'Yes', 'No'], index = ['eggs', 'apples', 'milk', 'bread'])

# We display the Groceries Pandas Series
groceries

# Series Indexing

input

print('Do we need milk and bread:\n', groceries[['milk', 'bread']]) 
print()

output

Do we need milk and bread:
milk       Yes
bread     No
dtype: object

# Dataframe

Pandas DataFrames are two-dimensional data structures with labeled rows and columns, that can hold many data types.

# Loading

# We import Pandas as pd into Python
import pandas as pd

# We create a dictionary of Pandas Series 
items = {'Bob' : pd.Series(data = [245, 25, 55], index = ['bike', 'pants', 'watch']),
         'Alice' : pd.Series(data = [40, 110, 500, 45], index = ['book', 'glasses', 'bike', 'pants'])}

# We print the type of items to see that it is a dictionary
print(type(items))

# We create a Pandas DataFrame by passing it a dictionary of Pandas Series
shopping_carts = pd.DataFrame(items)

# We display the DataFrame
shopping_carts

pd loading

na example

na count

na drop

na fill