Pandas Tutorial
Creating Objects
Viewing Data
Selection
Manipulating Data
Grouping Data
Merging, Joining and Concatenating
Working with Date and Time
Working With Text Data
Working with CSV and Excel files
Operations
Visualization
Applications and Projects
Let's go through a tutorial on the pandas
Series.
The pandas Series is a one-dimensional labeled array. It can hold data of any type (integer, string, float, python objects, etc.).
Ensure you have pandas installed:
pip install pandas
import pandas as pd
From a list:
s = pd.Series([1, 2, 3, 4, 5]) print(s)
From a Dictionary:
dict_data = {'a': 1, 'b': 2, 'c': 3} s = pd.Series(dict_data) print(s)
With an index defined:
s = pd.Series([1, 2, 3, 4], index=['a', 'b', 'c', 'd']) print(s)
Using the Index:
print(s['a'])
Using integer location:
print(s[0])
Addition:
s2 = pd.Series([1, 2, 3, 4], index=['a', 'b', 'c', 'd']) result = s + s2 print(result)
Boolean Selection:
print(s[s > 2])
Checking for null values:
print(s.isnull())
Getting the unique values:
s3 = pd.Series([1, 2, 2, 3, 3, 4]) print(s3.unique())
Value counts:
print(s3.value_counts())
Setting a name for the index:
s.index.name = 'letters' print(s)
Changing index values:
s.index = ['x', 'y', 'z', 'w'] print(s)
The pandas Series provides a versatile and efficient way to work with one-dimensional data structures in Python. It integrates well with many of the other pandas structures and offers a wide range of methods for data manipulation, making it an essential tool for data analysis in Python.
Creating a Pandas Series in Python:
import pandas as pd data = [1, 2, 3, 4, 5] series = pd.Series(data)
Indexing and selecting data in Pandas Series:
# Using index label value = series['label'] # Using positional index value = series.iloc[2]
Operations on Pandas Series:
result = series1 + series2
Working with missing data in Pandas Series:
dropna()
or fillna()
.series = series.dropna()
Sorting a Pandas Series:
sorted_series = series.sort_values(ascending=False)
Filtering and subsetting in Pandas Series:
subset = series[series > 3]
Converting a dictionary to Pandas Series:
data_dict = {'a': 1, 'b': 2, 'c': 3} series = pd.Series(data_dict)
Mathematical operations on Pandas Series:
result = series * 2
Visualization with Pandas Series:
series.plot(kind='bar')