Act 7.2
Act 7.2
Huelva
BSIT 3-1
Activity #7 PanDaS
Importing panda
import pandas as pd
Series
- similar to the use of np.array() in NumPy
series1 = pd.Series([9,3,1,7,8,5])
series1
0 9
1 3
2 1
3 7
4 8
5 5
dtype: int64
# index object of the Series via its values and index attributes,
respectively.
series1.index
series2.values
series2.index
a 0
b -12
c -3
d -9
e -9
f -9
dtype: int64
a 0
b 16
c 1
d 9
e 9
f 9
dtype: int64
-3
a 0
d -3
f -3
dtype: int64
a 0
b -4
c -1
d -3
e -3
f -3
dtype: int64
Splicing
# assign value 0 to index 'a'
series2["c":"f"]
c -1
d -3
e -3
f -3
dtype: int64
Series Assignments
# assign value 0 to index 'a'
series2["a"] = 0
series2
a 0
b -4
c -1
d -3
e -3
f -3
dtype: int64
series2["d":"f"] = -3
series2
a 0
b -4
c -1
d -3
e -3
f -3
dtype: int64
a 0
b -4
c -1
d -3
e -3
f -3
dtype: int64
Using dictionaries to create Series
age_dict={"Ellen":27, "Charlie":18, "Ana":20, "Ben":24, "Dina":29}
age = pd.Series(age_dict)
age
Ellen 27
Charlie 18
Ana 20
Ben 24
Dina 29
dtype: int64
27
Charlie 18
Ana 20
Ben 24
dtype: int64
Charlie 18
Ana 20
Ben 24
dtype: int64
First Names
Ellen 27
Charlie 18
Ana 20
Ben 24
Dina 29
Name: Age, dtype: int64
DataFrame
some_dict = {'a':[0,1,2], 'b':[3,4,5]}
series3 = pd.DataFrame(some_dict)
series3
a b
0 0 3
1 1 4
2 2 5
First Names
Ellen 27
Charlie 18
Ana 20
Ben 24
Dina 29
Name: Age, dtype: int64
Ellen Tarlac
Charlie Cebu
Ana Pampanga
Ben Davao
Dina Cebu
dtype: object
age province
Ellen 27 Tarlac
Charlie 18 Cebu
Ana 20 Pampanga
Ben 24 Davao
Dina 29 Cebu
Ellen 27
Charlie 18
Ana 20
Ben 24
Dina 29
Name: age, dtype: int64
people.province
Ellen Tarlac
Charlie Cebu
Ana Pampanga
Ben Davao
Dina Cebu
Name: province, dtype: object
age 27
province Tarlac
Name: Ellen, dtype: object
Array of values
people['is_married'] = [True, False, False, True, True]
people
Deleting a column/row
people.drop(columns="debt")
people.drop(index=["Regine","Ellen"], inplace=True)
people
Sorting in a DataFrame
people.sort_values(by='province')
people.sort_index(axis=1, ascending=False)
age debt
count 4.000000 4.0
mean 22.750000 0.0
std 4.856267 0.0
min 18.000000 0.0
25% 19.500000 0.0
50% 22.000000 0.0
75% 25.250000 0.0
max 29.000000 0.0