0% found this document useful (0 votes)

13 views4 pages

Lab Session 07: Perform Following Operations Using Pandas

The document outlines a laboratory session for a Data Science course focusing on operations using Pandas, including handling NaN values, sorting, and grouping data. It includes pre-lab and post-lab tasks with example code snippets demonstrating how to fill NaN values, sort DataFrames, and group data for analysis. The lab aims to provide practical experience in data manipulation using Python and Pandas.

Uploaded by

nagachaitanyaprathipati97

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

13 views4 pages

Lab Session 07: Perform Following Operations Using Pandas

Uploaded by

nagachaitanyaprathipati97

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 4

REGD. NO.

238W1A5464 DATA SCIENCE USING PYTHON LABORATORY-23AI&DS4354 ACADEMIC YEAR: 2024-2025

Lab Session 07: Perform following operations using pandas

Date of the Session: 17/02/2025 Time of the Session:12:30AM to 1:00PM

Pre-Lab Task: Write answers before entering into lab.

1. What does NaN stand for in Pandas, and why do missing values occur in a dataset?
A. NaN stands for Not a Number in Pandas. It is used to represent missing or undefined values in a dataset.
Missing values can occur due to various reasons:
a.Data collection errors (e.g., missing fields in a survey)
b.Data entry errors (e.g., missing values in a database)
c.Absence of data (e.g., a product or customer may not have a value for a certain attribute)
d.Merging datasets where some values do not match.

2. How can we fill NaN values in a Pandas DataFrame with a specific string?
A. youcan use the fillna() function to replace NaN values with a specific string:
import pandas as pd
data = {'Name': ['Alice', 'Bob', 'Charlie', 'David'],
'City': ['New York', None, 'Chicago', None]}
df = pd.DataFrame(data)
df_filled = df.fillna("Unknown")
print (df_filled)

3. What is the purpose of the sort_values() function in Pandas?

A. The sort_values() function is used to sort a DataFrame by one or more columns in either ascending or
descending order. It helps in organizing the data to make it easier to analyze
df = pd.DataFrame({'Name': ['Alice', 'Bob', 'Charlie', 'David'],
'Age': [24, 30, 35, 40]})
df_sorted = df.sort_values(by='Age')
print(df_sorted)

4. How does the groupby() function work, and when should it be used?
A. The groupby() function in Pandas is used to group data based on one or more columns and then apply an
aggregate function (like sum, mean, count, etc.) on each group.
 Usage: It is used when you want to analyze subsets of data and perform aggregate calculations on
these subsets.

5. Can you explain a real-world scenario where sorting and grouping data is essential?
A. Scenario: In a sales report analysis, sorting and grouping data is essential for understanding performance
across different product categories or regions.
 Sorting: To find the top-selling products or regions, you can sort the sales data by the total revenue
in descending order. This helps identify high-performers at a glance.
 Grouping: To calculate total revenue for each region or category, you can group the sales data by
region or product category and then calculate the sum of sales. This helps compare the performance
of different regions or categories.

LAB No. 07 VELAGAPUDI RAMAKRISHNA SIDDHARTHA ENGINEERING COLLEGE Page |

REGD. NO.238W1A5464 DATA SCIENCE USING PYTHON LABORATORY-23AI&DS4354 ACADEMIC YEAR: 2024-2025

In Lab Task:

a. Filling NaN with string

Code:
import pandas as pd
data = {'Name': ['Alice', 'Bob', 'Charlie', 'David'],
'City': ['New York', None, 'Chicago', None]}
df = pd.DataFrame(data)
df_filled = df.fillna("Unknown")
print(df_filled)

Ourtput:
Name City
0 Alice New York
1 Bob Unknown
2 Charlie Chicago
3 David Unknown

b. Sorting based on column values

Code:
data = {'Name': ['Alice', 'Bob', 'Charlie', 'David'],
'Age': [24, 30, 35, 40]}
df = pd.DataFrame(data)
df_sorted = df.sort_values(by='Age', ascending=True)
print(df_sorted)

Output:
Name Age
0 Alice 24
1 Bob 30
2 Charlie 35
3 David 40

c. groupby()

Code:
data = {'Category': ['A', 'B', 'A', 'B'],
'Value': [10, 20, 30, 40]}
df = pd.DataFrame(data)
grouped_df = df.groupby('Category')['Value'].sum()
print(grouped_df)

Output:
Category
A 40
B 60
Name: Value, dtype: int64

LAB No. 07 VELAGAPUDI RAMAKRISHNA SIDDHARTHA ENGINEERING COLLEGE Page |

REGD. NO.238W1A5464 DATA SCIENCE USING PYTHON LABORATORY-23AI&DS4354 ACADEMIC YEAR: 2024-2025

Post Lab Task:

a. Write a Python code snippet to fill all NaN values in a DataFrame with the string "Missing".
A. import pandas as pd
data = {'Name': ['Alice', 'Bob', 'Charlie', 'David'],
'City': ['New York', None, 'Chicago', None],
'Age': [24, None, 35, None]}
df = pd.DataFrame(data)
df_filled = df.fillna("Missing")
print(df_filled)
Output:
Name City Age
0 Alice New York 24
1 Bob Missing Missing
2 Charlie Chicago 35
3 David Missing Missing

b. Given a DataFrame with a "Salary" column, how would you sort it in descending order?
A. Code:
data = {'Name': ['Alice', 'Bob', 'Charlie', 'David'],
'Salary': [50000, 60000, 70000, 55000]}
df = pd.DataFrame(data)
df_sorted = df.sort_values(by='Salary', ascending=False)
print(df_sorted)
Output:
Name Salary
2 Charlie 70000
1 Bob 60000
3 David 55000
0 Alice 50000

c. How can you group a DataFrame by a "Department" column and calculate the average salary for each
department?
A. Code:
data = {'Department': ['HR', 'IT', 'HR', 'IT'],
'Salary': [50000, 60000, 55000, 70000]}
df = pd.DataFrame(data)
grouped_df = df.groupby('Department')['Salary'].mean()
print(grouped_df)
Output:
Department
HR 52500.0
IT 65000.0
Name: Salary, dtype: float64

d. What happens when you use multiple columns in groupby()? Provide an example scenario.
A. When using multiple columns in groupby(), the DataFrame is grouped by the unique combinations of
values from those columns.
Example scenario: You have a dataset of employees and want to calculate the average salary by both
"Department" and "Gender".

LAB No. 07 VELAGAPUDI RAMAKRISHNA SIDDHARTHA ENGINEERING COLLEGE Page |

REGD. NO.238W1A5464 DATA SCIENCE USING PYTHON LABORATORY-23AI&DS4354 ACADEMIC YEAR: 2024-2025

e. How would you handle a dataset where multiple columns contain NaN values and need different
replacement strategies?
A. You can use the fillna() method with a dictionary, where each column has a different strategy for
replacing NaN values
Code:
data = {'Name': ['Alice', 'Bob', 'Charlie', 'David'],
'Age': [24, None, 35, None],
'City': [None, 'Los Angeles', 'Chicago', None]}
df = pd.DataFrame(data)
replacement_values = {'Age': 30, 'City': 'Unknown'}
df_filled = df.fillna(replacement_values)
print(df_filled)
output:
Name Age City
0 Alice 24.0 Unknown
1 Bob 30.0 Los Angeles
2 Charlie 35.0 Chicago
3 David 30.0 Unknown

Students Signature

(For Evaluator’s use only)

Comment of the Evaluator (if Any) Evaluator’s Observation

Marks Secured:_ out of __

Signature of the Evaluator with Date:

LAB No. 07 VELAGAPUDI RAMAKRISHNA SIDDHARTHA ENGINEERING COLLEGE Page |

AI Practical 2025
No ratings yet
AI Practical 2025
14 pages
Chapter 2 - Python Pandas II
No ratings yet
Chapter 2 - Python Pandas II
71 pages
Python Interviews
No ratings yet
Python Interviews
154 pages
Informatic Practices HHW
No ratings yet
Informatic Practices HHW
59 pages
CO3 - 1 - Pandas Series and Data Frame
No ratings yet
CO3 - 1 - Pandas Series and Data Frame
37 pages
Python MCQs
No ratings yet
Python MCQs
21 pages
Adobe Scan 11-Jul-2025
No ratings yet
Adobe Scan 11-Jul-2025
8 pages
Informatic Practices HHW
No ratings yet
Informatic Practices HHW
21 pages
Chapter 2 Python Pandas
No ratings yet
Chapter 2 Python Pandas
8 pages
04-Data Manipulation With Pandas
No ratings yet
04-Data Manipulation With Pandas
28 pages
Pandas Introduction: What Is Python Pandas Used For?
No ratings yet
Pandas Introduction: What Is Python Pandas Used For?
28 pages
Unit 1 Python Pandas
No ratings yet
Unit 1 Python Pandas
20 pages
DXE 24gksmknvj
No ratings yet
DXE 24gksmknvj
16 pages
12 Information Practices Text Book Preeti Arora
No ratings yet
12 Information Practices Text Book Preeti Arora
45 pages
Unit 5 Python
No ratings yet
Unit 5 Python
30 pages
12 Pandas
100% (1)
12 Pandas
21 pages
QP Xii Ip Hy 2023-24
No ratings yet
QP Xii Ip Hy 2023-24
9 pages
Pandas & Vis 2
No ratings yet
Pandas & Vis 2
11 pages
Python Pandas - 2 2020-21
No ratings yet
Python Pandas - 2 2020-21
21 pages
DAV Previous Year
No ratings yet
DAV Previous Year
7 pages
Python Notes by Prof T
No ratings yet
Python Notes by Prof T
10 pages
Top Python Questions 1735201448
No ratings yet
Top Python Questions 1735201448
25 pages
More Practice Questions For DataFrame
No ratings yet
More Practice Questions For DataFrame
9 pages
Create A Pandas Series From A Dictionary of Values and An Ndarray
No ratings yet
Create A Pandas Series From A Dictionary of Values and An Ndarray
15 pages
Pandas 2 Complete Notes Class XII
No ratings yet
Pandas 2 Complete Notes Class XII
18 pages
Pandas1 Q&ans
No ratings yet
Pandas1 Q&ans
14 pages
Lab Session 06: Perform Following Operations Using Pandas
No ratings yet
Lab Session 06: Perform Following Operations Using Pandas
5 pages
Murali Internship
No ratings yet
Murali Internship
34 pages
BQ1031 Exercises
No ratings yet
BQ1031 Exercises
90 pages
Chapter-2 Python Pandas
100% (2)
Chapter-2 Python Pandas
33 pages
ML Lab Manual Final
No ratings yet
ML Lab Manual Final
36 pages
Unit 4 Fod
100% (1)
Unit 4 Fod
21 pages
Ip - Capsule
No ratings yet
Ip - Capsule
17 pages
Docmine: Spare Parts Catalog
No ratings yet
Docmine: Spare Parts Catalog
83 pages
28 03 2024 Sample Paper Grade 12 Informatics Practices 2023 24
No ratings yet
28 03 2024 Sample Paper Grade 12 Informatics Practices 2023 24
8 pages
Info Practical
No ratings yet
Info Practical
56 pages
Chai
No ratings yet
Chai
5 pages
QP DAV 3rd Sem Dec 2023
No ratings yet
QP DAV 3rd Sem Dec 2023
12 pages
PYTHON PROGRAMMING: Data Handling
No ratings yet
PYTHON PROGRAMMING: Data Handling
12 pages
12 IP Dataframe and Pyplot Notes
No ratings yet
12 IP Dataframe and Pyplot Notes
14 pages
Exam Lo1 Electrical Circuit Protection
No ratings yet
Exam Lo1 Electrical Circuit Protection
1 page
Python 1st 10
No ratings yet
Python 1st 10
11 pages
Pandas Questions
No ratings yet
Pandas Questions
11 pages
Pyq Solution
No ratings yet
Pyq Solution
12 pages
Data Frame 100 Questions
No ratings yet
Data Frame 100 Questions
16 pages
Python-for-Data-Analysis (Pandas
No ratings yet
Python-for-Data-Analysis (Pandas
31 pages
Questions Practical File
No ratings yet
Questions Practical File
13 pages
Python Data Science 101
100% (1)
Python Data Science 101
41 pages
Code Explanation For Date Types
No ratings yet
Code Explanation For Date Types
8 pages
Python ClassXII AI
No ratings yet
Python ClassXII AI
4 pages
101 Onwards On Python Pandas and Pyplot
No ratings yet
101 Onwards On Python Pandas and Pyplot
33 pages
Minimum Level Pandas Skill Based Questions
No ratings yet
Minimum Level Pandas Skill Based Questions
8 pages
IP - Pandas 1 & 2 (Worksheet) Class 12
No ratings yet
IP - Pandas 1 & 2 (Worksheet) Class 12
16 pages
IP Imp Notes
No ratings yet
IP Imp Notes
5 pages
Worksheet - Pandas
100% (1)
Worksheet - Pandas
16 pages
Introduction-to-TikTok-Shop-Affiliate-Program 2
No ratings yet
Introduction-to-TikTok-Shop-Affiliate-Program 2
10 pages
CCM 303 Topic 8 PPT Gender and Communication in The Media PDF
No ratings yet
CCM 303 Topic 8 PPT Gender and Communication in The Media PDF
23 pages
7 Days Analytics Course 3feiz7 4
No ratings yet
7 Days Analytics Course 3feiz7 4
8 pages
Lesson 3: Surface Creation
No ratings yet
Lesson 3: Surface Creation
86 pages
DATAFRAME
No ratings yet
DATAFRAME
4 pages
Man Cruise
No ratings yet
Man Cruise
73 pages
LL
No ratings yet
LL
5 pages
21-Economics-2017 (Tamil) - Final - 1693223768823
No ratings yet
21-Economics-2017 (Tamil) - Final - 1693223768823
74 pages
Cost & Management Accounting
No ratings yet
Cost & Management Accounting
3 pages
Aug 1-27 Final
No ratings yet
Aug 1-27 Final
90 pages
Pandas
No ratings yet
Pandas
5 pages
MCQ On Dataframe
No ratings yet
MCQ On Dataframe
11 pages
Science: Junior Cycle Final Examination Sample Paper A Solutions
No ratings yet
Science: Junior Cycle Final Examination Sample Paper A Solutions
10 pages
DF Ques1
No ratings yet
DF Ques1
2 pages
Robotic Gripper Using Four Bar Mechanism
No ratings yet
Robotic Gripper Using Four Bar Mechanism
54 pages
Distribution and Habitat Association of Somali Ostrich in Samburu, Kenya
No ratings yet
Distribution and Habitat Association of Somali Ostrich in Samburu, Kenya
9 pages
Fuzzy Logic To Controlled Signal System
No ratings yet
Fuzzy Logic To Controlled Signal System
10 pages
Letter of Invitation SGC
No ratings yet
Letter of Invitation SGC
7 pages
Guidance Mandatory Competence Attainment Report (v7) Final 04072012
No ratings yet
Guidance Mandatory Competence Attainment Report (v7) Final 04072012
8 pages
Guiding Principle:: Title: Training Guide For Dcws On Self Help Assessment
No ratings yet
Guiding Principle:: Title: Training Guide For Dcws On Self Help Assessment
33 pages
Shahzad 2014
No ratings yet
Shahzad 2014
21 pages
Expt. No. 2 - Basic Operational Amplifier Circuit PDF
No ratings yet
Expt. No. 2 - Basic Operational Amplifier Circuit PDF
2 pages
Displacement and Acceleration C Programming
No ratings yet
Displacement and Acceleration C Programming
11 pages
Inner Ring
No ratings yet
Inner Ring
16 pages
DxDiag Requisitos
No ratings yet
DxDiag Requisitos
30 pages
Econometrics Problem Set
No ratings yet
Econometrics Problem Set
5 pages
Experiment-2 RLC Circuit
No ratings yet
Experiment-2 RLC Circuit
6 pages
Data Class Nist SP 1800 39a Preliminary Draft
No ratings yet
Data Class Nist SP 1800 39a Preliminary Draft
4 pages
Vertic
No ratings yet
Vertic
4 pages
DCP Exam Datesheet
No ratings yet
DCP Exam Datesheet
15 pages
List of Government Colleges Affiliated To The University of Jammu (ACADEMIC SESSION 2020-21)
No ratings yet
List of Government Colleges Affiliated To The University of Jammu (ACADEMIC SESSION 2020-21)
9 pages
Project 619839 EPP 1 2020 1 FI EPPKA1 JMD MOB
No ratings yet
Project 619839 EPP 1 2020 1 FI EPPKA1 JMD MOB
2 pages
Sheets, Preset Names and Formatting Descriptions For Different Elements (Such As "Chapter Title" or
No ratings yet
Sheets, Preset Names and Formatting Descriptions For Different Elements (Such As "Chapter Title" or
2 pages

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

Lab Session 07: Perform Following Operations Using Pandas

Uploaded by

Lab Session 07: Perform Following Operations Using Pandas

Uploaded by

REGD. NO.

238W1A5464 DATA SCIENCE USING PYTHON LABORATORY-23AI&DS4354 ACADEMIC YEAR: 2024-2025

Lab Session 07: Perform following operations using pandas

Date of the Session: 17/02/2025 Time of the Session:12:30AM to 1:00PM

Pre-Lab Task: Write answers before entering into lab.

3. What is the purpose of the sort_values() function in Pandas?

LAB No. 07 VELAGAPUDI RAMAKRISHNA SIDDHARTHA ENGINEERING COLLEGE Page |

a. Filling NaN with string

b. Sorting based on column values

LAB No. 07 VELAGAPUDI RAMAKRISHNA SIDDHARTHA ENGINEERING COLLEGE Page |

Post Lab Task:

LAB No. 07 VELAGAPUDI RAMAKRISHNA SIDDHARTHA ENGINEERING COLLEGE Page |

(For Evaluator’s use only)

Marks Secured:_ out of __

Signature of the Evaluator with Date:

LAB No. 07 VELAGAPUDI RAMAKRISHNA SIDDHARTHA ENGINEERING COLLEGE Page |

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Lab Session 07: Perform Following Operations Using Pandas

Uploaded by

Lab Session 07: Perform Following Operations Using Pandas

Uploaded by

REGD. NO.

238W1A5464 DATA SCIENCE USING PYTHON LABORATORY-23AI&DS4354 ACADEMIC YEAR: 2024-2025

Lab Session 07: Perform following operations using pandas

Date of the Session: 17/02/2025 Time of the Session:12:30AM to 1:00PM

Pre-Lab Task: Write answers before entering into lab.

3. What is the purpose of the sort_values() function in Pandas?

LAB No. 07 VELAGAPUDI RAMAKRISHNA SIDDHARTHA ENGINEERING COLLEGE Page |

a. Filling NaN with string

b. Sorting based on column values

LAB No. 07 VELAGAPUDI RAMAKRISHNA SIDDHARTHA ENGINEERING COLLEGE Page |

Post Lab Task:

LAB No. 07 VELAGAPUDI RAMAKRISHNA SIDDHARTHA ENGINEERING COLLEGE Page |

(For Evaluator’s use only)

Marks Secured:_______ out of ________

Signature of the Evaluator with Date:

LAB No. 07 VELAGAPUDI RAMAKRISHNA SIDDHARTHA ENGINEERING COLLEGE Page |

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Marks Secured:_ out of __