0% found this document useful (0 votes)

4 views11 pages

Prac 2

The document outlines practical exercises in Business Analytics using Python libraries like NumPy, Pandas, and Matplotlib. It includes tasks such as creating and manipulating arrays, handling missing data in DataFrames, and visualizing data through various plots. Key operations include calculating statistics, filtering data, and comparing distributions with visualizations.

Uploaded by

asharathod1999

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

4 views11 pages

Prac 2

Uploaded by

asharathod1999

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 11

22SE02ML063 Business Analytics

Practical – 2
Create a NumPy array of shape (5, 5) with values ranging from 1 to 25. •
Perform the following operations: • Flatten the array into a 1D array. •
Calculate the mean, median, and standard deviation of the array. • Reshape
the array back into a 5x5 matrix and replace all values greater than 10 with 0.
import numpy as np

array_2d = np.arange(1, 26).reshape(5, 5)

array_flattened = array_2d.flatten()

mean_value = np.mean(array_flattened)
median_value = np.median(array_flattened)
std_deviation = np.std(array_flattened)

array_reshaped = array_flattened.reshape(5, 5)
array_reshaped[array_reshaped > 10] = 0

print("Original 2D Array:")
print(array_2d)
print("\nFlattened Array:")
print(array_flattened)
print("\nMean:", mean_value)
print("Median:", median_value)
print("Standard Deviation:", std_deviation)
print("\nModified 2D Array:")
print(array_reshaped)
Output:
22SE02ML063 Business Analytics

• Create two NumPy arrays: a 3x3 matrix of random integers between 1 and
10 and a 3x1 column vector of random integers between 1 and 5. • Perform
the following: o Multiply the matrix by the column vector. o Transpose the
resulting matrix. o Find the determinant of the original 3x3 matrix.
import numpy as np

matrix = np.random.randint(1, 11, size=(3, 3))

column_vector = np.random.randint(1, 6, size=(3, 1))

result_matrix = np.dot(matrix, column_vector)

transposed_matrix = result_matrix.T

determinant = np.linalg.det(matrix)

print("Original 3x3 Matrix:")

print(matrix)
print("\n3x1 Column Vector:")
print(column_vector)
print("\nResulting Matrix After Multiplication:")
print(result_matrix)
print("\nTransposed Matrix:")
print(transposed_matrix)
print("\nDeterminant of the Original Matrix:", determinant)
Output:
22SE02ML063 Business Analytics

Create a Pandas DataFrame with columns Name, Age, Height, and City with
the following data: • Perform the following tasks: o Display the first 3 rows of
the DataFrame. o Add a new column Weight with random values. o Filter the
rows where Age is greater than 25 and display only the Name and Height
columns
import numpy as np
import pandas as pd

data = {
"Name": ["Alice", "Bob", "Charlie", "David", "Eve"],
"Age": [23, 30, 35, 22, 28],
"Height": [5.5, 6.0, 5.8, 5.9, 5.7],
"City": ["New York", "Los Angeles", "Chicago", "Houston", "Phoenix"]
}
df = pd.DataFrame(data)

print("\nFirst 3 Rows of DataFrame:")

print(df.head(3))

df["Weight"] = np.random.randint(50, 101, size=len(df))

print("\nDataFrame with Weight Column:")
print(df)

filtered_df = df[df["Age"] > 25][["Name", "Height"]]

print("\nFiltered Rows (Age > 25):")
print(filtered_df)
Output:
22SE02ML063 Business Analytics

Create a DataFrame containing Name, Age, Salary columns with some missing
(NaN) values. • Fill the missing Age values with the mean value of the
column.• Drop any rows where Salary is missing
import numpy as np
import pandas as pd

data_with_nan = {
"Name": ["Frank", "Grace", "Hank", "Ivy", "Jack"],
"Age": [25, np.nan, 29, np.nan, 32],
"Salary": [50000, 60000, np.nan, 75000, 80000]
}
df_nan = pd.DataFrame(data_with_nan)

df_nan["Age"].fillna(df_nan["Age"].mean(), inplace=True)

df_nan.dropna(subset=["Salary"], inplace=True)

print("\nDataFrame with Missing Values Handled:")

print(df_nan)
Output:
22SE02ML063 Business Analytics

Create a line plot that represents the relationship between two lists x = [1, 2,
3, 4, 5] and y = [2, 4, 6, 8, 10]. • Label the x-axis as "X values" and the y-axis as
"Y values". • Add a title "Simple Line Plot".
import numpy as np
import pandas as pd
import matplotlib.pyplot as plt
x = [1, 2, 3, 4, 5]
y = [2, 4, 6, 8, 10]

plt.plot(x, y, marker='o')
plt.xlabel("X values")
plt.ylabel("Y values")
plt.title("Simple Line Plot")
plt.grid(True)
plt.show()
Output:
22SE02ML063 Business Analytics

Create a bar plot comparing the sales of different products in a store.

import numpy as np
import pandas as pd
import matplotlib.pyplot as plt

products = ["Product A", "Product B", "Product C", "Product D"]

sales = [250, 400, 300, 450]

plt.bar(products, sales, color=['blue', 'green', 'red', 'purple'])

plt.xlabel("Products")
plt.ylabel("Sales")
plt.title("Product Sales Comparison")
plt.show()

Output:
22SE02ML063 Business Analytics

Plot histograms for both total_bill and tip. Compare their distributions. •
Create overlapping histograms for total_bill for lunch and dinner times. What
differences do you notice? • Adjust the number of bins in the histogram to
50. How does it affect the visualization?
import numpy as np
import pandas as pd
import matplotlib.pyplot as plt
import seaborn as sns

data = sns.load_dataset('tips')

plt.hist(data['total_bill'], bins=50, alpha=0.7, label='Total Bill', color='blue')

plt.hist(data['tip'], bins=50, alpha=0.7, label='Tip', color='green')
plt.xlabel("Value")
plt.ylabel("Frequency")
plt.title("Histograms of Total Bill and Tip")
plt.legend()
plt.show()

lunch_data = data[data['time'] == 'Lunch']

dinner_data = data[data['time'] == 'Dinner']

plt.hist(lunch_data['total_bill'], bins=50, alpha=0.7, label='Lunch', color='orange')

plt.hist(dinner_data['total_bill'], bins=50, alpha=0.7, label='Dinner', color='purple')
plt.xlabel("Total Bill")
plt.ylabel("Frequency")
plt.title("Overlapping Histograms of Total Bill (Lunch vs Dinner)")
plt.legend()
plt.show()

Observation: Adjusting bins to 50 creates more granular insights into the distribution of
values.
Output:
22SE02ML063 Business Analytics
22SE02ML063 Business Analytics

Create a boxplot comparing tip amounts for smokers and non-smokers. What
trends can you identify? • Add a swarmplot over the boxplot (use
sns.swarmplot) for total_bill by day. Does it add any additional insights? •
Group the boxplot by sex and time (e.g., use hue='sex' and x='time') to see if
there are any differences in spending habits.
import numpy as np
import pandas as pd
import matplotlib.pyplot as plt
import seaborn as sns
data = sns.load_dataset('tips')

plt.figure(figsize=(8, 6))
sns.boxplot(x='smoker', y='tip', data=data)
plt.title("Boxplot of Tip Amounts for Smokers and Non-Smokers")
plt.xlabel("Smoker")
plt.ylabel("Tip Amount")
plt.show()

Observation: Boxplot reveals trends such as whether smokers tend to tip more or less than
non-smokers.

plt.figure(figsize=(10, 6))
sns.boxplot(x='day', y='total_bill', data=data, palette='Set2')
sns.swarmplot(x='day', y='total_bill', data=data, color='black', alpha=0.7)
plt.title("Boxplot with Swarmplot Overlay of Total Bill by Day")
plt.xlabel("Day")
plt.ylabel("Total Bill")
plt.show()

Observation: Swarmplot provides additional insights into individual data points and outliers.
22SE02ML063 Business Analytics

plt.figure(figsize=(10, 6))
sns.boxplot(x='time', y='total_bill', hue='sex', data=data, palette='coolwarm')
plt.title("Boxplot of Total Bill Grouped by Sex and Time")
plt.xlabel("Time")
plt.ylabel("Total Bill")
plt.legend(title="Sex")
plt.show()

Observation: Grouping by sex and time shows differences in spending habits between males
and females during lunch and dinner.
Output:
22SE02ML063 Business Analytics

Content Server 20.3 Administration Guide
No ratings yet
Content Server 20.3 Administration Guide
578 pages
C++ Programking Lab 01 and 02 Check - Sheet
No ratings yet
C++ Programking Lab 01 and 02 Check - Sheet
4 pages
Design and Construction of A Battery Level Indicator
No ratings yet
Design and Construction of A Battery Level Indicator
10 pages
Prac 2
No ratings yet
Prac 2
11 pages
Data Analytics
No ratings yet
Data Analytics
34 pages
Data Analysis Lab - Final - 23-24
No ratings yet
Data Analysis Lab - Final - 23-24
11 pages
NumPy and Pandas Step
No ratings yet
NumPy and Pandas Step
9 pages
Exp 8 - LM
No ratings yet
Exp 8 - LM
10 pages
Vanshika Goyal Gec Practicals
No ratings yet
Vanshika Goyal Gec Practicals
31 pages
DAV Guidelines
No ratings yet
DAV Guidelines
4 pages
End Semester Answer Key Format-Fods
No ratings yet
End Semester Answer Key Format-Fods
8 pages
23bet10114 Naman Gupta Assignment-1
No ratings yet
23bet10114 Naman Gupta Assignment-1
17 pages
FDS Record-1-4
No ratings yet
FDS Record-1-4
18 pages
Aids Lab
No ratings yet
Aids Lab
45 pages
3 - Pandas
No ratings yet
3 - Pandas
87 pages
Final Dev Record
No ratings yet
Final Dev Record
49 pages
PP DWDM 4 5
No ratings yet
PP DWDM 4 5
26 pages
Data Science
No ratings yet
Data Science
18 pages
Programming Notes 3
No ratings yet
Programming Notes 3
3 pages
Gec Practicals
No ratings yet
Gec Practicals
31 pages
Usage of NumPy For Numerical Data in Detail
No ratings yet
Usage of NumPy For Numerical Data in Detail
52 pages
3rd Semester DDM AI DAA DEV Print Pages For Spiral Record 25-1-24 - Removed
No ratings yet
3rd Semester DDM AI DAA DEV Print Pages For Spiral Record 25-1-24 - Removed
28 pages
DS3 1
No ratings yet
DS3 1
8 pages
Lab Record Dev
No ratings yet
Lab Record Dev
20 pages
DEV Lab Record
No ratings yet
DEV Lab Record
46 pages
Khadeeja - DS - PRACTICAL 4
No ratings yet
Khadeeja - DS - PRACTICAL 4
24 pages
GE02 (DAVP) Assignment
No ratings yet
GE02 (DAVP) Assignment
3 pages
Index
No ratings yet
Index
4 pages
Certificate
No ratings yet
Certificate
25 pages
L6 and 7-Data Preprocessing-Coding
No ratings yet
L6 and 7-Data Preprocessing-Coding
34 pages
Pandas NumPy Practice Questions
No ratings yet
Pandas NumPy Practice Questions
2 pages
Python CA2
No ratings yet
Python CA2
11 pages
Report
No ratings yet
Report
18 pages
Eda Code Snippets
No ratings yet
Eda Code Snippets
17 pages
Chapter 4 Data Mining
No ratings yet
Chapter 4 Data Mining
5 pages
Dev Record Aids
No ratings yet
Dev Record Aids
24 pages
FDS Lab
No ratings yet
FDS Lab
43 pages
Python Unit-5
No ratings yet
Python Unit-5
14 pages
Financial Analytics With Python
100% (1)
Financial Analytics With Python
40 pages
Prac 4
No ratings yet
Prac 4
3 pages
Eda Lab Manual
No ratings yet
Eda Lab Manual
34 pages
SET 1 Part A Marks, (
No ratings yet
SET 1 Part A Marks, (
10 pages
Exploratory Data Analysis-1
No ratings yet
Exploratory Data Analysis-1
10 pages
NumPy and Pandas
No ratings yet
NumPy and Pandas
12 pages
Data Science Sample
No ratings yet
Data Science Sample
5 pages
Data Science
No ratings yet
Data Science
42 pages
ML Lab Manual 2025-2
No ratings yet
ML Lab Manual 2025-2
35 pages
Journal
No ratings yet
Journal
48 pages
Experiment No: 1 Introduction To Data Analytics and Python Fundamentals Page-1/11
No ratings yet
Experiment No: 1 Introduction To Data Analytics and Python Fundamentals Page-1/11
8 pages
Python Notes by Prof T
No ratings yet
Python Notes by Prof T
10 pages
DAP Writeups - Merged
No ratings yet
DAP Writeups - Merged
33 pages
Fds QB
No ratings yet
Fds QB
6 pages
Data Exploration and Analysis With Python
No ratings yet
Data Exploration and Analysis With Python
9 pages
Ai Programs
No ratings yet
Ai Programs
22 pages
Data Exploration Preparation
No ratings yet
Data Exploration Preparation
12 pages
Important Questions With Solutions IP
No ratings yet
Important Questions With Solutions IP
5 pages
23HCS4142 PDF
No ratings yet
23HCS4142 PDF
24 pages
Datascience
No ratings yet
Datascience
26 pages
Unit 5
No ratings yet
Unit 5
28 pages
Eda Lab Assignment2
No ratings yet
Eda Lab Assignment2
10 pages
Fds Merged
No ratings yet
Fds Merged
102 pages
Profound Python Data Science
From Everand
Profound Python Data Science
Onder Teker
No ratings yet
Python For Beginners
From Everand
Python For Beginners
Célio Azevedo
No ratings yet
اكواد لغة سي.... جاهز نماذج اختبارات...
No ratings yet
اكواد لغة سي.... جاهز نماذج اختبارات...
9 pages
CCS0007 - Laboratory Exercise 3
No ratings yet
CCS0007 - Laboratory Exercise 3
17 pages
Fully Automatic Hot Foil Stamping Machine
No ratings yet
Fully Automatic Hot Foil Stamping Machine
4 pages
BPF Template File
No ratings yet
BPF Template File
34 pages
Rekabentuk Dan Analisis Produk
No ratings yet
Rekabentuk Dan Analisis Produk
7 pages
Disaster Recovery Using Alwayson Availability Group - Scenario 1
No ratings yet
Disaster Recovery Using Alwayson Availability Group - Scenario 1
34 pages
Introduction To ROC Analysis: Pattern Recognition Letters June 2006
No ratings yet
Introduction To ROC Analysis: Pattern Recognition Letters June 2006
16 pages
Limooezekii Report 7
No ratings yet
Limooezekii Report 7
17 pages
FTDI Driver Uninstall With 2-12-28 Install
No ratings yet
FTDI Driver Uninstall With 2-12-28 Install
7 pages
DSB For R PDF
No ratings yet
DSB For R PDF
6 pages
Resume Professional Aafridah Software Engineer
No ratings yet
Resume Professional Aafridah Software Engineer
4 pages
Stata Finite Mixture Models Reference Manual: Release 16
No ratings yet
Stata Finite Mixture Models Reference Manual: Release 16
138 pages
ROV Umbilical Winch 20210111 1S Rev 2 OM 4100 A3 4 180 190 FS NZ
No ratings yet
ROV Umbilical Winch 20210111 1S Rev 2 OM 4100 A3 4 180 190 FS NZ
7 pages
Ffu 0001114 01
No ratings yet
Ffu 0001114 01
27 pages
MSBTE Solution App-2
No ratings yet
MSBTE Solution App-2
4 pages
Deloitte PPT-Devang
No ratings yet
Deloitte PPT-Devang
7 pages
Media Factsheet - JTC Wis and Gaussian Robotics Collaborate To Develop Singapores First Fully Autonomous Cleaning Solution
No ratings yet
Media Factsheet - JTC Wis and Gaussian Robotics Collaborate To Develop Singapores First Fully Autonomous Cleaning Solution
5 pages
Object Oriented Software Engineering Using UML Patterns and Java 3rd Edition by Bernd Bruegge, Allen H Dutoit ISBN 0133002098 9780133002096
100% (12)
Object Oriented Software Engineering Using UML Patterns and Java 3rd Edition by Bernd Bruegge, Allen H Dutoit ISBN 0133002098 9780133002096
76 pages
Appendix C - Simulink Refresher
No ratings yet
Appendix C - Simulink Refresher
27 pages
Background of The Study: of The Digital Payment System On Financial Inclusion in The Philippines
No ratings yet
Background of The Study: of The Digital Payment System On Financial Inclusion in The Philippines
44 pages
Epq96 2 Data Sheet 4921240364 Uk
No ratings yet
Epq96 2 Data Sheet 4921240364 Uk
8 pages
Mark VI Turbine Controls GE - AddingIO - Doc 1 ADDING NEW INPUTS/OUPUTS
No ratings yet
Mark VI Turbine Controls GE - AddingIO - Doc 1 ADDING NEW INPUTS/OUPUTS
29 pages
UM S7 Product Data Sheet
No ratings yet
UM S7 Product Data Sheet
2 pages
Simulation of Five-Level Five-Phase SVPWM Voltage Source Inverter PDF
No ratings yet
Simulation of Five-Level Five-Phase SVPWM Voltage Source Inverter PDF
5 pages
CSS 10 QUARTER 2 Module 1
No ratings yet
CSS 10 QUARTER 2 Module 1
27 pages
Result and Discussion Table 3 Demographic Profile of Teachers Profile F P
No ratings yet
Result and Discussion Table 3 Demographic Profile of Teachers Profile F P
2 pages
Lec6. Operator Overload
No ratings yet
Lec6. Operator Overload
28 pages

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

Prac 2

Uploaded by

Prac 2

Uploaded by

22SE02ML063 Business Analytics

array_2d = np.arange(1, 26).reshape(5, 5)

matrix = np.random.randint(1, 11, size=(3, 3))

column_vector = np.random.randint(1, 6, size=(3, 1))

result_matrix = np.dot(matrix, column_vector)

print("Original 3x3 Matrix:")

print("\nFirst 3 Rows of DataFrame:")

df["Weight"] = np.random.randint(50, 101, size=len(df))

filtered_df = df[df["Age"] > 25][["Name", "Height"]]

print("\nDataFrame with Missing Values Handled:")

Create a bar plot comparing the sales of different products in a store.

products = ["Product A", "Product B", "Product C", "Product D"]

plt.bar(products, sales, color=['blue', 'green', 'red', 'purple'])

plt.hist(data['total_bill'], bins=50, alpha=0.7, label='Total Bill', color='blue')

lunch_data = data[data['time'] == 'Lunch']

plt.hist(lunch_data['total_bill'], bins=50, alpha=0.7, label='Lunch', color='orange')

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.