Axe Submission

Uploaded by

Sagar veerala


Python Developer Initial Submission

Question:

Create a Python script or workflow that automates the analysis of customer data
(e.g., purchase history, browsing behavior) to identify trends and segment customers
for targeted marketing campaigns. What data processing and visualization tools would
you use? Please include a code snippet or pseudocode.

Libraries:

1. Pandas for data manipulation
2. NumPy for numerical computations
3. Matplotlib and Seaborn for visualization
4. Scikit-learn for clustering and segmentation
5. SciPy for statistical analysis

Tools:

1. Jupyter Notebook or Python script for data analysis
2. Pandas, NumPy, Matplotlib, Seaborn, Scikit-learn, and SciPy for data processing
and visualization
3. CSV file for data storage

Visualization:

1. Scatter plots for cluster visualization
2. Box plots for demographic data analysis
3. Heatmaps for correlation analysis
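
The script in this submission draws the scatter plot and the box plot but not the heatmap. A minimal sketch of one is given here, assuming the behavior columns have already been converted to numbers and that an age column is available. The sample values are made up purely for illustration.

```python
import matplotlib
matplotlib.use("Agg")  # non-interactive backend so the sketch runs without a display
import matplotlib.pyplot as plt
import pandas as pd
import seaborn as sns

# Made-up numeric features, for illustration only.
features = pd.DataFrame({
    "purchase_history": [1, 3, 5, 7, 9, 11],
    "browsing_behavior": [2, 4, 5, 9, 10, 12],
    "age": [52, 47, 40, 33, 29, 21],
})

# Pairwise Pearson correlations between the numeric columns.
corr = features.corr()

# Fix the color scale to [-1, 1] so that 0 always maps to the neutral color.
ax = sns.heatmap(corr, annot=True, vmin=-1, vmax=1, cmap="coolwarm")
ax.set_title("Feature Correlations")
plt.tight_layout()
# In a notebook or interactive session, drop the Agg line and call plt.show().
```

Strongly correlated features carry overlapping information, which is worth knowing before feeding them to a distance-based method such as KMeans.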

Script:

import json

import pandas as pd
import numpy as np
import matplotlib.pyplot as plt
import seaborn as sns
from sklearn.cluster import KMeans
from sklearn.preprocessing import StandardScaler

# Load the customer data.
df = pd.read_csv('customer_data.csv')

# Reduce each history string to a number by counting its commas (assumes both
# columns hold comma-separated lists; missing values become 0).
df['purchase_history'] = df['purchase_history'].str.count(',').fillna(0)
df['browsing_behavior'] = df['browsing_behavior'].str.count(',').fillna(0)

# Standardize both features so neither dominates the distance calculation.
scaler = StandardScaler()
df[['purchase_history', 'browsing_behavior']] = scaler.fit_transform(
    df[['purchase_history', 'browsing_behavior']])

# Group customers into 5 clusters.
kmeans = KMeans(n_clusters=5)
df['cluster'] = kmeans.fit_predict(df[['purchase_history', 'browsing_behavior']])

sns.scatterplot(x='purchase_history', y='browsing_behavior', hue='cluster', data=df)
plt.title('Customer Clusters')
plt.show()

# Expand demographic_data into one column per field (assumes each cell is a
# JSON object string containing an 'age' key, e.g. {"age": 34}).
demographic_df = pd.DataFrame(
    df['demographic_data'].apply(json.loads).tolist(), index=df.index)
sns.boxplot(x='age', data=demographic_df)
plt.title('Age Distribution')
plt.show()

# Summarize the expanded demographic fields for each cluster.
segments = []
for cluster in sorted(df['cluster'].unique()):
    in_cluster = df['cluster'] == cluster
    segments.append({'cluster': cluster,
                     'demographics': demographic_df[in_cluster].describe()})

for segment in segments:
    print(f"Cluster {segment['cluster']}:")
    print(segment['demographics'])

Explanation:

Step 1: Load Data

df = pd.read_csv('customer_data.csv')
- Loads customer data from a CSV file named customer_data.csv into a Pandas
DataFrame (df).
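
The script never shows what customer_data.csv looks like. The sketch below spells out one layout that would be compatible with it. The customer_id column, the sample rows, and the JSON encoding of demographic_data are all assumptions made for illustration, not details from a real file.

```python
import io

import pandas as pd

# Hypothetical CSV contents; the layout is an assumption, not a real data file.
csv_text = '''customer_id,purchase_history,browsing_behavior,demographic_data
1,"shoes,hat,belt","home,shoes,cart,checkout","{""age"": 34}"
2,"book","home,books","{""age"": 51}"
'''

df = pd.read_csv(io.StringIO(csv_text))

# The list-like columns arrive as plain strings and must be turned into numbers
# before scaling. Counting commas and adding 1 gives the number of items.
item_counts = df["purchase_history"].str.count(",") + 1
print(item_counts.tolist())
```

Note that the script itself uses the raw comma count (items minus one). After standardization the two choices give the same clusters, because they differ only by a constant shift.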

Step 2: Scale Data

scaler = StandardScaler()
df[['purchase_history', 'browsing_behavior']] = scaler.fit_transform(
    df[['purchase_history', 'browsing_behavior']])
- Creates a StandardScaler object (scaler) to standardize the data.
- Selects the purchase_history and browsing_behavior columns from the DataFrame
(df). By this point they hold the comma counts computed right after loading.
- Applies the scaler to these columns using fit_transform(), which:
  - Subtracts each column's mean from its values.
  - Divides by the column's standard deviation, so each column ends up with
    mean 0 and standard deviation 1.
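
To make the two sub-steps concrete, the following sketch checks that StandardScaler matches the hand-written formula. The input numbers are made up for illustration.

```python
import numpy as np
from sklearn.preprocessing import StandardScaler

# Made-up counts for illustration: one row per customer,
# columns are purchase count and browsing count.
X = np.array([[2.0, 10.0], [4.0, 30.0], [6.0, 20.0], [8.0, 40.0]])

scaled = StandardScaler().fit_transform(X)

# The same transformation by hand. StandardScaler uses the population
# standard deviation (ddof=0), which is also NumPy's default for std().
manual = (X - X.mean(axis=0)) / X.std(axis=0)

print(np.allclose(scaled, manual))
print(scaled.mean(axis=0), scaled.std(axis=0))
```

Scaling matters here because KMeans relies on Euclidean distance, so an unscaled feature with a larger range would otherwise dominate the clustering.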

Step 3: Cluster Customers

kmeans = KMeans(n_clusters=5)
df['cluster'] = kmeans.fit_predict(df[['purchase_history', 'browsing_behavior']])
- Creates a KMeans clustering object (kmeans) with 5 clusters (n_clusters=5).
- Selects the scaled purchase_history and browsing_behavior columns.
- Applies KMeans clustering using fit_predict(), which:
  - Assigns each customer to a cluster based on their scaled values.
  - Returns the cluster labels (0-4) and stores them in a new column (cluster)
    in the DataFrame.
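
The choice of 5 clusters is fixed in the script. One common way to sanity-check such a choice is to compare silhouette scores across several values of k. This is an added heuristic, not part of the submission, and the synthetic data below merely stands in for the scaled features.

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.metrics import silhouette_score

# Synthetic stand-in for the scaled features: three well-separated blobs.
rng = np.random.default_rng(0)
centers = np.array([[0.0, 0.0], [5.0, 5.0], [0.0, 5.0]])
X = np.vstack([c + rng.normal(scale=0.3, size=(30, 2)) for c in centers])

# Fit KMeans for each candidate k and record the mean silhouette score
# (closer to 1 means tighter, better-separated clusters).
scores = {}
for k in range(2, 7):
    labels = KMeans(n_clusters=k, n_init=10, random_state=0).fit_predict(X)
    scores[k] = silhouette_score(X, labels)

best_k = max(scores, key=scores.get)
print(scores)
print("best k:", best_k)
```

On real customer data the scores are rarely this clear-cut, so the result is best treated as one input alongside how usable the segments are for marketing.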

Step 4: Visualize Clusters

sns.scatterplot(x='purchase_history', y='browsing_behavior', hue='cluster', data=df)
plt.show()
- Uses Seaborn's scatterplot() function to visualize the clusters.
- Plots the scaled purchase_history values on the x-axis and browsing_behavior values
on the y-axis.
- Colors each point according to its cluster label (hue='cluster').

Step 5: Print Cluster Demographics

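for cluster in df['cluster'].unique():
    print(f"Cluster {cluster}:")
    print(demographic_df[df['cluster'] == cluster].describe())

- Loops through each unique cluster label.
- Filters the expanded demographic table (demographic_df) to include only customers
in the current cluster.
- Prints summary statistics (count, mean, standard deviation, quartiles) for the
numeric demographic fields of that cluster.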
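
The same per-cluster summary can also be produced in a single step with groupby. This is a minimal sketch on made-up rows, where the age column stands in for an expanded demographic field.

```python
import pandas as pd

# Made-up rows for illustration; 'age' stands in for an expanded demographic field.
df = pd.DataFrame({
    "cluster": [0, 0, 1, 1, 1, 2],
    "age": [25, 35, 40, 50, 60, 30],
    "purchase_history": [1, 3, 5, 7, 9, 2],
})

# One row per cluster: segment size plus the average age and purchase count.
summary = df.groupby("cluster").agg(
    customers=("age", "size"),
    mean_age=("age", "mean"),
    mean_purchases=("purchase_history", "mean"),
)
print(summary)
```

A single table like this is easier to hand to a marketing team than a sequence of printed describe() outputs.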

The solution:

1. Identifies patterns in customer behavior (purchase history and browsing behavior).

2. Groups customers into 5 clusters based on these patterns.
3. Visualizes the clusters to understand the customer segments.
4. Provides demographic insights for each cluster.
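
SciPy is listed among the libraries but is not used in the script. One way it might support the demographic insights is a one-way ANOVA testing whether mean age differs between clusters. The ages below are made up for illustration, and the choice of test is an assumption rather than part of the submission.

```python
from scipy import stats

# Made-up ages per cluster label, for illustration only.
ages_by_cluster = {
    0: [22, 25, 27, 24, 26],
    1: [41, 45, 43, 47, 44],
    2: [63, 60, 66, 62, 65],
}

# One-way ANOVA: the null hypothesis is that all clusters share the same mean age.
result = stats.f_oneway(*ages_by_cluster.values())
print(f"F = {result.statistic:.1f}, p = {result.pvalue:.3g}")
```

A small p-value suggests that age really does vary between segments. ANOVA assumes roughly normal groups with similar variances, so a rank-based test such as scipy.stats.kruskal may be safer on skewed data.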

