
Different Methods of Plotting

The document discusses various methods for data analysis and visualization using Pandas and Matplotlib in Python. These include reading data from files, plotting data, cleaning data through operations like dropping duplicates and columns, grouping and aggregating data, merging DataFrames, and indexing and filtering DataFrames.


import pandas as pd
import numpy as np
import matplotlib.pyplot as plt

df = pd.read_csv("/path") >> read_csv for csv files
df = pd.read_excel("/path") >> read_excel for xlsx files

Different Methods of Plotting


df.plot(kind='line', title='//name', xlabel='//name', ylabel='//name')
df.plot(kind='bar', stacked=True)
df.plot.barh(stacked=True)
df.plot.scatter(x='//name', y='//name', c='//color', s=//size)
df.plot.hist(bins=10) >> bins sets the number of intervals
df.boxplot()
df.plot.area()
df.plot.pie(y='//name', figsize=(//size, //size))
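The calls above can be tried on a small made-up DataFrame (the column names and values here are invented for illustration):

```python
import matplotlib
matplotlib.use("Agg")  # render off-screen so no display window is needed
import matplotlib.pyplot as plt
import pandas as pd

# Hypothetical monthly data
df = pd.DataFrame({"sales": [3, 7, 5], "returns": [1, 2, 1]},
                  index=["Jan", "Feb", "Mar"])

ax = df.plot(kind="line", title="Monthly totals", xlabel="Month", ylabel="Count")
df.plot(kind="bar", stacked=True)   # stacked vertical bars
df.plot.barh(stacked=True)          # same data, horizontal bars
df["sales"].plot.hist(bins=3)       # histogram with 3 intervals
plt.close("all")
```

Each call returns a matplotlib Axes object, so titles and labels can also be adjusted afterwards.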

Data Cleaning
df.info() >> provides info about the csv/xlsx file (column names, dtypes, non-null counts)

df.shape >> gives the number of rows and columns as a tuple, e.g. (22, 20)

df = df.drop_duplicates() >> drops duplicate rows

df = df.drop(columns="//name of the column") >> drops a specific column

df["//name of the column"].str.strip() >> removes leading and trailing whitespace from each value

df["//name of the column"] = df["//name of the column"].str.strip("/._") >> removes any of the characters /, . and _ from both ends of each value

df["//name of the column"] = df["//name of the column"].str.replace('[^0-9a-zA-Z]', '', regex=True) >> replaces every character that is not a digit or letter; regex=True is required for pattern-based replacement in recent pandas
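A minimal sketch of both cleaning steps, using invented names:

```python
import pandas as pd

# Hypothetical messy values
df = pd.DataFrame({"Name": ["/Ana_", "Ben.", "Cr@uz"]})

# strip("/._") removes any of the characters / . _ from both ends of each value
df["Name"] = df["Name"].str.strip("/._")

# replace every character that is not a digit or letter with nothing;
# regex=True enables pattern matching in recent pandas versions
df["Name"] = df["Name"].str.replace('[^0-9a-zA-Z]', '', regex=True)
```

Note the difference: strip only touches the ends of the string, while the regex replace removes matching characters anywhere inside it.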

df["//name of the column"] = df["//name of the column"].apply(lambda x: str(x)) >> lambda x: str(x) converts each element x to a string using the str() function

df["Phone_Number"] = df["Phone_Number"].apply(lambda x: x[0:3] + '-' + x[3:6] + '-' + x[6:10]) >> the lambda slices the string x into three parts: the first 3 characters (area code), the next 3 characters (prefix), and the last 4 characters (line number), then joins them with hyphens to format the phone number. Result: "###-###-####"
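Run against two invented 10-digit numbers, the formatting step looks like this:

```python
import pandas as pd

# Hypothetical 10-digit phone numbers already cleaned to plain strings
df = pd.DataFrame({"Phone_Number": ["1234567890", "9876543210"]})

# Slice into area code, prefix, and line number, then join with hyphens
df["Phone_Number"] = df["Phone_Number"].apply(
    lambda x: x[0:3] + '-' + x[3:6] + '-' + x[6:10]
)
```

This assumes every value is already a 10-character string; shorter or non-string values would need the str() conversion step first.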

df[["Street_Address", "State", "Zip_Code"]] = df["Address"].str.split(',', n=2, expand=True) >> str.split(',', n=2, expand=True) splits each "Address" value at the first two commas, and expand=True returns the pieces as three separate columns. The assignment stores those columns in the DataFrame as "Street_Address", "State", and "Zip_Code".
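A small sketch with invented addresses (note that the pieces keep any spaces that followed the commas, so a follow-up strip is often useful):

```python
import pandas as pd

# Hypothetical addresses in "street, state, zip" form
df = pd.DataFrame({"Address": ["12 Main St, TX, 75001",
                               "9 Oak Ave, CA, 90210"]})

# n=2 limits the split to the first two commas, yielding exactly three columns
df[["Street_Address", "State", "Zip_Code"]] = (
    df["Address"].str.split(',', n=2, expand=True)
)
```
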

for x in df.index: >> loops through each row label of the DataFrame
    if df.loc[x, "Do_Not_Contact"] == "Y": >> checks whether the "Do_Not_Contact" value for the current row (x) equals "Y"
        df.drop(x, inplace=True) >> drops the row if its value is "Y"

df = df.reset_index(drop=True) >> resets the index to a default integer index starting from 0 and discards the old index
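The loop and reset together, on a tiny invented contact list:

```python
import pandas as pd

# Hypothetical contacts; Ben has asked not to be contacted
df = pd.DataFrame({"Name": ["Ana", "Ben", "Cruz"],
                   "Do_Not_Contact": ["N", "Y", "N"]})

for x in df.index:
    if df.loc[x, "Do_Not_Contact"] == "Y":
        df.drop(x, inplace=True)      # remove this row in place

df = df.reset_index(drop=True)        # renumber rows 0, 1, ... and drop the old index
```

An equivalent one-liner is `df = df[df["Do_Not_Contact"] != "Y"]`, but the loop makes the row-by-row logic explicit.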

Group by and Aggregating

name_of_the_variable = df.groupby('//name of the column')

df.groupby('//name of the column').mean() >> gets the mean of each numeric column within each group

df.groupby('//name of the column').count() >> counts the non-null values for each column within each group

df.groupby('Base Flavor').sum() >> calculates the sum of numerical values for each column within each group

df.groupby('Base Flavor').describe() >> provides statistics such as count, mean, standard deviation, minimum, maximum, and quartiles

df.groupby(['Base Flavor','Liked']).agg({'Flavor Rating': ['mean','max','count','sum']}) >> applies several aggregation functions to the "Flavor Rating" column within each group
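The agg call can be tried on a small invented ice-cream table (the column names match the notes; the values are made up):

```python
import pandas as pd

df = pd.DataFrame({"Base Flavor": ["Vanilla", "Vanilla", "Chocolate"],
                   "Liked": ["Yes", "Yes", "No"],
                   "Flavor Rating": [8.0, 6.0, 7.0]})

# One row per (Base Flavor, Liked) pair; columns are the requested statistics
agg = df.groupby(['Base Flavor', 'Liked']).agg(
    {'Flavor Rating': ['mean', 'max', 'count', 'sum']}
)
```

The result has a MultiIndex on both axes: rows are keyed by the group pair, and columns are keyed by ("Flavor Rating", statistic).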


Merging

df_inner = df1.merge(df2, how='inner', on=['FellowshipID', 'FirstName']) >> df_inner contains only the rows where the values in both the "FellowshipID" and "FirstName" columns match in df1 and df2

df_outer = df1.merge(df2, how='outer') >> df_outer contains all rows from both df1 and df2, with NaN filled in where data is missing from either DataFrame

df1.merge(df2, how='left') >> keeps every row of df1; where a row of df1 has no match in df2, NaN is filled in for the columns from df2

df_right = df1.merge(df2, how='right') >> a right join keeps all rows of df2 and appends only the matching rows of df1; rows of df2 with no match in df1 get NaN for the columns from df1
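A sketch of the join types on two tiny invented DataFrames (when `on=` is omitted, merge joins on all columns the frames share, here "FellowshipID" and "FirstName"):

```python
import pandas as pd

df1 = pd.DataFrame({"FellowshipID": [1, 2, 3],
                    "FirstName": ["Frodo", "Sam", "Merry"],
                    "Age": [50, 38, 36]})
df2 = pd.DataFrame({"FellowshipID": [1, 2, 4],
                    "FirstName": ["Frodo", "Sam", "Legolas"],
                    "Weapon": ["Sting", "Pan", "Bow"]})

df_inner = df1.merge(df2, how='inner', on=['FellowshipID', 'FirstName'])  # only matches
df_outer = df1.merge(df2, how='outer')  # everyone, NaN where a side is missing
df_left = df1.merge(df2, how='left')    # all of df1; Merry gets NaN for Weapon
```
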

From Exercise 1
data[:4] >> slices the first four rows of the DataFrame
data.head() >> gets the first 5 rows

Indexing Columns
data.director_name[:4]
cols = ["movie_title","director_name"]
data[cols][:5]

Finding info for a specific person (Find Movies by James Cameron)

james = data[data.director_name == 'James Cameron']
show = ["movie_title","director_name"]
james[show][:5]

Sorting
sorted_data = data.sort_values(by="gross", ascending=False) >> sorts the rows by gross, highest first
sorted_data[:5] >> displays the first 5 rows of the sorted result
To show only 2 specific columns (movie title and gross):
sorted_data = data.sort_values(by="gross", ascending=False)
cols = ["movie_title","gross"]
sorted_data[cols][:5]
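The same pattern on a tiny invented movie table (three rows rather than a full dataset):

```python
import pandas as pd

data = pd.DataFrame({"movie_title": ["A", "B", "C"],
                     "gross": [300, 100, 200]})

# Sort by gross, highest first, then keep two columns and the first rows
sorted_data = data.sort_values(by="gross", ascending=False)
cols = ["movie_title", "gross"]
top = sorted_data[cols][:2]
```
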

Top 5 Films of Michael Bay

df = df[df.director_name == 'Michael Bay'] >> keeps only the films directed by Michael Bay
df = df.head(5) >> keeps the first 5 of them (the top 5 if the DataFrame is already sorted)

Challenge 5
sortedData = df2[df2['gross'] == 67344392] >> finds the row(s) whose gross is 67344392, including the actor's name
cols = ["movie_title","gross","actor_1_name"] >> keeps only these 3 specific columns
sortedData[cols] >> displays those columns for the filtered rows

sortedData2 = df3[df3['actor_3_name'] == 'Omar Sy'] >> finds the rows where actor_3_name is Omar Sy
cols = ["movie_title","actor_1_name","actor_3_name","country"] >> keeps only these 4 specific columns
sortedData2[cols] >> displays those columns for the filtered rows

actorOne = df3[df3['actor_3_name'] == 'Omar Sy'] >> filters df3 to rows with "Omar Sy" as actor_3_name
actorThree = df3[df3['actor_1_name'] == 'Bruce Willis'] >> filters df3 to rows with "Bruce Willis" as actor_1_name
mdata = pd.merge(actorOne, actorThree, how='outer') >> merges the two filtered DataFrames, keeping all rows from both
mdata >> displays the merged result
