0% found this document useful (0 votes)
101 views18 pages

Internship Presentation

The document summarizes an internship presentation on data science with Python. The presentation covers Python libraries and packages for data analysis (NumPy, Pandas, Matplotlib), machine learning algorithms (linear regression, logistic regression, decision trees, K-means clustering), and data visualization. It also discusses Python setup, mathematical computing, scientific computing, data manipulation, machine learning with SciKit learn, and integrating Python with Hadoop MapReduce and Spark. The presentation is delivered by Yash Litoriya to the Department of Mechanical Engineering at JSS Academy of Technical Education in Noida, India.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
101 views18 pages

Internship Presentation

The document summarizes an internship presentation on data science with Python. The presentation covers Python libraries and packages for data analysis (NumPy, Pandas, Matplotlib), machine learning algorithms (linear regression, logistic regression, decision trees, K-means clustering), and data visualization. It also discusses Python setup, mathematical computing, scientific computing, data manipulation, machine learning with SciKit learn, and integrating Python with Hadoop MapReduce and Spark. The presentation is delivered by Yash Litoriya to the Department of Mechanical Engineering at JSS Academy of Technical Education in Noida, India.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 18

INTERNSHIP/MINI PROJECT PRESENTATION

ON
DATA SCIENCE WITH PYTHON
BY-
Yash Litoriya
Roll No-2000910400152

DEPARTMENT OF MECHANICALENGINEERING
JSS ACADEMY OF TECHNICAL EDUCATION
NOIDA
DATA SCIENCE WITH PYTHON

OBJECTIVE -
This objective is to focus on the Python language
and fundamentals of some of the most widely
used Python packages; including NumPy, Pandas
and Matplotlib, then apply them to Data
Analysis and Data Visualization projects.
Topic
1. Python libraries
2. Linear regression
3. Logistic regression
4. Decision tree in machine learning
5. K means clustering
6. Dimesionality reduction
7. Python setup and essentials
8. Mathematical computing with python (NumPy)
9. Scientific computing with python (SciPy)
10. Data manipulation with Pandas
11. Machine learning with SciKit learn
12. Data visualisation with python using MatPlot lib
13. Python integration with Hadoop Map Reduce and Spark
Libraries in python
Linear Regression
Logistic regression

•Logistic regression is a fundamental classification technique.


•It belongs to the group of linear classifiers and is somewhat similar to polynomial
and linear regression.
•Logistic regression is fast and relatively uncomplicated, and it’s convenient for you
to interpret the results.
•Although it’s essentially a method for binary classification, it can also be applied to
multiclass problems.
Decision Tree in machine learning
K means clustering

•The technique to segregate Datasets into various groups, on basis of having similar
features and characteristics, is being called Clustering.

•Kmeans Algorithm is an Iterative algorithm that divides a group of n datasets into


k subgroups /clusters based on the similarity and their mean distance from the
centroid of that particular subgroup/ formed.
Dimensionality reduction
Python setup and essentials

Anaconda is a distribution of the Python and R programming languages for scientific


computing (data science, machine learning applications, large-scale data processing,
predictive analytics, etc.), that aims to simplify package management and deployment.
Mathematical computing with python
(NumPy)
Scientific computing with python
(SciPy)
Data manipulation with Pandas
Machine learning with SciKit learn
Data visualisation with python using
MatPlotlib
Data visualisation -
• You are a Sales Manager in a leading global organization.
• The organization plans to study the sales details of each product across all regions and
countries.
• This is to identify the product which has the highest sales in a particular region and up the
production. This research will enable the organization to increase the manufacturing of that
product in that particular region.
Python integration with Hadoop Map Reduce
and Spark
Thank You

You might also like

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy