0% found this document useful (0 votes)
92 views10 pages

Class 1. Introduction To Data Science

Data science involves using computer science, statistics, and machine learning to analyze large amounts of data. It encompasses collecting, cleaning, analyzing, visualizing, and interacting with data to derive insights and create data products. Data science draws upon skills from computer science like coding and algorithms, mathematics like statistics, and domain expertise to understand data in context. The amount of data being generated is growing exponentially and new technologies like the Square Kilometer Array telescope will generate as much data in a single day as the entire planet produces in a year, driving more opportunities for data science.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
92 views10 pages

Class 1. Introduction To Data Science

Data science involves using computer science, statistics, and machine learning to analyze large amounts of data. It encompasses collecting, cleaning, analyzing, visualizing, and interacting with data to derive insights and create data products. Data science draws upon skills from computer science like coding and algorithms, mathematics like statistics, and domain expertise to understand data in context. The amount of data being generated is growing exponentially and new technologies like the Square Kilometer Array telescope will generate as much data in a single day as the entire planet produces in a year, driving more opportunities for data science.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 10

Data Analytics and

Data Science

Dra. Ana Carolina Torregroza Espinosa


Departamento PI
Universidad de la Costa
Introduction to Data
Science

Data Science is the science which uses computer,


statistics and machine learning and human-
computer interactions to collect, clean, integrate,
analyze, visualize and interact with data to create
data products.
Data Science: Solving Problems with Data

Computer science, data Algorithms and


MATH AND numerical techniques
engineering and HACKING
wrangling, coding STATISTICS to derive insights
SKILLS Machine KNOWLEDGE
Learning

DATA
SCIENCE

Danger Traditional
Zone! Research

Understanding of the
underlying assumptions Domain knowledge,
business acumen, experience, value
SUBSTANTIVE to the business
EXPERIENCE
AI, Machine Learning, and Deep Learning

• AI: Getting machines to do


what humans are good at

• Machine Learning:
Feeding an algorithm data
to learn and predict
something

• Deep Learning: A type of


machine learning
What’s all the fuss?
This stuff was created many many years ago

• Bayes Theorem • Thomas Bayes mid 1700’s

• Regression • Legendre, Gauss and Galton early 1800’s

• Neural
Networks
• McCulloch and Pitts early 1940s
Think about All Our Data and Compute

SKA - 2020
(Square Kilometer Array Telescope)

It is still
GROWING!

Will generate as much


data in a day as the entire
PLANET does in a year!
Data Science

https://www.ted.com/talks/kenneth_c
ukier_big_data_is_better_data?langu
age=en#t-939136
What are “Big Data”?
The Data Science Process: Getting from Raw Data to
Outcomes
The Data Science
Formal Framework CRISP–DM Workflow
Cross Industry Standard Process
for Data Mining

Joe Blizstein and Hanspeter Pfister created for Harvard Data Science
course.
Questions?

You might also like

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy