0% found this document useful (0 votes)

23 views5 pages

Semana 3

Databases are essential tools for data analysts that help store, organize and access data more efficiently. Relational databases connect tables through common fields like primary and foreign keys, which reference unique identifiers and link data across tables. SQL is the primary language used to query and manipulate data within databases.

Uploaded by

Maria Pia Barreto

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

23 views5 pages

Semana 3

Uploaded by

Maria Pia Barreto

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 5

ALL ABOUT DATA BASES

Hello again. So far, you've seen how data can be gathered and analyzed to solve all kinds of
problems. Next step, we're going to learn all about databases as a refresher. A database is a
collection of data stored in a computer system, but storage is just the beginning. You'll
discover how databases make it possible to find the exact piece of information you need for
your analysis. You'll also learn how to sort data in order to zoom in on what you need to
generate insightful reports and much more. Then we'll go even deeper, and I mean really,
really deep. I'm talking about metadata. You've probably heard someone say, wow that's so
meta. Usually they're talking about something referencing back to itself or being completely
self aware. For example if a character in a book knows she's in a book, that's meta. If you make
a documentary about making documentaries, that's also meta. And here at Google, I
constantly analyze how I analyze data. That's definitely meta.

Reproduce el video desde :1:3 y sigue la transcripción1:03

I do that to give my work a quality check to make sure my methods are fair. And to be certain
that I'm paying attention to any biases that might affect the outcome. As an analyst, you
should do this too. Sometimes we get a little too close to our data. So stepping back and asking
ourselves if our processes make sense is key. But let's back up just a bit and define metadata.
Metadata is data about data. Like I said: deep.

Reproduce el video desde :1:30 y sigue la transcripción1:30

Metadata is extremely important when working with databases. Think of it like a reference
guide. Without the guide all you have is a bunch of data with no context explaining what it
means. Metadata tells you where the data comes from, when and how it was created, and
what it's all about.

Reproduce el video desde :1:48 y sigue la transcripción1:48

Up next, you'll learn how to take data from a database or another source and bring it into a
spreadsheet. You'll do this either by importing it directly or by using SQL to generate the
request. And once you have data in a spreadsheet, the possibilities are endless. Everything
we're about to cover is a very important part of the prepare phase of the data analysis process.
It's how data analysts figure out which kind of data is going to be most helpful to them. If you
have the right data, you're much more likely to be able to solve your business problems
successfully. So, ready to tap into the incredible power of databases? Let's go!

DATABASE FEATURE

Databases are essential tools for data analysts. I use them constantly. Just about all of the data
I access is stored within databases. Databases store and organize data, making it much easier
for data analysts to manage and access information. They help us get insights faster, make
data-driven decisions, and solve problems. You've already heard a bit about what databases
are and how they're used by data analysts. Now let's learn more about database features and
components. Here's a simple database structure. It contains tables with information from a car
manufacturer. The top level includes car dealerships, product details, and repair parts. Then if
you drill down to the next level by selecting one of those tables, you'll find more specific
details about each item. This is called a relational database. A relational database is a database
that contains a series of related tables that can be connected via their relationships. For two
tables to have a relationship, one or more of the same fields must exist inside both tables. For
example, here, branch ID exists in this table and this one. If a field exists within both tables, we
can use it to connect the tables together. The branch ID field is the key to connecting these
tables. There are two types of keys. A primary key is an identifier that references a column in
which each value is unique. You can think of it as a unique identifier for each row in a table.
For our dealership table with information about the different dealership branches, branch ID is
the primary key. Similarly, for the product details table about each car, VIN is our primary key.
As an analyst you may need to create tables. If you do decide to include a primary key, it
should be unique, meaning no two rows can have the same primary key. Also, it cannot be null
or blank. There are also foreign keys. A foreign key is a field within a table that's a primary key
in another table. In other words, a foreign key is how one table can be connected to another.
Because our repair parts table contains information about each car part, the primary key is
part ID. Each row in our repair parts table represents one unique part. All the other keys in this
table, such as the VIN, are the foreign keys that allow the repair parts table to be connected to
the other tables. As you can see, a table can only have one primary key but it can have multiple
foreign keys. Understanding primary and foreign keys can be tricky, so you'll have more
opportunities to practice coming up. But as a general summary, a primary key is used to ensure
data in a specific column is unique. It uniquely identifies a record in a relational database table.
Only one primary key is allowed in a table and they cannot contain null or blank values. And a
foreign key is a column or group of columns in a relational database table that provides a link
between the data and two tables. It refers to the field in a table that's the primary key of
another table. Lastly, it's important to note that more than one foreign key is allowed to exist
in a table. Feel free to rewatch this video to be sure you understand primary and foreign keys
clearly. And coming up, you'll begin practicing how to access and analyze data from actual
databases. That will be a great opportunity to improve your understanding of primary and
foreign keys, database organization and how you might use databases in your future analytics
career.

Databases in data analytics

Databases enable analysts to manipulate, store, and process data. This helps them search
through data a lot more efficiently to get the best insights.

Relational databases
A relational database is a database that contains a series of tables that can be connected
to show relationships. Basically, they allow data analysts to organize and link data based
on what the data has in common.

In a non-relational table, you will find all of the possible variables you might be interested in
analyzing all grouped together. This can make it really hard to sort through. This is one
reason why relational databases are so common in data analysis: they simplify a lot of
analysis processes and make data easier to find and use across an entire database.

The key to relational databases

Tables in a relational database are connected by the fields they have in common. You
might remember learning about primary and foreign keys before. As a quick refresher, a
primary key is an identifier that references a column in which each value is unique. In
other words, it's a column of a table that is used to uniquely identify each record within that
table. The value assigned to the primary key in a particular row must be unique within the
entire table. For example, if customer_id is the primary key for the customer table, no two
customers will ever have the same customer_id.

By contrast, a foreign key is a field within a table that is a primary key in another table. A
table can have only one primary key, but it can have multiple foreign keys. These keys are
what create the relationships between tables in a relational database, which helps organize
and connect data across multiple tables in the database.

Some tables don't require a primary key. For example, a revenue table can have multiple
foreign keys and not have a primary key. A primary key may also be constructed using
multiple columns of a table. This type of primary key is called a composite key. For
example, if customer_id and location_id are two columns of a composite key for a customer
table, the values assigned to those fields in any given row must be unique within the entire
table.
SQL? You’re speaking my language
Databases use a special language to communicate called a query language. Structured
Query Language (SQL) is a type of query language that lets data analysts communicate
with a database. So, a data analyst will use SQL to create a query to view the specific data
that they want from within the larger set. In a relational database, data analysts can write
queries to get data from the related tables. SQL is a powerful tool for working with
databases — which is why you are going to learn more about it coming up!

Inspecting a dataset: A guided, hands-

on tour

As a data analyst, you'll use data to answer questions and solve problems. When you
analyze data and draw conclusions, you are generating insights that can influence business
decisions, drive positive change, and help your stakeholders meet their goals.

Before you begin an analysis, it’s important to inspect your data to determine if it contains
the specific information you need to answer your stakeholders’ questions. In any given
dataset, it may be the case that:

 The data is not there (you have sandwich data, but you need pizza data)
 The data is insufficient (you have pizza data for June 1-7, but you need data
for the entire month of June)
 The data is incorrect (your pizza data lists the cost of a slice as $250, which
makes you question the validity of the dataset)
Inspecting your dataset will help you pinpoint what questions are answerable and what data
is still missing. You may be able to recover this data from an external source or at least
recommend to your stakeholders that another data source be used.

In this reading, imagine you’re a data analyst inspecting spreadsheet data to determine if
it’s possible to answer your stakeholders’ questions.

The scenario
You are a data analyst working for an ice cream company. Management is interested in
improving the company's ice cream sales.

The company has been collecting data about its sales—but not a lot. The available data is
from an internal data source and is based on sales for 2019. You’ve been asked to review
the data and provide some insight into the company’s ice cream sales. Ideally,
management would like answers to the following questions:

1. What is the most popular flavor of ice cream?

2. How does temperature affect sales?
3. How do weekends and holidays affect sales?
4. How does profitability differ for new versus returning customers?

Chapter 10 Database
No ratings yet
Chapter 10 Database
76 pages
Databases in Data Analytics - Coursera
No ratings yet
Databases in Data Analytics - Coursera
2 pages
Database and Database Management System
No ratings yet
Database and Database Management System
8 pages
Database Presentation
No ratings yet
Database Presentation
33 pages
Introduction To Databases Notes
No ratings yet
Introduction To Databases Notes
3 pages
Relational Databases
No ratings yet
Relational Databases
30 pages
10TH DBMS
No ratings yet
10TH DBMS
95 pages
12 RDBMS
No ratings yet
12 RDBMS
8 pages
Introduction To Database and Relation Database
No ratings yet
Introduction To Database and Relation Database
28 pages
Dbms Notes
No ratings yet
Dbms Notes
61 pages
DBMS
No ratings yet
DBMS
32 pages
Database Management System
No ratings yet
Database Management System
13 pages
Based On Material by Myra Cohen and Formatted by Robbie de La Vega
No ratings yet
Based On Material by Myra Cohen and Formatted by Robbie de La Vega
4 pages
Amber SQL
No ratings yet
Amber SQL
6 pages
Basic Concepts of Database
No ratings yet
Basic Concepts of Database
16 pages
Chap-11 Relational Databases Notes Class 12
No ratings yet
Chap-11 Relational Databases Notes Class 12
12 pages
Relational Database Management Systems (Basic)
No ratings yet
Relational Database Management Systems (Basic)
18 pages
ITFPlusEBook (FC0 U61) Module2 - Unit4
No ratings yet
ITFPlusEBook (FC0 U61) Module2 - Unit4
11 pages
Unit - 3 RDBMS
No ratings yet
Unit - 3 RDBMS
51 pages
Lecture 1 Examples
No ratings yet
Lecture 1 Examples
20 pages
Introductiontodatabases 151106233350 Lva1 App6892
No ratings yet
Introductiontodatabases 151106233350 Lva1 App6892
33 pages
Lecture 13 Database
No ratings yet
Lecture 13 Database
34 pages
Hierarchy of Data: Database File
No ratings yet
Hierarchy of Data: Database File
40 pages
Lec 14 Database
No ratings yet
Lec 14 Database
45 pages
Romney Ais13 PPT 04
No ratings yet
Romney Ais13 PPT 04
29 pages
Notes of Database
No ratings yet
Notes of Database
3 pages
It4 Access Reviewer
No ratings yet
It4 Access Reviewer
7 pages
الوحدة الرابعة -
No ratings yet
الوحدة الرابعة -
9 pages
Database - Management Functions
No ratings yet
Database - Management Functions
7 pages
Database Summary Note
No ratings yet
Database Summary Note
10 pages
DBMS PPT (F)
No ratings yet
DBMS PPT (F)
11 pages
Database
No ratings yet
Database
28 pages
ACCESS Notes
No ratings yet
ACCESS Notes
7 pages
Introduction To Database Programming: What Is A Database?
No ratings yet
Introduction To Database Programming: What Is A Database?
4 pages
Access Notes and Activity
No ratings yet
Access Notes and Activity
9 pages
Introduction To Database Management System
No ratings yet
Introduction To Database Management System
14 pages
DBMS Fundamentals
No ratings yet
DBMS Fundamentals
6 pages
Unit - 3 RDBMS
No ratings yet
Unit - 3 RDBMS
51 pages
Database
No ratings yet
Database
6 pages
Introduction To MS Access
No ratings yet
Introduction To MS Access
20 pages
Components of A Database System
No ratings yet
Components of A Database System
42 pages
4.database Concepts
No ratings yet
4.database Concepts
7 pages
Ch-11 Relational Databases
No ratings yet
Ch-11 Relational Databases
47 pages
Name Qandeelmahmood
No ratings yet
Name Qandeelmahmood
5 pages
Database
No ratings yet
Database
5 pages
11 TH
No ratings yet
11 TH
11 pages
Databases
No ratings yet
Databases
30 pages
Unit2-Relational Database Model
No ratings yet
Unit2-Relational Database Model
7 pages
Databases and MS ACCESS
No ratings yet
Databases and MS ACCESS
13 pages
Module No.2.0: Office Application Unit No.2.3: Database Application Element 2.3.1: Selecting Database
No ratings yet
Module No.2.0: Office Application Unit No.2.3: Database Application Element 2.3.1: Selecting Database
16 pages
Tables in Access
No ratings yet
Tables in Access
6 pages
Database Note
No ratings yet
Database Note
19 pages
It U2 Notes
No ratings yet
It U2 Notes
86 pages
MSAccess Database Management System
No ratings yet
MSAccess Database Management System
13 pages
Information Technology Part - B: Unit-3 Database Development I Define The Following
No ratings yet
Information Technology Part - B: Unit-3 Database Development I Define The Following
21 pages
DBMS Notes
No ratings yet
DBMS Notes
5 pages
Database Concepts Till Features of MySQL
No ratings yet
Database Concepts Till Features of MySQL
13 pages
DatabaseProgramming PDF
No ratings yet
DatabaseProgramming PDF
93 pages
DBMS 19-20
No ratings yet
DBMS 19-20
2 pages
0 AI M Learning Deep Learning 2022
No ratings yet
0 AI M Learning Deep Learning 2022
81 pages
UNIT 1 - BIG DATA ANALYTICS Full
No ratings yet
UNIT 1 - BIG DATA ANALYTICS Full
28 pages
Topic Segmentation For Textual Document Written in Arabic Language
No ratings yet
Topic Segmentation For Textual Document Written in Arabic Language
10 pages
Aec-Vac Student's List of Even Sem 23-24
No ratings yet
Aec-Vac Student's List of Even Sem 23-24
2 pages
TE Syllabus
No ratings yet
TE Syllabus
105 pages
Sun Dbms QB
No ratings yet
Sun Dbms QB
3 pages
Introduction To Data Science and Analytics
100% (2)
Introduction To Data Science and Analytics
31 pages
Cec 349 Rfid System Design and Testing
No ratings yet
Cec 349 Rfid System Design and Testing
1 page
Https Github Com Prasadkaru MongoDB Lesson Code 1704253560
No ratings yet
Https Github Com Prasadkaru MongoDB Lesson Code 1704253560
48 pages
CSEC Information Technology June 2016 P02 ANS
No ratings yet
CSEC Information Technology June 2016 P02 ANS
17 pages
Unit 4 Unit 4 Bda
No ratings yet
Unit 4 Unit 4 Bda
16 pages
0943 7444 2016 2 107
No ratings yet
0943 7444 2016 2 107
6 pages
Dr. B. R. AMBEDKAR AND MAKING OF THE CONSTITUTION - A Case Study of Indian Federalism
No ratings yet
Dr. B. R. AMBEDKAR AND MAKING OF THE CONSTITUTION - A Case Study of Indian Federalism
13 pages
Subtitle
No ratings yet
Subtitle
2 pages
Lecture 1 - Data Mining 101
No ratings yet
Lecture 1 - Data Mining 101
23 pages
Seminar On Artificial Neural Network
No ratings yet
Seminar On Artificial Neural Network
17 pages
Mam Project
No ratings yet
Mam Project
27 pages
MCA Sem 5 (Rev) Syllabus 2010
No ratings yet
MCA Sem 5 (Rev) Syllabus 2010
19 pages
Cloud Computing Final Report
No ratings yet
Cloud Computing Final Report
18 pages
Paper 17881
No ratings yet
Paper 17881
6 pages
Laudon Mis15 PPT Ch06
No ratings yet
Laudon Mis15 PPT Ch06
40 pages
Durga Prasad Resume
No ratings yet
Durga Prasad Resume
1 page
Clinincal Decision Support System
No ratings yet
Clinincal Decision Support System
10 pages
Intern Backend Developer CV
No ratings yet
Intern Backend Developer CV
1 page
Possible Panel Question
No ratings yet
Possible Panel Question
2 pages
Database Management
No ratings yet
Database Management
199 pages
Ieee 1
No ratings yet
Ieee 1
6 pages
Z Syed Zubair Ahmed - Docx - Removed
No ratings yet
Z Syed Zubair Ahmed - Docx - Removed
6 pages
DBMS Exam Questions
No ratings yet
DBMS Exam Questions
3 pages

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

Semana 3

Uploaded by

Semana 3

Uploaded by

ALL ABOUT DATA BASES

Reproduce el video desde :1:3 y sigue la transcripción1:03

Reproduce el video desde :1:30 y sigue la transcripción1:30

Reproduce el video desde :1:48 y sigue la transcripción1:48

Databases in data analytics

The key to relational databases

Inspecting a dataset: A guided, hands-

1. What is the most popular flavor of ice cream?

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.