Data Information For Interview
Here are the steps on how to create data products for a bank:
1. Identify the business problem. What is the bank trying to achieve with the data
product? What are the specific business goals that the data product should help
to achieve?
2. Gather the data. What data is available that can be used to solve the business
problem? This data could come from a variety of sources, such as customer
transactions, customer surveys, or third-party data.
3. Clean and prepare the data. The data needs to be cleaned and prepared before
it can be used for analysis. This includes removing errors, filling in missing
values, and transforming the data into a format that can be used by the data
product.
4. Build the data product. The data product is the application or tool that will use the
data to solve the business problem. This could be a dashboard, a predictive
model, or a recommendation engine; a minimal sketch of steps 3 and 4 follows this list.
5. Deploy the data product. The data product needs to be deployed so that it can be
used by the bank's employees or customers. This may involve integrating the
data product with the bank's existing systems or making it available through a
web portal.
6. Monitor and evaluate the data product. Once the data product is deployed, it is
important to monitor its performance and to evaluate whether it is achieving the
desired business goals. This may involve collecting feedback from users or
tracking the data product's impact on the bank's bottom line.
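As a minimal sketch of steps 3 and 4, the Python snippet below cleans a hypothetical customer file with pandas and fits a simple churn model with scikit-learn. The file name (transactions.csv) and the columns (age, balance, churned) are assumptions made for illustration, not a prescribed schema.

```python
import pandas as pd
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

# Step 3: clean and prepare the data (hypothetical file and columns).
df = pd.read_csv("transactions.csv")
df = df.drop_duplicates()                          # remove duplicate records
df["age"] = df["age"].fillna(df["age"].median())   # fill missing values
df["balance"] = df["balance"].clip(lower=0)        # correct obvious errors

# Step 4: build the data product, here a simple churn-prediction model.
X = df[["age", "balance"]]
y = df["churned"]
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=42
)
model = LogisticRegression().fit(X_train, y_train)
print("Hold-out accuracy:", model.score(X_test, y_test))
```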
Here are some additional tips for creating data products for a bank:
● Start with a clear understanding of the business problem. The data product
should be designed to solve a specific business problem, not just to generate
interesting insights.
● Use the right data. The data product should be built using data that is relevant to
the business problem and that is of high quality.
● Involve the business stakeholders. The data product should be designed in
collaboration with the business stakeholders who will be using it. This will help to
ensure that the data product meets their needs and that it is used effectively.
● Test and iterate. The data product should be tested and iterated on to ensure
that it is working as expected and that it is meeting the needs of the business.
Data modeling is the process of creating a blueprint of the data that will be stored in a
database. It is a critical step in the development of any data-driven application, as it
ensures that the data is structured in a way that is both efficient and effective. Here are
the steps on how to do data modeling for a bank:
1. Identify the business requirements. What data does the bank need to store in
order to meet its business goals? This could include data on customers,
products, transactions, and risk factors.
2. Understand the data sources. Where will the data come from? This could include
internal data sources, such as customer transaction records, or external data
sources, such as credit bureau reports.
3. Create a data model. The data model should represent the data entities and their
relationships in a way that is both logical and efficient. This can be done using a
variety of data modeling tools and techniques.
4. Validate the data model. The data model should be validated to ensure that it
meets the business requirements and that it is consistent with the data sources.
This can be done by reviewing the data model with the business stakeholders
and by running data validation tests.
5. Implement the data model. The data model should be implemented in the
database so that the data can be stored and accessed. This may involve creating
new tables, columns, and relationships in the database, as shown in the sketch after this list.
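As a minimal sketch of step 5, the snippet below implements a small, hypothetical model using Python's built-in sqlite3 module; the customer, account, and transaction tables and their columns are illustrative only, not a recommended banking schema.

```python
import sqlite3

conn = sqlite3.connect("bank_model.db")

# Illustrative entities and relationships: a customer holds accounts,
# and each account has transactions (foreign keys capture the links).
conn.executescript("""
CREATE TABLE IF NOT EXISTS customer (
    customer_id INTEGER PRIMARY KEY,
    name        TEXT NOT NULL,
    risk_rating TEXT
);
CREATE TABLE IF NOT EXISTS account (
    account_id  INTEGER PRIMARY KEY,
    customer_id INTEGER NOT NULL REFERENCES customer(customer_id),
    product     TEXT,
    balance     REAL DEFAULT 0
);
CREATE TABLE IF NOT EXISTS bank_transaction (
    transaction_id INTEGER PRIMARY KEY,
    account_id     INTEGER NOT NULL REFERENCES account(account_id),
    amount         REAL NOT NULL,
    posted_at      TEXT
);
""")
conn.commit()
conn.close()
```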
Data profiling is the process of inspecting and analyzing data to identify its
characteristics and quality. It is a critical step in the data quality improvement process,
as it helps to identify data errors, inconsistencies, and gaps.
Here are the steps on how to do data profiling for a bank:
1. Identify the data sources. What data will be profiled? This could include data from
internal sources, such as customer transaction records, or external sources, such
as credit bureau reports.
2. Define the data profiling objectives. What do you want to achieve with the data
profiling? This could include identifying data errors, inconsistencies, and gaps;
understanding the data distribution; or assessing the data quality.
3. Select the data profiling tools. There are a number of data profiling tools
available, both commercial and open source. The tool you select will depend on
the size and complexity of the data set, as well as your specific profiling
objectives.
4. Run the data profiling analysis. The data profiling tool will analyze the data and
generate a report that identifies the data characteristics and quality; a lightweight
example of this step follows the list.
5. Review the data profiling report. The data profiling report should be reviewed to
identify any data errors, inconsistencies, or gaps.
6. Take corrective action. Any data errors, inconsistencies, or gaps identified in the
data profiling report should be corrected.
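As a lightweight example of step 4, a basic profiling pass can be run directly in pandas before (or instead of) a dedicated tool; the file and column names below are hypothetical, and a full profiling tool would produce a much richer report.

```python
import pandas as pd

df = pd.read_csv("customer_transactions.csv")  # hypothetical input file

# Basic profile: size, types, missing values, duplicates, and value ranges.
print("Rows, columns:", df.shape)
print("\nColumn types:\n", df.dtypes)
print("\nMissing values per column:\n", df.isna().sum())
print("\nDuplicate rows:", df.duplicated().sum())
print("\nNumeric summary:\n", df.describe())

# Value distribution for a hypothetical categorical column.
if "account_type" in df.columns:
    print("\nValue counts for account_type:\n", df["account_type"].value_counts())
```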
Here are some additional tips for data profiling for a bank:
● Involve the business stakeholders. The data profiling process should involve the
business stakeholders who will be using the data. This will help to ensure that
the data profiling is meeting their needs and that the results are actionable.
● Use a data profiling tool. There are a number of data profiling tools available that
can automate the data profiling process. These tools can help to ensure that the
data profiling is consistent and accurate.
● Document the data profiling results. The data profiling results should be
documented so that they can be understood and maintained by others. This
documentation should include the data profiling report, as well as the corrective
actions that were taken.
Data lakes and data marts are both types of data repositories, but they have different
purposes and use cases.
A data lake is a large, centralized repository for all of an organization's data, regardless
of its format or structure. This data can be structured, semi-structured, or unstructured,
and it can come from a variety of sources, such as transactional systems, social media,
and sensors. Data lakes are often used for exploratory data analysis and machine
learning, as they allow organizations to store and analyze all of their data in one place.
A data mart is a smaller, more focused repository of data that is typically used for a
specific business unit or function. Data marts are typically more structured than data
lakes, and they are often used for reporting and analysis.
Here is a table that summarizes the key differences between data lakes and data marts:

| Aspect | Data lake | Data mart |
| --- | --- | --- |
| Scope | All of an organization's data | A specific business unit or function |
| Data structure | Structured, semi-structured, and unstructured | Mostly structured |
| Typical use | Exploratory analysis and machine learning | Reporting and analysis |
| Data volume | Large | Smaller |
| Access | More open | More controlled |
| Governance | Less formal governance | More formal governance |
Which type of data repository is right for your organization depends on your specific
needs and requirements. If you need to store and analyze all of your data in one place,
then a data lake may be a good option for you. If you need a more focused repository of
data for a specific business unit or function, then a data mart may be a better choice.
Here are some additional considerations when choosing between a data lake and a
data mart:
● Data volume: Data lakes are typically used for large volumes of data, while data
marts are typically used for smaller volumes of data.
● Data complexity: Data lakes can store data of any complexity, while data marts
typically store more structured data.
● Data access: Data lakes typically have more open access, while data marts
typically have more controlled access.
● Data governance: Data lakes typically have less data governance, while data
marts typically have more data governance.
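To make the contrast concrete, here is a hypothetical Python sketch of the two access patterns: raw, mixed-format files are read straight from a lake-style store for exploration, while a mart is queried as a curated, structured table. All paths, file names, and table names are illustrative assumptions.

```python
import sqlite3

import pandas as pd

# Data lake pattern: read raw files in whatever format they landed in.
raw_events = pd.read_json("lake/raw/web_events.json", lines=True)  # hypothetical path
raw_payments = pd.read_parquet("lake/raw/payments/")               # hypothetical path

# Data mart pattern: query a curated table built for one business function.
conn = sqlite3.connect("retail_banking_mart.db")                   # hypothetical mart
monthly_revenue = pd.read_sql(
    "SELECT branch, month, SUM(revenue) AS revenue "
    "FROM branch_revenue GROUP BY branch, month",
    conn,
)
conn.close()
```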
Git and AWS are two powerful tools that can be used to manage data.
● Git is a distributed version control system (VCS) that allows you to track changes
to your data over time. This can be helpful for data scientists who need to track
changes to their models or datasets. Git also makes it easy to collaborate on
data projects, because it lets you share your code and data with others; a short
example of versioning a dataset with Git follows this list.
● AWS is a cloud computing platform that offers a wide range of services for data
storage, processing, and analysis. This can be helpful for data scientists who
need to store large amounts of data or who need to process data in real time.
AWS also offers a variety of machine learning services that can be used to
analyze data.
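As a short example of versioning a dataset with Git from Python, the sketch below uses the third-party GitPython package (assumed to be installed, e.g. with pip install GitPython); the repository and file names are placeholders.

```python
from git import Repo  # GitPython, assumed installed

repo = Repo.init("churn-model")  # create (or reuse) a local repository

# Write a first version of the training data inside the repository.
with open("churn-model/training_data.csv", "w") as f:
    f.write("age,balance,churned\n35,1200.50,0\n")

# Stage and commit the file so this version can be recovered later.
repo.index.add(["training_data.csv"])
repo.index.commit("Add first cut of training data")

# Review the history of changes over time.
for commit in repo.iter_commits():
    print(commit.hexsha[:7], commit.message.strip())
```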
Here are some of the specific roles that Git and AWS can play in data:
● Version control: Git can be used to track changes to data over time, which can be
helpful for data scientists who need to track changes to their models or datasets.
● Collaboration: Git makes it easy to collaborate on data projects, because it lets
you share your code and data with collaborators.
● Data storage: AWS offers a wide range of services for data storage, including
Amazon S3, Amazon EBS, and Amazon EFS. These services can be used to
store large amounts of data in the cloud; see the S3 sketch after this list.
● Data processing: AWS offers a variety of services for data processing, including
Amazon EMR, Amazon Kinesis, and Amazon Redshift. These services can be
used to process data in real time or to process large amounts of data in batches.
● Machine learning: AWS offers a variety of machine learning services, including
Amazon SageMaker, Amazon Rekognition, and Amazon Lex. These services
can be used to analyze data and to build machine learning models.
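As a minimal sketch of the data storage role, the snippet below uses boto3 (the AWS SDK for Python) to upload a file to Amazon S3 and list what is stored; the bucket name and object keys are placeholders, and AWS credentials are assumed to be configured.

```python
import boto3

s3 = boto3.client("s3")
bucket = "my-bank-data-bucket"  # placeholder bucket name

# Upload a local dataset to S3 for durable, centralized storage.
s3.upload_file("training_data.csv", bucket, "raw/churn/training_data.csv")

# List the objects stored under the raw/churn/ prefix.
response = s3.list_objects_v2(Bucket=bucket, Prefix="raw/churn/")
for obj in response.get("Contents", []):
    print(obj["Key"], obj["Size"])
```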
Overall, Git and AWS complement each other when managing data: Git covers version
control and collaboration, while AWS covers data storage, data processing, and
machine learning.