0% found this document useful (0 votes)

2 views5 pages

Big Data Technology

The document discusses the various types of NoSQL databases and their suitability for different data structures, emphasizing their flexibility and scalability compared to traditional SQL databases. It provides real-world examples, such as e-commerce platforms like Amazon benefiting from NoSQL databases for dynamic data, while banking systems require the reliability of relational databases. Additionally, it highlights the advantages of MapReduce in big data analytics, particularly its scalability and fault tolerance, along with an example of its application in log analysis.

Uploaded by

wambuadominic025

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

2 views5 pages

Big Data Technology

Uploaded by

wambuadominic025

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 5

1

Big Data Technology

Student’s Name

Institutional Affiliation

Professor’s Name

Course Names

Submission Due Date

Big Data Technology

Why do we have many different NoSQL database types?

We have many different NoSQL database types because modern data comes in various

forms and structures, and a single database model cannot efficiently accommodate all use cases.

Basic relational databases optimize data that has a predefined schema. But big data—semi

structured or unstructured—has become very prevalent and developers and organizations need

more flexible, scalable and high-performance alternatives. The four categories of NoSQL

databases are document stores, key value stores, column family stores, and graph databases.

Each of these is aimed at meeting a particular data storage and retrieval necessity (Tripathi,

2025). For instance, MongoDB works very well when you are working with semi structured data

such as JSON documents, where as when you are working with relationship data in social

networks or recommendation engine, graph databases such as Neo4j are good. NoSQL types are

so diverse that organizational needs result in the usage of a best fit solution suitable for their

peculiar data architecture, performance and scalability requirements.

Can you provide a real-world example of where a NoSQL database is more suitable than a

SQL database and why?

A real-world example of a scenario where a NoSQL database is more suitable than a SQL

database is an e-commerce website such as Amazon. In such an application, information on the

product is very different depending on categories. Since it supports storing dynamic, schema-less

documents, it’s great to use a document-oriented NoSQL database such as MongoDB. Suppose

the book entry has fields like “author” and “ISBN”, but the shoe entry has fields such as “size”

and “color”. When trying to represent this in a relational database, the tables would be sparse,
3

consisting of many null values or it would require complex table designs. In addition to this,

NoSQL databases allow Amazon to scale horizontally to deal with great volumes of concurrent

users and massive data throughput. This is because this scalability and flexibility make NoSQL

the better choice for this scenario.

Can you provide a real-world example of where a relational database is more suitable than

a nonrelational database and why?

A real-world example where a relational database is more suitable is in banking systems.

Banks need to ensure that financial transactions are consistent, reliable, and meet ACID

(Atomicity, Consistency, Isolation, Durability) compliance. The other reason is that SQL based

relational databases such as Oracle or PostgreSQL are better fit for this purpose since they imply

strong schema definition, data integrity and transactional reliability (Tripathi, 2025). For

example, the system is responsible for ensuring that a debit and a corresponding credit operation

are completed successfully and in a single step when a user transfers money from one account to

another. There were major financial discrepancies, but any inconsistency would cause them. For

mission critical applications such as banking, eventual consistency is unacceptable, however,

nonrelational databases are more scalable.

In your opinion, what are the top two advantages of using MapReduce in big data

analytics?

In my opinion, the top two advantages of using MapReduce in big data analytics are its

scalability and fault tolerance. First, scalability is crucial because MapReduce allows the

processing of vast amounts of data across distributed computing resources. It reduces the size of

a problem by dividing it into separate chunks, processes them in parallel, and aggregates the
4

results just right (Abdalla et al., 2025). This makes it possible for organizations to conveniently

deal with terabytes or even petabytes of data. Second, the MapReduce framework is built with

fault tolerance. In case any node in the cluster goes down, system will automatically

reassignments the task to another node and this will not disturb the process of data processing.

MapReduce is highly reliable and therefore can be used in production for large scale data

analysis.

Can you provide an example of how MapRedcue enables sequential programming on a

computing cluster?

An example of how MapReduce enables sequential programming on a computing cluster

is in log analysis for a web application. To count the number of hits per URL for a server logs,

suppose a company desired this analysis. In the "Map" phase, the log files are read separately in

different nodes, and each URL along with count of 1 is extracted from each (Abdalla et al.,

2025). The "Reduce" phase sums all acts of each unique URL across the cluster. Map, shuffle,

reduce, is a logical, sequential flow of process, but these steps are run in parallel across many

machines. With the abstraction, simple sequential code can be written and the developers do not

need to explicitly manage complex parallelism.

References

Abdalla, H. B., Kumar, Y., Zhao, Y., & Tosi, D. (2025). A Comprehensive Survey of MapReduce

Models for Processing Big Data. Big Data and Cognitive Computing, 9(4), 77.

https://www.mdpi.com/2504-2289/9/4/77

Tripathi, N. (2025). NoSQL database education: A review of models, tools and teaching methods.

Journal of Systems and Software, 112391.

https://oulurepo.oulu.fi/bitstream/handle/10024/55103/nbnfioulu-202504142608.pdf?

sequence=1

Full Stack UNIT3
No ratings yet
Full Stack UNIT3
57 pages
Unit 3
No ratings yet
Unit 3
28 pages
Unit II Nosql Data Management
No ratings yet
Unit II Nosql Data Management
57 pages
NoSQL Databases
No ratings yet
NoSQL Databases
10 pages
NoSQL Technologies Notes Unit 1
100% (1)
NoSQL Technologies Notes Unit 1
20 pages
Unit - 2
No ratings yet
Unit - 2
70 pages
Windows Server 2008 Environment
100% (6)
Windows Server 2008 Environment
582 pages
Nosql
No ratings yet
Nosql
64 pages
Unit V Big Data Frameworks
No ratings yet
Unit V Big Data Frameworks
42 pages
Sample 3
No ratings yet
Sample 3
30 pages
777 1651399819 BD Module 5
No ratings yet
777 1651399819 BD Module 5
75 pages
Unit 6
No ratings yet
Unit 6
143 pages
NOSQL
No ratings yet
NOSQL
15 pages
BDA Module 3
No ratings yet
BDA Module 3
27 pages
BDA Unit2 Complete
No ratings yet
BDA Unit2 Complete
56 pages
3.1 Introduction To NoSQL
No ratings yet
3.1 Introduction To NoSQL
10 pages
Unit 5
No ratings yet
Unit 5
137 pages
6.unit 2 Bda
No ratings yet
6.unit 2 Bda
50 pages
SQL Made Easy
No ratings yet
SQL Made Easy
161 pages
Unit 4-1
No ratings yet
Unit 4-1
21 pages
PO Correction and Return Process in Oracle EBS
100% (1)
PO Correction and Return Process in Oracle EBS
11 pages
NoSQL Database
No ratings yet
NoSQL Database
8 pages
Unit Ii - Nosql Databases
No ratings yet
Unit Ii - Nosql Databases
112 pages
NoSQL Database
No ratings yet
NoSQL Database
64 pages
UNIT 5 NoSql DBMS Notes
No ratings yet
UNIT 5 NoSql DBMS Notes
19 pages
Unit 2 Bda
No ratings yet
Unit 2 Bda
28 pages
Big Data Notes
No ratings yet
Big Data Notes
18 pages
Unit No 1
No ratings yet
Unit No 1
34 pages
Cassandra: Types of Nosql Databases
No ratings yet
Cassandra: Types of Nosql Databases
6 pages
Unit Iii
No ratings yet
Unit Iii
22 pages
AGENCY - APP - mPhilGEPS Training Presentation
No ratings yet
AGENCY - APP - mPhilGEPS Training Presentation
24 pages
Fundamentals of Data Structures: Technical
No ratings yet
Fundamentals of Data Structures: Technical
404 pages
41 NoSQL Introduction
No ratings yet
41 NoSQL Introduction
18 pages
NoSQL DATABASE-B
No ratings yet
NoSQL DATABASE-B
4 pages
Unit II - BIG DATA ANALYTICS
No ratings yet
Unit II - BIG DATA ANALYTICS
11 pages
Advanced Googling
100% (1)
Advanced Googling
10 pages
Snowflake 101 - For Data Architects - LinkedIn
No ratings yet
Snowflake 101 - For Data Architects - LinkedIn
17 pages
NOs QL
No ratings yet
NOs QL
14 pages
NOSQL
No ratings yet
NOSQL
25 pages
Big Data Bhag 4 Changes
No ratings yet
Big Data Bhag 4 Changes
26 pages
No SQL - Types, CAP Theorem
No ratings yet
No SQL - Types, CAP Theorem
12 pages
Core Python
No ratings yet
Core Python
11 pages
Nosql Database: Nosql Databases Are Generally Classified Into Four Main Categories
No ratings yet
Nosql Database: Nosql Databases Are Generally Classified Into Four Main Categories
11 pages
995-5 BGM ISA Mod-2
No ratings yet
995-5 BGM ISA Mod-2
149 pages
BDA CW Chapter 3
No ratings yet
BDA CW Chapter 3
9 pages
NoSQL Group1
No ratings yet
NoSQL Group1
15 pages
Introduction To NoSQL
No ratings yet
Introduction To NoSQL
12 pages
NoSQL Notes
No ratings yet
NoSQL Notes
11 pages
No SQL
No ratings yet
No SQL
12 pages
Unit 3
No ratings yet
Unit 3
10 pages
Introduction To Nosql: What Is A Nosql Database Used For?
No ratings yet
Introduction To Nosql: What Is A Nosql Database Used For?
6 pages
Tools For Data Science
No ratings yet
Tools For Data Science
16 pages
NOSQL Concept 2
No ratings yet
NOSQL Concept 2
4 pages
Website: Vce To PDF Converter: Facebook: Twitter:: Aca-Cloud1.Vceplus - Premium.Exam.50Q
100% (1)
Website: Vce To PDF Converter: Facebook: Twitter:: Aca-Cloud1.Vceplus - Premium.Exam.50Q
15 pages
KsignSecureDB (Plug in SPIN) V1.5
No ratings yet
KsignSecureDB (Plug in SPIN) V1.5
32 pages
Checkmarx Static Application Security Testing (SAST) : Software Security Is Now A Boardroom Issue
No ratings yet
Checkmarx Static Application Security Testing (SAST) : Software Security Is Now A Boardroom Issue
2 pages
Users Guide-Integration Manager
No ratings yet
Users Guide-Integration Manager
74 pages
Deployment Guide Series IBM Tivoli Security Compliance Manager Sg246450
No ratings yet
Deployment Guide Series IBM Tivoli Security Compliance Manager Sg246450
214 pages
Nosql Databases
No ratings yet
Nosql Databases
2 pages
Introduction To NoSQL
No ratings yet
Introduction To NoSQL
1 page
Microsoft Dynamics 365 Business Central ERP Partner Vancouver
No ratings yet
Microsoft Dynamics 365 Business Central ERP Partner Vancouver
7 pages
Windows Registry: A Complete Guide To Examining The Windows Registry
No ratings yet
Windows Registry: A Complete Guide To Examining The Windows Registry
11 pages
Communication Management
No ratings yet
Communication Management
20 pages
Microservices White Paper
No ratings yet
Microservices White Paper
10 pages
System Software
No ratings yet
System Software
24 pages
Forte For Java, Enterprise Edition, 3.0
No ratings yet
Forte For Java, Enterprise Edition, 3.0
124 pages
Unit Ii Javascript Function
No ratings yet
Unit Ii Javascript Function
14 pages
Supplier Librarian: Brings Books T
No ratings yet
Supplier Librarian: Brings Books T
3 pages
Quiz 6
No ratings yet
Quiz 6
2 pages
System Design: Structure of Design Document
No ratings yet
System Design: Structure of Design Document
5 pages
Modified Python Lesson Plan
No ratings yet
Modified Python Lesson Plan
3 pages
SourceBreaker - Khizer-Sohail-21470261-cv-library
No ratings yet
SourceBreaker - Khizer-Sohail-21470261-cv-library
3 pages
Azure GDPR
No ratings yet
Azure GDPR
2 pages
Nosql Database
No ratings yet
Nosql Database
8 pages
Resume: Chandni
No ratings yet
Resume: Chandni
3 pages
UNIT No. 1 Introduction To Software and Software Engineering
No ratings yet
UNIT No. 1 Introduction To Software and Software Engineering
2 pages
SQL Demystified: A Beginner's Roadmap to Data Retrieval and Management
From Everand
SQL Demystified: A Beginner's Roadmap to Data Retrieval and Management
Kaushal Mehta
No ratings yet
Advanced SQL Queries: Writing Efficient Code for Big Data
From Everand
Advanced SQL Queries: Writing Efficient Code for Big Data
Robert Johnson
5/5 (2)
Database And Computer Management: SERIES 1, #3
From Everand
Database And Computer Management: SERIES 1, #3
Elias Mutegi
No ratings yet
SQL and NoSQL: Building Hybrid Data Solutions for Modern Applications
From Everand
SQL and NoSQL: Building Hybrid Data Solutions for Modern Applications
Robert Johnson
No ratings yet
DBA's Guide to NoSQL
From Everand
DBA's Guide to NoSQL
The Enlightened DBA
5/5 (1)
Iceberg Table Formats and Analytics: Definitive Reference for Developers and Engineers
From Everand
Iceberg Table Formats and Analytics: Definitive Reference for Developers and Engineers
Richard Johnson
No ratings yet
Efficient Data Querying with Drill: Definitive Reference for Developers and Engineers
From Everand
Efficient Data Querying with Drill: Definitive Reference for Developers and Engineers
Richard Johnson
No ratings yet
DBMS MASTER: Become Pro in Database Management System
From Everand
DBMS MASTER: Become Pro in Database Management System
Ummed Singh
No ratings yet
Efficient Parallel Computing with Dask: Definitive Reference for Developers and Engineers
From Everand
Efficient Parallel Computing with Dask: Definitive Reference for Developers and Engineers
Richard Johnson
No ratings yet
Redshift Essentials: Definitive Reference for Developers and Engineers
From Everand
Redshift Essentials: Definitive Reference for Developers and Engineers
Richard Johnson
No ratings yet
Sqoop Essentials: Definitive Reference for Developers and Engineers
From Everand
Sqoop Essentials: Definitive Reference for Developers and Engineers
Richard Johnson
No ratings yet
PrestoDB in Practice: Definitive Reference for Developers and Engineers
From Everand
PrestoDB in Practice: Definitive Reference for Developers and Engineers
Richard Johnson
No ratings yet
Couchbase Essentials: Definitive Reference for Developers and Engineers
From Everand
Couchbase Essentials: Definitive Reference for Developers and Engineers
Richard Johnson
No ratings yet
The DynamoDB Handbook: Practical Solutions for Modern NoSQL Database Management
From Everand
The DynamoDB Handbook: Practical Solutions for Modern NoSQL Database Management
Robert Johnson
No ratings yet

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

Big Data Technology

Uploaded by

Big Data Technology

Uploaded by

1

Big Data Technology

Submission Due Date

Big Data Technology

Why do we have many different NoSQL database types?

peculiar data architecture, performance and scalability requirements.

SQL database and why?

database is an e-commerce website such as Amazon. In such an application, information on the

the better choice for this scenario.

a nonrelational database and why?

A real-world example where a relational database is more suitable is in banking systems.

mission critical applications such as banking, eventual consistency is unacceptable, however,

nonrelational databases are more scalable.

Can you provide an example of how MapRedcue enables sequential programming on a

An example of how MapReduce enables sequential programming on a computing cluster

need to explicitly manage complex parallelism.

Journal of Systems and Software, 112391.

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.