0% found this document useful (0 votes)
36 views4 pages

Part A

The document defines key big data concepts like big data, crowd sourcing analytics, open source technology, and mobile business intelligence. It also compares NoSQL and relational databases, describes types of NoSQL databases, and differences between master-slave and peer-to-peer replication. Benefits of schemaless databases and map-reduce functions are defined. Advantages of using MapReduce with Hadoop are listed as well.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
36 views4 pages

Part A

The document defines key big data concepts like big data, crowd sourcing analytics, open source technology, and mobile business intelligence. It also compares NoSQL and relational databases, describes types of NoSQL databases, and differences between master-slave and peer-to-peer replication. Benefits of schemaless databases and map-reduce functions are defined. Advantages of using MapReduce with Hadoop are listed as well.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 4

1. What is bigdata? Why we need to analyse bigdata?

 Big data is a collection of data that is huge in volume, yet growing exponentially
with time.
 It is data with so large in size and complexity with unstructured data that none
of the traditional database management tools can store it or process it efficiently

2. Define crowd sourcing analytics.

 Crowd sourcing is the collection of information, opinions or work from a group


of people, usually sourced via the internet.
 Crowd sourcing allows companies to save time and money while tapping into
people with different skills or thoughts from all over the world.

3. Define open source technology.


 Open source technologies refer to software or computer programs that have their
source code available to the public, allowing anyone to access, modify, and
distribute it.
 This means that users can see and edit the code, which can result in greater
collaboration and innovation in software development

4. Define mobile business intelligence.

 Mobile business intelligence is a type of business intelligence that enables the


access and analysis of business data on mobile devices such as smartphones and
tablets.
 It allows users to access relevant business data including key performance
indicators, reports, and dashboards from any location at any time giving them
the ability to make informed decisions on the go.
 Mobile BI also enables users to share and collaborate on data insights with
others, providing real time updates and notifications.
5. Compare and contrast NoSQL vs. Relational database.

SQL NoSQL
1. SQL databases are called 1. No SQL databases, called as Non-
Relational Databases Relational (or) Distributed database
(RDBMS). 2. No SQL are document based, key value
2. They are table based data pairs graph data bases
bases. It represents data in 3. NoSQL databases have dynamic schema
form of tables which consist of for unstructured data.
ʼn number of rows and 4. NoSQL databases are horizontally
columns. scalable. They are scaled by increasing
3. SQL databases have the database servers in the pool of
Predefined scheme resource to reduce the load.
4. SQL databases are vertically 5. In NoSQL databases queries are
scalable. They are scaled by focused on collection of documents
Increasing the horse Power of sometimes it is also called as UnQL
the hardware (Unstructured Query Language). The
5. SQL databases uses syntax of using UnQL varies from
SQL(structured query database to database.
Language) for defining and
Example:MongoDB, Cassandra, Hbase,
manipulating data
Radix
Example: MySQL, Oracle

6. What are the types of NoSQL databases?


 Key value databases They are a simpler type of database where each item
contains keys and values.
 Document databases It store data in documents similar to JSON. Each document
contains pairs of fields and values.
 Wide column stores or Column family Data stores Store data in tables, rows and
dynamic columns.
 Graph databases Store data in nodes and edges. Nodes typically store
information about people, place and things, while edges store information about
the relationships between the nodes.

7. Compare master slave replication and peer to peer replication

Master slave replication Peer to peer replication


1. In master-slave replication 1. Peer to peer replication setup
setup, there is one server the various nodes are all equals
considered as the Master 2. Any node can accept reads as
2. There are also multiple slave well as writes, and they
servers that receive updates communicate the writes to each
from the master other
3. Read can happen from either 3. One nodes failure does not
the master or the slaves. So this cause problems as the
system distributes reads across remaining nodes can continue
multiple nodes. It is however their work without losing a beat
still constrained by have writes
happen in only one place.

8. List the benefits of schemaless databases.


 Flexibility
 Scalability
 Reduced complexity
 Lack of Structure
 Data quality issues
 Performance issues
9. Define map and reduce function.
Map()
 Map task in map reduce is performed using the Map() function
 This part of MapReduce is responsible for processing one or more chunks of
data and producing the output results
Reduce()
 The next part/component/stage of the MapReduce programming model is the
Reduce() function
 This part of MapReduce is responsible for consolidating the result reduced by
each of the Map() functions/tasks.

10. What are the advantages of using MapReduce with Hadoop?


 Flexible
 Parallel Processing
 Resilient
 Scalable
 Cost effective
 Simple
 Secure
 Speed

You might also like

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy