Lecture 11- Introduction to Apache Hive
Lecture 11- Introduction to Apache Hive
Apache Hive
By
Dr. Aditya Bhardwaj
aditya.bhardwaj@bennett.edu.in
Lecture 14
Lecture 11
Joins and Partitision
Introduction to in Hive
Apache Hive
Lecture 15
Practical
Demonstrations on
HQL
Hadoop vs Pig vs Hive- Quick Look at Industrial Use cases
Use Case Apache Hadoop Apache Pig Apache Hive
Hive and SQL Server are not comparable in any way other than
the similarity in the syntax of the query language.
While SQL Server is built to be able to respond in real-time
from a single machine, hive is for processing large data sets that
may span hundreds or thousands of machines.
Apache Hive is an open source project run by volunteers at the
Apache Software Foundation, used for querying, managing and
storing structured data on Hadoop.
Hive uses HQL (Hive Query Language) that lets you use SQL-
like syntax to define your map and reduce steps
Hive vs. SQL
Challenges of Hive
▪ Compared to Apache Pig, Latency for Apache Hive queries is
generally very high.
Key Summary Points on Hive
HIVE is not a database but a data analysis tool through SQL
kind of syntax.
Apache Hive converts the SQL queries into MapReduce jobs and
then submits it to the Hadoop cluster.
Reference
https://hive.apache.org/
Thanks