Ex-8-hive.pptx

Hive Installation
 Hive is a data warehouse software project built on
top of Hadoop that facilitates reading, writing, and
managing large datasets residing in distributed
storage using SQL.

Download Hive 2.1.0
https://archive.apache.org/dist/hive/hive-2.1.0/
Download Derby Metastore 10.12.1.1
https://archive.apache.org/dist/db/derby/db-derby-10.12.1.1/

Download hive-site.xml
https://github.com/apache/hive/blob/master/data/conf/hive-site.xml
https://drive.google.com/file/d/1qqAo7RQfr5Q6O-GTom6Rji3TdufP81zd/view

STEP - 1: Extract the Hive file
 Extract the file apache-hive-2.1.0-bin.tar.gz and place it
under "D:\Hive" (you can use any preferred location).
[1] Extraction produces another tar file.

[2] Go inside the apache-hive-2.1.0-bin.tar folder and extract again.

[3] Move the leaf folder "apache-hive-2.1.0-bin" to the root
folder "D:\Hive" and remove all other files and folders.
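
The extraction in this step can also be scripted. The sketch below is only an illustration, not part of the original slides: it uses Python's standard tarfile module, which unpacks the .tar.gz in a single pass (so the intermediate .tar file does not appear). The function name extract_archive is hypothetical, and the paths in the usage section simply repeat the ones from the slides.

```python
import tarfile
from pathlib import Path


def extract_archive(archive: Path, dest: Path) -> list[str]:
    """Extract a .tar.gz archive into dest and return its top-level entry names."""
    dest.mkdir(parents=True, exist_ok=True)
    with tarfile.open(archive, "r:gz") as tar:
        names = tar.getnames()
        tar.extractall(dest)
    # Keep only the first path component, e.g. "apache-hive-2.1.0-bin".
    return sorted({Path(n).parts[0] for n in names if Path(n).parts})


if __name__ == "__main__":
    # Paths follow the slides; adjust them to your own download location.
    top_level = extract_archive(
        Path(r"D:\Hive\apache-hive-2.1.0-bin.tar.gz"), Path(r"D:\Hive")
    )
    print("Extracted:", top_level)
```

The same function should work for the Derby archive in the next step if it is given the Derby file name and target folder.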

STEP - 2: Extract the Derby file
 Similar to Hive, extract the file db-derby-10.12.1.1-bin.tar.gz and place it
under "D:\Derby" (you can use any preferred location).

STEP - 3: Moving hive-site.xml file
 Copy the downloaded file "hive-site.xml" to the Hive
configuration location "D:\Hive\apache-hive-2.1.0-bin\conf".

STEP - 4: Moving Derby libraries
Next, copy all Derby libraries to the Hive library location:
[1] Go to the library folder under the Derby location, D:\Derby\db-derby-10.12.1.1-bin\lib.

[2] Select all and copy all libraries.
[3] Go to the library folder under the Hive location, D:\Hive\apache-hive-2.1.0-bin\lib.

[4] Paste all the copied libraries here.
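
The copy in this step can be automated as well. The following is a minimal sketch under two assumptions: the function name copy_derby_jars is hypothetical, and it copies only the *.jar files, whereas the slides copy everything in the folder. Adjust the pattern if you need the other files too.

```python
import shutil
from pathlib import Path


def copy_derby_jars(derby_lib: Path, hive_lib: Path) -> list[str]:
    """Copy every .jar from Derby's lib folder into Hive's lib folder."""
    hive_lib.mkdir(parents=True, exist_ok=True)
    copied = []
    for jar in sorted(derby_lib.glob("*.jar")):
        # copy2 keeps timestamps and overwrites a file with the same name.
        shutil.copy2(jar, hive_lib / jar.name)
        copied.append(jar.name)
    return copied


if __name__ == "__main__":
    # Folder locations as used in the slides.
    names = copy_derby_jars(
        Path(r"D:\Derby\db-derby-10.12.1.1-bin\lib"),
        Path(r"D:\Hive\apache-hive-2.1.0-bin\lib"),
    )
    print(f"Copied {len(names)} jar files")
```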

STEP - 5: Configure Environment variables
Set the following environment variables (User Variables) on
Windows 10:
 HIVE_HOME - D:\Hive\apache-hive-2.1.0-bin
 HIVE_BIN - D:\Hive\apache-hive-2.1.0-bin\bin
 HIVE_LIB - D:\Hive\apache-hive-2.1.0-bin\lib
 DERBY_HOME - D:\Derby\db-derby-10.12.1.1-bin
 HADOOP_USER_CLASSPATH_FIRST - true

This PC -> Right Click -> Properties -> Advanced System Settings ->
Advanced -> Environment Variables

STEP - 6: Configure System variables
Next, set the System variables, including the Hive bin directory path:
 HADOOP_USER_CLASSPATH_FIRST - true
 Variable: Path
 Value:
 D:\Hive\apache-hive-2.1.0-bin\bin
 D:\Derby\db-derby-10.12.1.1-bin\bin
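
A typo in one of these variables is a likely cause of later failures, so it may help to check them from a freshly opened command prompt. The sketch below is an illustration only: check_environment and the two constants are hypothetical names, and the expected values are the ones listed in Steps 5 and 6.

```python
import os

# Values taken from the slides; change them if you installed elsewhere.
EXPECTED_VARIABLES = {
    "HIVE_HOME": r"D:\Hive\apache-hive-2.1.0-bin",
    "HIVE_BIN": r"D:\Hive\apache-hive-2.1.0-bin\bin",
    "HIVE_LIB": r"D:\Hive\apache-hive-2.1.0-bin\lib",
    "DERBY_HOME": r"D:\Derby\db-derby-10.12.1.1-bin",
    "HADOOP_USER_CLASSPATH_FIRST": "true",
}
REQUIRED_PATH_ENTRIES = [
    r"D:\Hive\apache-hive-2.1.0-bin\bin",
    r"D:\Derby\db-derby-10.12.1.1-bin\bin",
]


def _normalize(value):
    """Compare values without trailing slashes and without regard to case."""
    return value.strip().rstrip("\\/").lower()


def check_environment(env):
    """Return a dict of problems; an empty dict means everything matches."""
    problems = {}
    for name, expected in EXPECTED_VARIABLES.items():
        actual = env.get(name)
        if actual is None:
            problems[name] = "not set"
        elif _normalize(actual) != _normalize(expected):
            problems[name] = f"expected {expected}, found {actual}"
    # Windows separates Path entries with semicolons.
    entries = {_normalize(p) for p in env.get("PATH", "").split(";") if p.strip()}
    for entry in REQUIRED_PATH_ENTRIES:
        if _normalize(entry) not in entries:
            problems["PATH: " + entry] = "missing from Path"
    return problems


if __name__ == "__main__":
    for name, message in check_environment(os.environ).items():
        print(f"{name}: {message}")
```

Environment changes only reach command prompts opened after the variables were saved, so run the check in a new window.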

Now cross-check the Hive configuration file, hive-site.xml, for the Derby details.
[1] Edit the file D:/Hive/apache-hive-2.1.0-bin/conf/hive-site.xml, paste
the XML below, and save the file.

hive-site.xml
<configuration>
  <property>
    <name>javax.jdo.option.ConnectionURL</name>
    <value>jdbc:derby://localhost:1527/metastore_db;create=true</value>
    <description>JDBC connect string for a JDBC metastore</description>
  </property>
  <property>
    <name>javax.jdo.option.ConnectionDriverName</name>
    <value>org.apache.derby.jdbc.ClientDriver</value>
    <description>Driver class name for a JDBC metastore</description>
  </property>
  <property>
    <name>hive.server2.enable.impersonation</name>
    <description>Enable user impersonation for HiveServer2</description>
    <value>true</value>
  </property>
  <property>
    <name>hive.server2.authentication</name>
    <value>NONE</value>
    <description>Client authentication types. NONE: no authentication check. LDAP: LDAP/AD based authentication. KERBEROS: Kerberos/GSSAPI authentication. CUSTOM: Custom authentication provider (use with property hive.server2.custom.authentication.class).</description>
  </property>
  <property>
    <name>datanucleus.autoCreateTables</name>
    <value>True</value>
  </property>
</configuration>
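
After saving the file, a quick way to confirm that it is still well-formed XML and holds the Derby settings is to read it back. This is a minimal sketch using Python's standard xml.etree.ElementTree module; read_hive_properties is a hypothetical helper name, and the file path is the one used in the slides.

```python
import xml.etree.ElementTree as ET
from pathlib import Path


def read_hive_properties(xml_text: str) -> dict[str, str]:
    """Return the name/value pairs of all <property> elements in hive-site.xml."""
    root = ET.fromstring(xml_text)  # raises ParseError if the XML is malformed
    properties = {}
    for prop in root.findall("property"):
        name = prop.findtext("name")
        value = prop.findtext("value", default="")
        if name:
            properties[name.strip()] = value.strip()
    return properties


if __name__ == "__main__":
    path = Path("D:/Hive/apache-hive-2.1.0-bin/conf/hive-site.xml")
    props = read_hive_properties(path.read_text(encoding="utf-8"))
    print(props.get("javax.jdo.option.ConnectionURL"))
    print(props.get("javax.jdo.option.ConnectionDriverName"))
```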

Open a command prompt, change directory to
"D:\Hadoop\hadoop-2.8.0\sbin", and type "start-all.cmd" to
start Apache Hadoop.

It will open four cmd windows, one for each of the following
processes:
 Hadoop Datanode
 Hadoop Namenode
 Yarn Nodemanager
 Yarn Resourcemanager

The services can also be verified in a browser:
Namenode (hdfs) - http://localhost:50070
Datanode - http://localhost:50075
All Applications (cluster) - http://localhost:8088
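
The same check can be done without a browser by testing whether each port accepts a TCP connection. The sketch below is an illustration with hypothetical function names; the port numbers are the three listed above. An open port only shows that something is listening, not that the web UI is healthy.

```python
import socket

# Ports as listed in the slides.
HADOOP_PORTS = {
    "Namenode (hdfs)": 50070,
    "Datanode": 50075,
    "All Applications (cluster)": 8088,
}


def is_port_open(host: str, port: int, timeout: float = 2.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False


def check_services(host: str = "localhost") -> dict[str, bool]:
    """Check every Hadoop web port and report which ones are reachable."""
    return {name: is_port_open(host, port) for name, port in HADOOP_PORTS.items()}


if __name__ == "__main__":
    for name, reachable in check_services().items():
        print(f"{name}: {'up' if reachable else 'not reachable'}")
```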

Since the "start-all.cmd" command has been deprecated, you can
use the following commands instead, in this order:
 "start-dfs.cmd" and
 "start-yarn.cmd"

STEP - 9: Start Derby server
 After Hadoop has started successfully, change directory to
"D:\Derby\db-derby-10.12.1.1-bin\bin" and type "startNetworkServer
-h 0.0.0.0" to start the Derby server.
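
Hive reaches this server through the javax.jdo.option.ConnectionURL value in hive-site.xml, so the host and port in that URL must match where Derby is listening (1527 is Derby's default network port). The sketch below splits such a URL into its parts; parse_derby_url is a hypothetical helper that handles only the simple host:port/database form used in these slides.

```python
DERBY_DEFAULT_PORT = 1527


def parse_derby_url(url: str) -> tuple[str, int, str]:
    """Split a Derby client JDBC URL into (host, port, database name)."""
    prefix = "jdbc:derby://"
    if not url.startswith(prefix):
        raise ValueError(f"not a Derby network URL: {url}")
    rest = url[len(prefix):]
    # Attributes such as ";create=true" follow the first semicolon.
    location, _, _attributes = rest.partition(";")
    authority, _, database = location.partition("/")
    host, _, port = authority.partition(":")
    return host, int(port) if port else DERBY_DEFAULT_PORT, database


if __name__ == "__main__":
    host, port, database = parse_derby_url(
        "jdbc:derby://localhost:1527/metastore_db;create=true"
    )
    print(f"Hive will connect to {host}:{port}, database {database}")
```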

STEP - 10: Start Hive
 Once the Derby server has started and is ready to accept connections,
open a new command prompt with administrator privileges and
move to the Hive directory "D:\Hive\apache-hive-2.1.0-bin\bin".
[1] Type "jps -m" to check that NetworkServerControl is running.
[2] Type "hive" to start the Hive shell.

STEP - 11: Some hands-on activities
[1] Create Database in Hive -
CREATE DATABASE IF NOT EXISTS TRAINING;
[2] Show Database -
SHOW DATABASES;

[3] Creating Hive Tables -
CREATE TABLE IF NOT EXISTS testhive(col1 char(10), col2 char(20));
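
The statements above are typed at the Hive prompt. They can also be run non-interactively with the Hive CLI's -e option, which takes a query string. The sketch below is an illustration with hypothetical helper names; it assumes the "hive" launcher is on Path as set up in Step 6, and it was not part of the original slides.

```python
import shutil
import subprocess

# The three statements from this step.
STATEMENTS = [
    "CREATE DATABASE IF NOT EXISTS TRAINING",
    "SHOW DATABASES",
    "CREATE TABLE IF NOT EXISTS testhive(col1 char(10), col2 char(20))",
]


def build_hive_command(statements, hive_executable="hive"):
    """Join the statements into one script and wrap it in a 'hive -e' call."""
    script = "; ".join(s.strip().rstrip(";") for s in statements) + ";"
    return [hive_executable, "-e", script]


def run_statements(statements):
    """Run the statements through the Hive CLI and return the completed process."""
    executable = shutil.which("hive")  # resolves hive.cmd on Windows via PATHEXT
    if executable is None:
        raise FileNotFoundError("hive launcher not found on Path")
    return subprocess.run(
        build_hive_command(statements, executable),
        capture_output=True,
        text=True,
        check=False,
    )


if __name__ == "__main__":
    result = run_statements(STATEMENTS)
    print(result.stdout)
    print(result.stderr)
```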
