Hive Commands for Beginners

The document provides an overview of common Hive commands used for interacting with Hive databases, tables, and data. Key points covered include:
- How to start the Hive shell, check HDFS contents, create databases and tables, load and query data, and drop databases and tables.
- How to create external tables in Hive, load data into different file formats like ORC and Parquet, and perform operations on tables like renaming and adding/dropping columns.
- How to query data using conditions, sorting, aggregation, joins, and views.

Uploaded by

Abdul Khaliq

Hive Illustration : Basics

➢ To get started with the hive shell,

hive

➢ To check what is present in the HDFS,

hadoop fs -ls

➢ To create a directory in the current HDFS path (let’s say the name is ‘foo’),

hadoop fs -mkdir foo

➢ To create a database, in the hive shell (let’s say the name is ‘vin_emp’),

create database vin_emp;

➢ To see existing databases,

show databases;

➢ To start using the database,

use vin_emp;

➢ To check the tables present in the database,

show tables;

➢ To come out of the hive shell,

quit;

➢ To list the contents of the current working directory (in the local Linux shell),

ls

➢ To create a directory,

mkdir myhivedata

➢ To navigate into that directory,

cd myhivedata

➢ To check the present working directory,

pwd

➢ To check the contents of a file (name of the file is employees.txt),

cat employees.txt

➢ To create a table within the hive shell,

create table emp_global(id int, name string, city string, continent string)
row format delimited
fields terminated by ','
stored as textfile;

➢ To check the tables,

show tables;

➢ To query the table,

select * from emp_global;

➢ To load data from a local file into the table,

load data local inpath 'employees.txt' into table emp_global;

➢ To drop a database together with all the tables inside it,

drop database vin_emp cascade;

➢ To know the schema of the table,

describe emp_global;

➢ To drop a table,

drop table emp_global;

Hive Illustration : External tables in hive

➢ To create a database in a certain desired location,

create database vin_emp_loc location '/user/cloudera/myhivedata';

➢ To copy a file from the local file system to an HDFS location,

hadoop fs -put empglobal.csv empdata

➢ To see the contents of the file,

hadoop fs -cat empdata/empglobal.csv
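
With the file in HDFS, an external table can be declared over the directory that holds it. The statement below is a minimal sketch rather than the original author's command: the table name emp_global_ext is hypothetical, the columns are assumed to match emp_global, and the location assumes that the relative path empdata resolves to /user/cloudera/empdata (the HDFS home directory of a user named cloudera).

```sql
-- Sketch only: emp_global_ext and the location path are assumptions.
create external table if not exists emp_global_ext (
  id int,
  name string,
  city string,
  continent string)
row format delimited
fields terminated by ','
stored as textfile
location '/user/cloudera/empdata';
```

Dropping an external table removes only its metadata; the files under the location stay in HDFS.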

Hive Illustration : Loading different file formats

➢ To find out whether a table is internal (managed) or external,

describe extended emp_global;

➢ To load the data into an ORC table,

insert into table emp_global_orc select * from emp_global;
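
The insert statement above assumes that emp_global_orc already exists. A minimal sketch of creating it, assuming it should carry the same four columns as emp_global:

```sql
-- Sketch only: the column list is assumed to mirror emp_global.
create table if not exists emp_global_orc (
  id int,
  name string,
  city string,
  continent string)
stored as orc;
```

The load data command only moves files into the table directory without converting them, which is why a text file reaches an ORC table through insert ... select from a text-format table.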

➢ To create a table whose schema is exactly like that of an existing table,

create table emp_global_seq LIKE emp_global_orc stored as sequencefile;

Hive Illustration : Loading data into Hive tables

➢ To create a table only if no table of the same name exists, and to store multiple values in a single column using an array,

create table if not exists sibling_data (
name string, age int, country string, siblings array<string>)
row format delimited
fields terminated by ','
collection items terminated by '#'
lines terminated by '\n'
stored as textfile;
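
As an illustration of how the delimiters map onto the array column, a hypothetical input line such as Asha,30,India,Ravi#Meena would load as one row whose siblings column holds two elements. The queries below are a sketch under that assumption; the sample values are invented for illustration.

```sql
-- Array elements are addressed by zero-based index; size() returns the element count.
select name, siblings[0] as first_sibling, size(siblings) as sibling_count
from sibling_data;

-- explode() with a lateral view produces one output row per array element.
select name, sibling
from sibling_data
lateral view explode(siblings) s as sibling;
```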

➢ To create a table with multiple values of different data types in a single column (a struct),

create table auto_details(company string, model string, fuel string,
basic_specs struct<vehicle_type:string, doors:int, gears:int>,
engine_specs struct<cc:int, bhp:double>)
row format delimited
fields terminated by ','
collection items terminated by '#';
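
Struct fields are read with dot notation. A brief sketch against the auto_details table defined above; the filter value of 4 doors is only an example:

```sql
-- Each struct member is referenced as column.field.
select company, model,
       basic_specs.vehicle_type,
       engine_specs.bhp
from auto_details
where basic_specs.doors = 4;
```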

Hive Illustration : Simple Operations on Hive tables

➢ To rename an existing table,
alter table auto_details rename to auto_table;
➢ To change the name of any column (the table is called auto_table after the rename above),
alter table auto_table change fuel fuel_type string;
➢ To add a new column to an existing table,
alter table auto_table add columns (mileage double);
➢ To drop columns (list the columns that should remain inside the brackets after the “replace columns” keywords),
alter table auto_table replace columns (company string, model string, fuel_type string);

Hive Illustration : Query Operations on Hive tables

➢ To create a table inside a desired pre-existing database, without navigating into the database first,
create table if not exists company.empdata (
empid int,
empname string,
salary double,
designation string,
department string,
age int)
row format delimited
fields terminated by ','
lines terminated by '\n'
tblproperties('skip.header.line.count' = '1');

➢ To select all columns and only those rows which satisfy a certain condition,
select * from empdata where department = 'HR';
➢ To select all columns and only those rows which satisfy more than one condition,
select * from empdata where department = 'HR' and salary > 25000;
➢ To select only desired columns and only those rows which satisfy more than one condition,
select empname, age from empdata where department = 'HR' and salary > 25000;
➢ To select all columns and sort the rows based on a desired column,
select * from empdata order by salary;
➢ To select all columns and sort the rows based on a desired column in descending order,
select * from empdata order by salary desc;
➢ To count the total number of rows in the dataset,
select count(*) from empdata;
➢ To use ‘group by’ to count the number of rows in each category of a certain column,
select department, count(*) from empdata group by department;
➢ To select all columns but only those rows which do not have a null value in a desired column,
select * from empdata where salary is not null;
➢ To select rows by matching a substring against a desired column value (rlike takes a Java regular expression, so alternatives are separated by |),
select * from empdata where designation rlike 'Manager|manager|Lead';
➢ To find the average of a desired numerical column, grouped by a categorical column,
select department, avg(salary) from empdata group by department;
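
The individual clauses above can be combined in a single statement. The query below is a sketch that reuses the empdata columns; the aliases headcount and avg_salary are names chosen here for illustration.

```sql
-- Filter rows first, aggregate per department, then sort by the aggregate.
select department,
       count(*) as headcount,
       avg(salary) as avg_salary
from empdata
where salary is not null
group by department
order by avg_salary desc;
```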

Hive Illustration : Querying complex structures

➢ To turn off automatic conversion of joins into map joins (joins run either way; this makes Hive use ordinary shuffle joins),
set hive.auto.convert.join = false;
➢ To perform a join operation,
select emp.empname, emp.salary from emp_epf pf join empdata emp
on (pf.empid = emp.empid);
➢ To perform a left outer join operation,
select emp.empname, emp.salary from emp_epf pf left outer join empdata emp
on (pf.empid = emp.empid);
➢ To perform a right outer join operation,
select emp.empname, emp.salary from emp_epf pf right outer join empdata emp
on (pf.empid = emp.empid);
➢ To perform a full outer join operation,
select emp.empname, emp.salary from emp_epf pf full outer join empdata emp
on (pf.empid = emp.empid);
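
One way to see how the outer joins differ is to look at the rows that have no partner on the other side. The sketch below assumes only what the joins above already use, namely that both emp_epf and empdata have an empid column; the aliases pf_empid and emp_empid are chosen here for illustration.

```sql
-- In a full outer join, the side without a match is filled with nulls,
-- so filtering on a null key isolates the unmatched rows from either table.
select pf.empid as pf_empid,
       emp.empid as emp_empid,
       emp.empname
from emp_epf pf
full outer join empdata emp
on (pf.empid = emp.empid)
where pf.empid is null or emp.empid is null;
```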

Hive Illustration : Views

➢ To create a view,
create view if not exists high_sal as select * from empdata where salary > 50000;
➢ To query data from the view,
select * from high_sal;
➢ To see if the view is created,
show tables;
➢ To see the table type (VIRTUAL_VIEW for a view, MANAGED_TABLE for a managed table),
describe formatted high_sal;
➢ To create a table partitioned by a desired column,
create table emp_global_part(id int, name string, city string, country string)
partitioned by (continent string)
row format delimited
fields terminated by ','
stored as textfile;
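
A partitioned table is usually filled one partition at a time. The statements below are a sketch: the file name employees_asia.txt and the partition value 'Asia' are assumptions, and the file is assumed to contain only the four non-partition columns (id, name, city, country).

```sql
-- Sketch only: the file name and the partition value are assumptions.
load data local inpath 'employees_asia.txt'
into table emp_global_part partition (continent = 'Asia');

-- List the partitions that now exist.
show partitions emp_global_part;

-- Filtering on the partition column lets Hive read only the matching partition.
select id, name, city, country
from emp_global_part
where continent = 'Asia';
```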
