Pig
For online Hadoop training, send mail to [email protected]
Agenda
Download Pig tar.gz file
Extract the content of Pig tar.gz
Configure pig-env.sh file
Configure pig.properties file
Start your Hadoop
Start Pig shell
Input file for Pig query
Access HDFS from Pig shell
Execute Pig commands
Store Pig query's output into HDFS
Check the output
Comparison of HBase/Hive/Pig
Download Pig from Apache website
www.apache.org/dyn/closer.cgi/pig
Select a stable version of Pig
Click on pig-0.11.0.tar.gz
Save pig-0.11.0.tar.gz file
Untar pig-0.11.0.tar.gz file
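For example, assuming the 0.11.0 release and a working directory of /home/neeraj/local_cluster_home (the mirror URL and local paths are illustrative):
cd /home/neeraj/local_cluster_home
wget http://archive.apache.org/dist/pig/pig-0.11.0/pig-0.11.0.tar.gz   # any Apache mirror works
tar -xzf pig-0.11.0.tar.gz                                             # creates pig-0.11.0/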
Configure pig-env.sh file
Create pig-env.sh file in PIG_HOME/conf
Add the following entries in PIG_HOME/conf/pig-env.sh file
export JAVA_HOME=/usr
export PIG_HOME=/home/neeraj/local_cluster_home/pig-0.11.0
export HADOOP_HOME=/home/neeraj/local_cluster_home/hadoop-1.0.3
export PIG_CLASSPATH=$HADOOP_HOME/conf/
Configure pig.properties file
Add the following entries in PIG_HOME/conf/pig.properties file
fs.default.name=hdfs://localhost:9000
mapred.job.tracker=localhost:9001
Copy core-site.xml, hdfs-site.xml & mapred-site.xml files from HADOOP_HOME/conf to PIG_HOME/conf
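Assuming HADOOP_HOME and PIG_HOME are exported as in pig-env.sh above, a sketch of the copy:
cp $HADOOP_HOME/conf/core-site.xml   $PIG_HOME/conf/
cp $HADOOP_HOME/conf/hdfs-site.xml   $PIG_HOME/conf/
cp $HADOOP_HOME/conf/mapred-site.xml $PIG_HOME/conf/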
Start your Hadoop
Check Hadoop processes & safe mode
Make sure that safe mode is off before you start Pig
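A minimal sketch for a Hadoop 1.x single-node setup (script names assume the HADOOP_HOME shown earlier):
$HADOOP_HOME/bin/start-all.sh        # starts NameNode, DataNode, SecondaryNameNode, JobTracker, TaskTracker
jps                                  # verify that all five daemons are running
hadoop dfsadmin -safemode get        # should report: Safe mode is OFF
hadoop dfsadmin -safemode leave      # leave safe mode manually if it is still ON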
Start Pig shell
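With PIG_CLASSPATH pointing at the Hadoop conf directory, the Grunt shell can be started in MapReduce mode; local mode is shown only as an alternative for quick tests:
$PIG_HOME/bin/pig            # MapReduce mode, connects to the cluster configured above
$PIG_HOME/bin/pig -x local   # local mode, runs against the local file system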
Input file for Pig
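The queries below assume a tab-separated file with one year and one temperature per line (9999 is filtered out as a missing reading in the queries below); the sample rows are illustrative:
1950    22
1950    9999
1951    31
Copy it into HDFS under the path used in the queries:
hadoop fs -mkdir /pig_input_files
hadoop fs -put temprature.txt /pig_input_files/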
Access HDFS from Pig shell
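From the Grunt prompt, HDFS can be browsed directly with fs commands, for example:
grunt> fs -ls /pig_input_files
grunt> fs -cat /pig_input_files/temprature.txt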
Execute Pig query
records = LOAD '/pig_input_files/temprature.txt' AS (year:chararray, temperature:int);
filtered_records = FILTER records BY temperature != 9999;
grouped_records = GROUP filtered_records BY year;
max_temp = FOREACH grouped_records GENERATE group, MAX(filtered_records.temperature);
DUMP max_temp;
Execute Pig query
records = LOAD '/pig_input_files/temprature.txt' AS (year:chararray, temperature:int);
filtered_records = FILTER records BY temperature != 9999;
grouped_records = GROUP filtered_records BY year;
max_temp = FOREACH grouped_records GENERATE group, MAX(filtered_records.temperature);
STORE max_temp INTO '/pig_output_files';
Pig job details
Output of Pig query
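STORE writes one or more part files under the output directory; they can be inspected from the command line (the part file name may differ on your cluster):
hadoop fs -ls /pig_output_files
hadoop fs -cat /pig_output_files/part-r-00000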
Exit from Pig shell
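From the Grunt prompt:
grunt> quit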
HBase/Hive/Pig
HBase/Hive/Pig suitability
HBase is suitable when...
When you need to handle unstructured data
When you need to edit the data
When you need versioned data
Hive is suitable when...
When you need to handle structured data
When you don't need to edit the data
When you are comfortable with SQL syntax
Pig is suitable when...
When you need to handle structured data
When you don't need to edit the data
When you are comfortable with scripting
…Thanks…
For online Hadoop training, send mail to [email protected]