DATA ANALYTICS LABORATORY (21CSL66)
3. IMPLEMENT AN MR PROGRAM THAT PROCESSES A WEATHER
DATASET.
Steps to be followed:
• Step-1: Download the dataset from this link; it provides data for various
cities across different years. Choose a year of your choice and select any
one of the data text files for analysis.
Information about the data format is available in the README.txt file on the
NCEI website. Note that the mapper below assumes a fixed-width record in
which characters 6–14 hold the date and characters 39–45 and 47–53 hold the
maximum and minimum temperatures.
• Step-2: Create a project in Eclipse with the steps below:
§ Open Eclipse → select File → New → Java Project → name it
MyProject → select "Use an execution environment" → choose
JavaSE-1.8 → Next → Finish.
§ In this project, create a Java class named MyMaxMin → then
click Finish.
§ Copy the source code below into the MyMaxMin Java class.
// importing libraries
import java.io.IOException;
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.io.LongWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Job;
import org.apache.hadoop.mapreduce.Mapper;
import org.apache.hadoop.mapreduce.Reducer;
import org.apache.hadoop.mapreduce.lib.input.FileInputFormat;
import org.apache.hadoop.mapreduce.lib.input.TextInputFormat;
import org.apache.hadoop.mapreduce.lib.output.FileOutputFormat;
import org.apache.hadoop.mapreduce.lib.output.TextOutputFormat;

public class MyMaxMin {

    // Mapper
    /* MaxTemperatureMapper is a static class that extends
     * the Mapper abstract class with four Hadoop generic
     * types: LongWritable, Text, Text, Text.
     */
    public static class MaxTemperatureMapper extends
            Mapper<LongWritable, Text, Text, Text> {

        /**
         * @method map
         * This method receives one line of the input file as
         * text. It reads the date and the maximum and minimum
         * temperatures from their fixed column positions, then
         * emits days with temp_Max > 30 (hot days) and days
         * with temp_Min < 15 (cold days) to the reducer.
         */

        // records carrying this value
        // are inconsistent (missing) data
        public static final int MISSING = 9999;

        @Override
        public void map(LongWritable key, Text value, Context context)
                throws IOException, InterruptedException {

            // convert the single row (record) to a
            // String and store it in the variable line
            String line = value.toString();

            // check for an empty line
            if (!(line.length() == 0)) {

                // characters 6 to 14 hold
                // the date in our dataset
                String date = line.substring(6, 14);

                // similarly, characters 39 to 45
                // hold the maximum temperature
                float temp_Max = Float.parseFloat(line.substring(39, 45).trim());

                // and characters 47 to 53
                // hold the minimum temperature
                float temp_Min = Float.parseFloat(line.substring(47, 53).trim());

                // if the maximum temperature is
                // greater than 30, it is a hot day
                if (temp_Max > 30.0) {
                    // Hot day
                    context.write(new Text("The Day is Hot Day :" + date),
                            new Text(String.valueOf(temp_Max)));
                }

                // if the minimum temperature is
                // less than 15, it is a cold day
                if (temp_Min < 15) {
                    // Cold day
                    context.write(new Text("The Day is Cold Day :" + date),
                            new Text(String.valueOf(temp_Min)));
                }
            }
        }
    }

    // Reducer
    /* MaxTemperatureReducer is a static class that extends
     * the Reducer abstract class with four Hadoop generic
     * types: Text, Text, Text, Text.
     */
    public static class MaxTemperatureReducer extends
            Reducer<Text, Text, Text, Text> {

        /**
         * @method reduce
         * This method takes a key and the list of values for
         * that key from the mapper and writes the result to
         * the final context. Each key emitted by the mapper is
         * unique (it contains the date), so the values are
         * simply forwarded unchanged.
         */
        @Override
        public void reduce(Text key, Iterable<Text> values, Context context)
                throws IOException, InterruptedException {
            for (Text value : values) {
                context.write(key, value);
            }
        }
    }

    /**
     * @method main
     * This method sets all the configuration properties.
     * It acts as the driver for the MapReduce code.
     */
    public static void main(String[] args) throws Exception {

        // reads the default configuration of the
        // cluster from the configuration XML files
        Configuration conf = new Configuration();

        // initializing the job with the
        // default configuration of the cluster
        Job job = Job.getInstance(conf, "weather example");

        // assigning the driver class name
        job.setJarByClass(MyMaxMin.class);

        // key type coming out of the mapper
        job.setMapOutputKeyClass(Text.class);

        // value type coming out of the mapper
        job.setMapOutputValueClass(Text.class);

        // defining the mapper class name
        job.setMapperClass(MaxTemperatureMapper.class);

        // defining the reducer class name
        job.setReducerClass(MaxTemperatureReducer.class);

        // defining the input format class, which is
        // responsible for parsing the dataset
        // into key/value pairs
        job.setInputFormatClass(TextInputFormat.class);

        // defining the output format class, which
        // controls how the results are written out
        job.setOutputFormatClass(TextOutputFormat.class);

        // storing the second argument in a Path variable
        Path OutputPath = new Path(args[1]);

        // configuring the input path
        // from the filesystem into the job
        FileInputFormat.addInputPath(job, new Path(args[0]));

        // configuring the output path from
        // the filesystem into the job
        FileOutputFormat.setOutputPath(job, new Path(args[1]));

        // deleting the output path automatically from HDFS
        // so that we don't have to delete it explicitly
        OutputPath.getFileSystem(conf).delete(OutputPath, true);

        // exit with 0 if the job succeeds, 1 otherwise
        System.exit(job.waitForCompletion(true) ? 0 : 1);
    }
}
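Before packaging the project, the substring offsets can be sanity-checked outside Hadoop with a small standalone class. This is only a sketch: the sample record below is hypothetical, so paste in a real line from your downloaded file before trusting the offsets.
// OffsetCheck.java - the sample line is made up for illustration;
// replace it with a real record from your dataset
public class OffsetCheck {
    public static void main(String[] args) {
        String line = "069029" + "20200101"          // hypothetical station id + date
                + "                         "        // 25 filler characters (columns 14-38)
                + "  35.2" + "  " + "  12.4";        // max temp, gap, min temp
        System.out.println("date     : " + line.substring(6, 14));
        System.out.println("temp_Max : " + Float.parseFloat(line.substring(39, 45).trim()));
        System.out.println("temp_Min : " + Float.parseFloat(line.substring(47, 53).trim()));
    }
}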
• Step-3: Add external jars for the packages we have imported. Download the
jar packages Hadoop Common and Hadoop MapReduce Core that match your
Hadoop version.
§ Add these external jars to MyProject:
Right-click on MyProject → Build Path → Configure Build Path →
Add External JARs… → add the jars from their download location →
Apply and Close.
§ Export the project as a jar file:
Right-click on MyProject → Export… → Java → JAR file → Next →
choose your export destination → Next.
Choose the Main Class as MyMaxMin by clicking Browse → Finish → OK.
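If you prefer the command line to Eclipse, the class can also be compiled and packaged directly. This is only a sketch; it assumes the hadoop command is on your PATH and that MyMaxMin.java is in the current directory:
mkdir -p classes
javac -classpath "$(hadoop classpath)" -d classes MyMaxMin.java
jar cf Project.jar -C classes .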
• Step-4: Start the Hadoop daemons.
start-dfs.sh
start-yarn.sh
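To confirm the daemons came up, you can run jps; on a typical single-node setup it lists processes such as NameNode, DataNode, SecondaryNameNode, ResourceManager, and NodeManager (the exact set depends on your Hadoop version and configuration).
jps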
• Step-5: Move the dataset to Hadoop HDFS.
hdfs dfs -put /file_path /destination
In the command below, / denotes the root directory of our HDFS.
hdfs dfs -put /home/…./……./datasetname.txt /
hdfs dfs -ls /
• Step-6: Run your jar file with the command below to produce the output in
the MyOutput directory.
hadoop jar /jar_file_location /dataset_location_in_HDFS /output-file_name
hadoop jar /…./…./…./Project.jar /datasetname.txt /MyOutput
• Step-7: Go to localhost:50070/ (on Hadoop 3.x the NameNode web UI is at
localhost:9870 instead), select Browse the file system under Utilities, and
download part-r-00000 from the /MyOutput directory to see the result.
• Step-8: See the result in the downloaded file.
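Alternatively, the result can be read directly from the terminal without downloading it, assuming the output directory name used in Step-6:
hdfs dfs -cat /MyOutput/part-r-00000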