Installing Hadoop On Linux

This document provides a step-by-step guide for installing Hadoop on a Linux system, specifically Ubuntu or CentOS. It outlines prerequisites such as Java, SSH configuration, and user creation, followed by detailed installation steps including downloading Hadoop, setting environment variables, configuring core files, and starting Hadoop daemons. The document concludes with instructions on accessing Hadoop's web UIs for monitoring the cluster.

Uploaded by

Mahek gupta

Installing Hadoop on Linux

Prerequisites
• Linux OS (Ubuntu/CentOS)
• Java (JDK 8 or higher)
• SSH (Password-less login)
• Minimum 4GB RAM


Step 1 – Install Java
• sudo apt update
• sudo apt install openjdk-11-jdk
• java -version
• Set JAVA_HOME in .bashrc
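The JAVA_HOME value depends on where the package manager put the JDK. A minimal sketch of one way to derive it from the java binary's path follows; java_home_from_bin is a hypothetical helper name, and the approach assumes the usual <jdk>/bin/java layout.

```shell
# java_home_from_bin is a hypothetical helper, not part of any JDK tooling.
# It assumes the binary lives at <jdk>/bin/java and prints <jdk>.
java_home_from_bin() {
  local java_bin
  java_bin="$1"
  # Strip the trailing /bin/java by taking the parent directory twice
  dirname "$(dirname "$java_bin")"
}

# Possible use in ~/.bashrc (resolves the symlink chain behind `java` first):
# export JAVA_HOME="$(java_home_from_bin "$(readlink -f "$(command -v java)")")"
```

On a default Ubuntu install of openjdk-11-jdk this would typically resolve to a directory under /usr/lib/jvm, but check the result on your own machine before relying on it.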
Step 2 – Add Hadoop User
• sudo adduser hadoop
• sudo usermod -aG sudo hadoop
• su - hadoop
• Create user for secure operations
Step 3 – Configure SSH
• ssh-keygen -t rsa -P ""
• cat ~/.ssh/id_rsa.pub >> ~/.ssh/authorized_keys
• ssh localhost
• Needed for Hadoop's distributed communication
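If ssh localhost still asks for a password after the steps above, loose file permissions are a common cause, since sshd can refuse keys whose files are accessible to other users. A minimal sketch of tightening them is below; fix_ssh_perms is a hypothetical helper name and takes the .ssh directory as its argument.

```shell
# fix_ssh_perms is a hypothetical helper: restrict the SSH directory and
# the authorized_keys file to the owning user only.
fix_ssh_perms() {
  local ssh_dir
  ssh_dir="$1"
  chmod 700 "$ssh_dir"                  # owner-only access to the directory
  chmod 600 "$ssh_dir/authorized_keys"  # owner-only read/write on the key list
}

# Possible use as the hadoop user:
# fix_ssh_perms "$HOME/.ssh"
# ssh -o BatchMode=yes localhost true   # BatchMode fails instead of prompting
```

The BatchMode check is handy in scripts because it exits with an error rather than waiting at a password prompt.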
Step 4 – Download Hadoop
• wget [Link]3.3.6/[Link]
• tar -xzvf [Link]
• mv hadoop-3.3.6 hadoop
Step 5 – Set Environment Variables
• Edit ~/.bashrc to add:
• export HADOOP_HOME=~/hadoop
• export PATH=$PATH:$HADOOP_HOME/bin:$HADOOP_HOME/sbin
• export JAVA_HOME=/usr/lib/jvm/java-11-openjdk-amd64
• Run: source ~/.bashrc
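Re-running a setup script can leave duplicate export lines in ~/.bashrc. One way to avoid that, sketched here with a hypothetical append_once helper, is to add each line only when an identical line is not already present.

```shell
# append_once FILE LINE (hypothetical helper): append LINE to FILE unless
# an identical full line already exists there.
append_once() {
  local file
  local line
  file="$1"
  line="$2"
  touch "$file"
  # -x matches whole lines only, -F treats LINE as a fixed string, not a regex
  grep -qxF -- "$line" "$file" || printf '%s\n' "$line" >> "$file"
}

# Possible use (single quotes keep $PATH and $HADOOP_HOME unexpanded in the file):
# append_once ~/.bashrc 'export HADOOP_HOME=~/hadoop'
# append_once ~/.bashrc 'export PATH=$PATH:$HADOOP_HOME/bin:$HADOOP_HOME/sbin'
```

After sourcing the file, hadoop version should run from any directory if the variables are set correctly.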
Step 6 – Configure Core Files
• Edit [Link], [Link], mapred-[Link], [Link]
• Set NameNode URI, directories, and memory configs
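As an illustration of setting the NameNode URI, the sketch below writes a minimal configuration file. Several things here are assumptions rather than details from the slides: the file name core-site.xml, its usual location under $HADOOP_HOME/etc/hadoop, the helper name write_core_site, and the example URI hdfs://localhost:9000. fs.defaultFS is the Hadoop property that holds the default filesystem URI.

```shell
# write_core_site CONF_DIR NAMENODE_URI (hypothetical helper): write a
# minimal core-site.xml containing only the default filesystem URI.
write_core_site() {
  local conf_dir
  local namenode_uri
  conf_dir="$1"
  namenode_uri="$2"
  cat > "$conf_dir/core-site.xml" <<EOF
<?xml version="1.0"?>
<configuration>
  <property>
    <name>fs.defaultFS</name>
    <value>${namenode_uri}</value>
  </property>
</configuration>
EOF
}

# Possible use for a single-node setup (URI and port are assumptions):
# write_core_site "$HADOOP_HOME/etc/hadoop" "hdfs://localhost:9000"
```

This overwrites any existing file at that path, so back up the original first if it already holds settings.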
Step 7 – Format HDFS
• hdfs namenode -format
• Prepares HDFS metadata structure
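Formatting is meant to be done once, because formatting again discards the existing HDFS metadata. A minimal sketch of a guard is shown below; needs_format is a hypothetical helper, and it relies on the assumption that a formatted NameNode directory contains a current/VERSION file. The example directory is also an assumption and must match the NameNode directory set in your configuration.

```shell
# needs_format NAME_DIR (hypothetical helper): succeed only when NAME_DIR
# does not yet look like a formatted NameNode storage directory.
needs_format() {
  local name_dir
  name_dir="$1"
  [ ! -f "$name_dir/current/VERSION" ]
}

# Possible use (the path is an assumption; use your configured NameNode directory):
# needs_format "$HOME/hadoopdata/namenode" && hdfs namenode -format
```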
Step 8 – Start Hadoop Daemons
• [Link]
• [Link]
• jps (check NameNode, DataNode, etc.)
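Checking the jps output by eye is easy to get wrong, since SecondaryNameNode also contains the string NameNode. The sketch below reads jps output on standard input and prints whichever expected daemons are absent. missing_daemons is a hypothetical helper, and the daemon list assumes a single-node setup running both HDFS and YARN.

```shell
# missing_daemons (hypothetical helper): read `jps` output on stdin and
# print the name of each expected Hadoop daemon that is not listed.
missing_daemons() {
  local jps_output
  local daemon
  jps_output="$(cat)"
  for daemon in NameNode DataNode SecondaryNameNode ResourceManager NodeManager; do
    # jps prints "<pid> <class name>"; anchoring both ends stops
    # "SecondaryNameNode" from counting as "NameNode".
    printf '%s\n' "$jps_output" | grep -qE "^[0-9]+ ${daemon}\$" || printf '%s\n' "$daemon"
  done
}

# Possible use after starting the daemons:
# jps | missing_daemons
```

Empty output means every daemon in the list was found; any name printed points to a log file worth reading under the Hadoop logs directory.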
Step 9 – Access Web UIs
• HDFS: [Link]
• YARN: [Link]
• Monitor your Hadoop cluster
Conclusion
• Java + SSH = Pre-requirements
• Download and Configure Hadoop
• Format HDFS and Start Services
• Use Web UI for monitoring
