Installing Hadoop on Linux
Linux OS (Ubuntu/CentOS)
Java (JDK 8 or higher)
Prerequisites SSH (Password-less login)
Minimum 4GB RAM
sudo apt update
sudo apt install
Step 1 – openjdk-11-jdk
Install Java java -version
Set JAVA_HOME in
.bashrc
Step 2 –
Add
• sudo adduser hadoop
Hadoop • sudo usermod -aG sudo hadoop
•
User •
su - hadoop
Create user for secure operations
Step 3 – Configure
SSH
• ssh-keygen -t rsa -P ""
• cat ~/.ssh/id_rsa.pub >> ~/.ssh/authorized_keys
• ssh localhost
• Needed for Hadoop's distributed communication
Step 4 – Download Hadoop
wget
[Link]
3.3.6/[Link]
tar -xzvf [Link]
mv hadoop-3.3.6 hadoop
Step 5 – Set
Environment Variables
• Edit ~/.bashrc to add:
• export HADOOP_HOME=~/hadoop
• export
PATH=$PATH:$HADOOP_HOME/bin:$HADOOP_HO
ME/sbin
• export JAVA_HOME=/usr/lib/jvm/java-11-openjdk-
amd64
• Run: source ~/.bashrc
Edit [Link],
[Link], mapred-
Step 6 – [Link], [Link]
Configure
Core Files Set NameNode URI,
directories, and
memory configs
Step 7 – Format HDFS
HDFS NAMENODE - PREPARES HDFS METADATA
FORMAT STRUCTURE
[Link]
Step 8 –
Start [Link]
Hadoop
Daemons jps (check NameNode,
DataNode, etc.)
HDFS:
[Link]
Step 9 – YARN:
Access [Link]
Web UIs
Monitor your Hadoop
cluster
Java + SSH = Pre-
requirements
Download and Configure
Hadoop
Conclusion
Format HDFS and Start
Services
Use Web UI for monitoring