Hadoop 2 Single Node Installation with YARN

Pre-requisites: Before starting the Hadoop 2 single-node installation, make sure that the node has the following pre-requisites: a) Any Linux operating system. b) Sun Java 1.6 or above should already be installed, and the version should be the same across all the nodes. To install Java, you can refer to the installation steps […]
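
The Java 1.6-or-above requirement can be checked before installing anything. The following is a minimal sketch, not part of the original post: `java_version_ok` is a hypothetical helper name, and it assumes a plain numeric version string such as 1.7.0_03 (legacy scheme) or 17.0.2 (newer scheme).

```shell
#!/bin/sh
# Hypothetical helper: succeeds if the given Java version string is 1.6 or newer.
# Assumes plain numeric versions such as "1.7.0_03" or "17.0.2".
java_version_ok() {
    ver="$1"
    major="${ver%%.*}"   # text before the first dot
    rest="${ver#*.}"     # text after the first dot
    minor="${rest%%.*}"  # second version component
    if [ "$major" -eq 1 ]; then
        # Legacy numbering: 1.6, 1.7, 1.8
        [ "$minor" -ge 6 ]
    else
        # Newer numbering: 9, 11, 17, ...
        [ "$major" -gt 1 ]
    fi
}

# Example usage on a node where java is on the PATH:
#   ver=$(java -version 2>&1 | awk -F'"' '/version/ {print $2; exit}')
#   java_version_ok "$ver" && echo "Java $ver is new enough"
```

Running the same check on every node and comparing the reported version strings is one way to confirm that the version matches across the cluster.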

Build Hadoop-2.4.0 Source on Windows and Configure in Eclipse

We can now build the Hadoop source, version 2.4.0, on Windows and configure it for use in Eclipse. Follow the steps below to configure the Hadoop source on Windows. Requisites: 1. Download the Hadoop distribution hadoop-2.4.0-src.tar.gz from here. 2. 7-Zip (right-click on any folder to check if it is already installed […]

HDFS High Availability Overview

Background Single Point of Failure (SPOF) in HDFS: Each cluster had a single NameNode, and if that machine or process became unavailable, the cluster as a whole would be unavailable until the NameNode was either restarted or brought up on a separate machine. Ecosystem Dependency: The Hadoop ecosystem components like […]

Install Java in Linux

To install Java on Linux, refer to the following instructions: 1. Download the 32-bit or 64-bit compressed binary “.tar.gz” file from here. 2. Create a system directory such as /usr/lib/jvm to install the JDK, and move the tar file into that directory: sudo mkdir -p /usr/lib/jvm; sudo mv jdk-7u3-linux-x64.tar.gz /usr/lib/jvm/ 3. Change the present […]
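
Steps 1 and 2 above can be wrapped in a small script. This is only a sketch under stated assumptions: `stage_jdk` is a hypothetical helper name, the target directory defaults to /usr/lib/jvm as in the excerpt, and the script must be run as root (or through sudo) when the target is a system directory. It covers only the two steps visible in the excerpt.

```shell
#!/bin/sh
# Hypothetical helper: move a downloaded JDK archive into the install directory.
# $1 = path to the downloaded .tar.gz, $2 = target directory (default /usr/lib/jvm).
stage_jdk() {
    archive="$1"
    target="${2:-/usr/lib/jvm}"
    # Refuse to continue if the archive is missing.
    [ -f "$archive" ] || { echo "archive not found: $archive" >&2; return 1; }
    # Create the directory if needed, then move the archive into it.
    mkdir -p "$target" && mv "$archive" "$target/"
}

# Example usage (as root):
#   stage_jdk jdk-7u3-linux-x64.tar.gz
```

Checking for the archive first means a mistyped file name fails early, before any directory is created.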