Ex-06-Pseudo-Node-Configuration-for-Hadoop-on-Ubuntu

AIM

To implement Pseudo Node configuration for Hadoop on ubuntu

Pre-requisites

a) jdk

Single-Node Configuration

Create a dedicated user account for hadoop

Install java1.8 in folder /usr/local

Install Hadoop

Set the hadoop environment variables: Include the following lines in the $HOME/.bashrc file

Set hadoop environment variables: Include the following lines /etc/profile file

Run the.bashrc & profile files from the $ prompt for updating the changes

$ bin/hadoop version

Configuration of the hadoop files: hadoop-env.sh, core-site.xml, mapred-site.xml, hdfs- site.xml and yarn-site.xml

path :: /usr/local/hadoop-2.5.1/etc/hadoop

a) hadoop-env.sh Include the following lines in hadoop-env.sh file

b) core-site.xml Configure the directory for Hadoop to store its data files, the network ports it listens to, etc. Setup will use Hadoop’s Distributed File System (HDFS-single local machine)

Include the following lines in core-site.xml file between and tags

c) mapred-site.xml

$sudo cp mapred-site.xml.template mapred-site.xml

Include the following lines in mapred-site.xml file

<property>
<name>mapreduce.framework.name</name>
<value>yarn</value>
</property>

d) hdfs-site.xml Include the following lines in hdfs-site.xml file

e) yarn-site.xml

Include the following lines in yarn-site.xml file

Format the Hadoop File system implemented on top of the local file system using

Start Hadoop using

Explore Hadoop using http://localhost:50070/ from the browser

The commonly used HDFS Commands are as follows:

Create a directory ‘/input’ in HDFS

Copy the input files into the distributed file system

Run some of the examples provided

Examine the output files Copy the output files from the distributed file system to the local file system and examine them:

View the output files on the distributed file system

$ bin/hdfs dfs -cat /output/*

Result:

Thus, the implementation of Pseudo Node configuration for Hadoop on ubuntu is successfully executed.

surekaelango / ex-06-pseudo-node-configuration-for-hadoop-on-ubuntu Goto Github PK