Apache Hadoop Prerequisites and Installation
The following files are needed for any Hadoop project:
Please follow the steps given in the following link for seting up Hadoop on Windows 10 machines How to set up Hadoop on Windows 10
- hadoop-hdfs-3.3.4.jar (Java Archive File)
- zookeeper-3.6.4.jar (Java Archive File)
- log4j-1.2-api-2.19.0.jar (Java Archive File)
- hadoop-mapreduce-client-core-3.3.4.jar (Java Archive File)
- hbase-0.92.1.jar (Java Archive File)
- hadoop-common-3.3.4.jar (Java Archive File)
- AN IDE(Integrated DEvelopment Environment) like Eclipse
- Click on the perspective menu in the menu bar and select Mapreduce as the perespective
After installing successfully Hadoop, we have a directory structure as given below in the hadoop-3.2.1 directory
Eclipse IDE Setup and initialization
Configuration
The binary executables to start the name node, data nodes, mapreduce sites, and HDFS filesystem are in the %HADOOP_HOME%\bin directory
Starting the server
Startiing HDFS daemons
Run the following command to start HDFS daemons. When you do so, there should be two new windows that open: one for datanode and the other for namenode:
$ %HADOOP_HOME%\sbin\start-dfs.cmd
Comments
Post a Comment