Hadoop Ecosystem
Hadoop Installation on Mac
Installation
Download the latest version of hadoop binaries and extract it in local folder
On Mac you can also install with the brew command > brew install hadoop (The current version at writing was 2.7.3) This installs hadoop at /usr/local/Cellar/hadoop/2.7.3
The current JDK version was 1.8 and the java home was set up as /Library/Java/JavaVirtualMachines/jdk1.8.0_73.jdk/Contents/Home It is a good practice to set this up in .bashrc so that it could be picked up all your JVM based apps
Configuration
Edit hadoop-env.sh
SSH Mac: Enable Remote Login in System Preference -> Sharing.
ssh and check that you can ssh to the localhost without a passphrase:
Edit following config files in your Hadoop directory
Execution
Format and start HDFS and YARN > hdfs namenode -format > start-dfs.sh
Now you can browse the web interface for the NameNode at - http://localhost:50070/
Make the HDFS directories required to execute MapReduce jobs:
Start ResourceManager daemon and NodeManager daemon:
Browse the web interface for the ResourceManager at - http://localhost:8088/
Test examples code that came with the hadoop version
Examine the output files:
Copy the output files from the distributed filesystem to the local filesystem and examine them:
submit a yarn job
When you’re done, stop the daemons with:
Reference: http://zhongyaonan.com/hadoop-tutorial/setting-up-hadoop-2-6-on-mac-osx-yosemite.html