I am learning hadoop. I am trying to visualize what happen after I submit job. I mean can somebody explain step-by-step what happen when I execute command
hadoop jar example.jar WordCount test.txt output
How this Java program submitted to JobTracker. How namenode, datanode comes into picture etc.
Thanks Aniruddha
At the highest level, there are four independent entities:
The client, which submits the MapReduce job.
The jobtracker, which coordinates the job run. The jobtracker is a Java application whose main class is JobTracker.
The tasktrackers, which run the tasks that the job has been split into. Tasktrackers are Java applications whose main class is TaskTracker.
The distributed filesystem, which is used for sharing job files between the other entities.
Please go through the below link
http://answers.oreilly.com/topic/459-anatomy-of-a-mapreduce-job-run-with-hadoop/
Collected from the Internet
Please contact [email protected] to delete if infringement.
Comments