I wrote a MapReduce wordcount job a while ago and had completely forgotten how to run it, so here is the command for the record:
hadoop jar <path to the hadoop-streaming jar> \
    -mapper mapper1.py \
    -file mapper1.py \
    -reducer reducer1.py \
    -file reducer1.py \
    -input /user/hujianqiu/wordcount/input_word/* \
    -output output_word
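The contents of mapper1.py and reducer1.py are not shown in this note. As a reminder of what a streaming wordcount pair usually looks like, here is a minimal sketch. The function names `map_lines` and `reduce_lines` are made up for illustration, and the tab-separated `word<TAB>count` format is an assumption (it is the default key/value separator in Hadoop Streaming), not necessarily what the original scripts did.

```python
from itertools import groupby


def map_lines(lines):
    """Mapper side: emit 'word<TAB>1' for every whitespace-separated token."""
    for line in lines:
        for word in line.split():
            yield f"{word}\t1"


def reduce_lines(lines):
    """Reducer side: sum the counts for each word.

    Relies on the input being sorted by key, which the shuffle phase
    provides before lines reach the reducer.
    """
    pairs = (line.rstrip("\n").split("\t", 1) for line in lines if line.strip())
    for word, group in groupby(pairs, key=lambda kv: kv[0]):
        total = sum(int(count) for _, count in group)
        yield f"{word}\t{total}"


# In a real mapper script the entry point would read stdin, for example:
#     import sys
#     for out in map_lines(sys.stdin):
#         print(out)
# and the reducer script would do the same with reduce_lines.
```

Because the command passes the bare script names to -mapper and -reducer, the scripts would also need a shebang line and the executable bit. The pair can be checked locally without Hadoop by piping the mapper output through `sort` into the reducer.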
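The jar passed to hadoop jar cannot be just any jar: it has to be the hadoop-streaming jar that ships with the local Hadoop installation. That means finding out where Hadoop is installed, which can be done with the following command: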
locate '*share/hadoop/tools/lib/hadoop-streaming*'
This quickly pins down the Hadoop installation directory, and with it the full path of the streaming jar.
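If the locate database is stale or locate is not installed, the same lookup can be done from Python once the installation directory is known. The sketch below is only an illustration: `find_streaming_jars` and `build_streaming_command` are hypothetical helpers, and the caller is assumed to supply the installation directory (for example from a HADOOP_HOME environment variable, if one is set). The share/hadoop/tools/lib layout is taken from the locate pattern above.

```python
import glob
import os


def find_streaming_jars(hadoop_home):
    """Return the hadoop-streaming jars found under the given install dir."""
    pattern = os.path.join(
        hadoop_home, "share", "hadoop", "tools", "lib", "hadoop-streaming*.jar"
    )
    return sorted(glob.glob(pattern))


def build_streaming_command(jar, mapper, reducer, input_path, output_path):
    """Assemble the argument list for the streaming job shown earlier."""
    return [
        "hadoop", "jar", jar,
        "-mapper", mapper,
        "-file", mapper,
        "-reducer", reducer,
        "-file", reducer,
        "-input", input_path,
        "-output", output_path,
    ]
```

The returned list could then be handed to `subprocess.run`. Building it as a list rather than a single string keeps the `*` in the input path away from the local shell, so the glob is left for Hadoop to expand.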