The MongoDB Connector for Hadoop is a plugin for Hadoop that provides the ability to use MongoDB as an input source and/or an output destination.
This guide also includes the following documentation:
The MongoDB Connector for Hadoop uses the SBT Build Tool tool for compilation. SBT provides superior support for discrete configurations targeting multiple Hadoop versions. The distribution includes self-bootstrapping copy of SBT in the distribution as sbt. Create a copy of the jar files using the following command:
The MongoDB Connector for Hadoop supports a number of Hadoop releases. You can change the Hadoop version supported by the build by modifying the value of hadoopRelease in the build.sbt file. For instance, set this value to:
hadoopRelease in ThisBuild := "cdh3"
configures a build against Cloudera CDH3u3.
hadoopRelease in ThisBuild := "0.21"
configures a build against Hadoop 0.21 from the mainline Apache distribution.
After building, you will need to place the “core” jar and the mongo-java-driver in the lib directory of each Hadoop server.
For more complete install instructions please see the install instructions in the readme
By Mike O’Brien
MongoDB, Hadoop and HuMONGOus Data by Steve Francia at MongoSF 2012
MongoDB + Hadoop by Brendan McAdams at MongoDB Philly 2012
mongo-hadoopで始める大規模ログ解析 〜低コストへの新たな道〜 (BigData Analysis with Mongo-Hadoop) by Daichi Morifuji at MongoTokyo 2012