- MongoDB Integration and Tools >
- MongoDB Connector for Hadoop
MongoDB Connector for Hadoop¶
The MongoDB Connector for Hadoop is a plugin for Hadoop that provides the ability to use MongoDB as an input source and/or an output destination.
This guide also includes the following documentation:
The MongoDB Connector for Hadoop uses the SBT Build Tool tool for compilation. SBT provides superior support for discrete configurations targeting multiple Hadoop versions. The distribution includes self-bootstrapping copy of SBT in the distribution as sbt. Create a copy of the jar files using the following command:
The MongoDB Connector for Hadoop supports a number of Hadoop releases. You can change the Hadoop version supported by the build by modifying the value of hadoopRelease in the build.sbt file. For instance, set this value to:
hadoopRelease in ThisBuild := "cdh3"
configures a build against Cloudera CDH3u3.
hadoopRelease in ThisBuild := "0.21"
configures a build against Hadoop 0.21 from the mainline Apache distribution.
After building, you will need to place the “core” jar and the mongo-java-driver in the lib directory of each Hadoop server.
For more complete install instructions please see the install instructions in the readme
- What’s New With MongoDB Hadoop Integration
By Mike O’Brien
MongoDB, Hadoop and HuMONGOus Data by Steve Francia at MongoSF 2012
MongoDB + Hadoop by Brendan McAdams at MongoDB Philly 2012
mongo-hadoopで始める大規模ログ解析 〜低コストへの新たな道〜 (BigData Analysis with Mongo-Hadoop) by Daichi Morifuji at MongoTokyo 2012