hadoop-mongodb driver and mahout

2012-07-02T01:02:50

I have set up Hadoop on top of MongoDB using the hadoop-mongodb driver. Currently I can successfully write the results of a MapReduce (M/R) job to a Mongo collection. I would like to use Mahout to take advantage of some of the algorithms it provides. Is it possible to run Mahout on top of MongoDB and output directly to a Mongo collection? Is there a how-to or a sample I can read?

Copyright License:
Author: maxsap. Reproduced under the CC BY-SA 4.0 license, with a link to the original source and disclaimer.
Link to: https://stackoverflow.com/questions/11283993/hadoop-mongodb-driver-and-mahout
