You are on page 1of 22

Cloud computing

Recommender data mahout

student : Ahmed

Kareem oleiwi
2015220080

What is mahout

An algorithm library for scalable machine learning on Hadoop.

Apache Mahout is a library of scalable machine-learning


algorithms, implemented on top of Apache Hadoop and using the
MapReduce paradigm. Machine learning is a discipline of artificial
intelligence focused on enabling machines to learn without being
explicitly programmed, and it is commonly used to improve
future.performance based on previous outcomes.

Once big data is stored on the Hadoop Distributed File System


(HDFS), Mahout.provides the data science tools to automatically
find meaningful patterns in those.

The problem and solution

I've successfully installed Hadoop Cluster with 3 machines, and the cluster is
running fine, and I just installed Mahout on the Main name node for "testing
purposes", and I followed the instructions of installation and set the JAVA_HOME,
but when I try to run classify-20newsgroups.sh it goes and download
the dataset but after that I get the following error:

The solution of this problem to Then I've revised the.bashrc and confirmed that
the JAVA_HOME is set correctly,
The .bashrc is only read by a shell that is non-login, otherwise is read
.bash_profile. .

There are another several possibilities to set the JAVA_HOME:


1) set .bashrc from terminal

You might also like