You are on page 1of 3

Assignment No: 6

Aim: Download, install and study the features of any open source data mining
tool, study its features from the Data Pre-processing, Mining, and Visualization
or in short, form Data Analysis perspective and compare its features with Weka.

Output:
The data mining tool is Rapid Miner.

1. The screenshots of software/tool installed on computer.


2. Specific features/advantages of Rapid Miner.
 Rapid miner is an environment for machine learning and data mining
processes.
 It provides drag and drop interface to design the analytic process.
 It has the compatibility with various databases like oracle, MySQL,
Excel, SPSS, Microsoft SQL server etc.
 Rapid miner uses XML to describe the operator trees modelling
knowledge discovery process.
 More than 100 learning schemes for regression, classification and
clustering analysis.
 Rapid miner includes many learning algorithms from Weka.
 Specialized for Business solutions that include predictive analysis and
statistical computing.
 Access to Twitter and Salesforce.com, NoSQL databases, cloud storage
like Dropbox and Amazon S3.
 Data Access-Access load and analyse any type of data.
 Data Exploration-Extract statistics and key information.
 Data Prep-Efficiently build better models faster.
3 Comparative chart to compare it with Weka.

Parameter Weka Rapid Miner


Partitioning of dataset Pass-limited methods Pass-limited methods
into training set and test
data
Descriptor Scaling Fail-cannot save Pass
parameters for scaling
to apply to future
datasets.
File Formats supported Supports only 4 file Supports
formats approximately 22 file
formats
Parameter optimization Fail-not automatic Pass
of machine learning
User Interface Easy Difficult, complex
Connectivity Bad connectivity with Easy connectivity
Excel and non java with Excel
databases.
Speed Works faster on any Needs more memory
machine to operate smoothly
Data Set Size Supports only small Supports large and
data sets small data set
Primary Usage Machine Learning Data Mining,
Predictive Analysis
Compatibility with MySQL, PostgreSQL, Oracle, IBM DB2,
database MySQL Server, Microsoft SQL
Oracle, ODBC, Server, MySQL,
SQLite 3.x, Excel, Access, SPSS,
HSQLDB, etc. etc.
Platform Supporting Platform Independent Platform Independent
Flexibility Easy to use but not Flexible
enough flexible
Visualization Limited Visualization Better visualization
Interface Type GUI / CLI GUI
Supported

You might also like