Professional Documents
Culture Documents
Clementine 7.0
Clementine 7.0
Quickly develop
The basic idea behind data mining as a business process isn’t Clementine is a data mining workbench that enables you to quickly
new. It’s simply automating the age-old process of learning from develop predictive models using business expertise and put them to
experience to make better decisions about the future. The only work to improve decision making. Clementine is widely regarded as
difference is that now the flood of data though most organizations — the leading data mining workbench because it delivers the maximum
often terabytes of data — makes enterprise-strength analytics return on your data investment in the minimum amount of time.
essential. Clementine makes it possible to capitalize on all of the Unlike other data mining workbenches, which fail to truly support
data available in your organization through data mining to improve the entire business process of data mining and just focus on models
the way you manage of the future. for enhancing performance — Clementine supports the entire data
mining process for generating maximum returns in minimum time.
Broad analytics produce the best results CRISP-DM makes data mining a business process
To solve business problems, your organization must use many types To help companies focus data mining technology on business problem
of data in many different ways. Different tasks and different data types solving, SPSS and a global consortium of organizations involved in
require different analytical techniques. Clementine provides you with data mining developed the CRoss-Industry Standard Process for Data
the best and broadest range of analytics available to ensure that you Mining (CRISP-DM). Unlike data mining methodologies that focus
have the analytical techniques you need to get the best results and on technology, CRISP-DM makes data mining a business process that
tackle business problems as they arise. Even if opportunities for maps business goals to data mining goals. Now the de facto industry
improving your business are hidden deep within massive datasets with standard, a recent poll shows that over 50 percent of data miners use
millions of rows — Clementine scales the entire data mining process CRISP-DM as their process for mining data. Clementine is designed
to the size of your challenge so you can quickly solve your problem. to support the CRISP-DM process at every step to help you avoid
common mistakes and create the focused predictive intelligence
needed to solve the problem at hand quickly.
The CRISP-DM method makes data mining a business process by focusing data and treatment
mining technology on your specific business goals. The process begins with an
3
understanding of your business goals and ends with deployment of data mining †
Included with Clementine
results to improve decision making and produce business results. *
Available as a Clementine add-on module
Maximize returns
by deploying dynamic predictive intelligence
to every point of decision making
Opportunities for improving your business are occurring at points Capitalize every point of decision making
throughout your organization at this very moment. But if you aren’t Clementine has the wide range of deployment options you need to put
using dynamic predictive intelligence throughout your organization predictive intelligence to work where it can generate maximum returns.
to make better decisions and seize these opportunities — you aren’t Clementine enables you to deploy predictive intelligence in two ways —
going to solve your business problem effectively and maximize returns deployment to decision makers and deployment to systems.
from your investment in data. If you don’t put the predictive business
intelligence created through data mining to work, you won’t see Clementine helps you deploy intelligence to decision makers so they
significant returns. It’s that simple. By adopting the business process can plan better strategies. And Clementine helps you deploy “automated”
of data mining and deploying dynamic predictive intelligence — you decisions from those strategies to systems — such as your call center
can seize these opportunities in real time. or Web site — to ensure that every point of decision making is focused
on your business goal. When every point of decision making is focused
on solving your business problem, you are truly getting the maximum
return on your investment on data.
Here’s an example of predictive intelligence deployed to a call center application. The call center rep has all of the predictive intelligence needed to have the best chance
of keeping this customer happy and driving lifetime value. The rep knows the customer’s lifetime value (1), their risk of churning (2) and the recommendation most likely
to keep the customer happy (3). That recommendation can even be refined in real-time — while the representative is talking to the customer — by conducting a brief
“needs assessment” survey. The rep then feeds that survey data into the call center application, and a new recommendation is generated based on the predictive model.
4
Dynamic intelligence moves at the speed
of change Standard Life secured £33 million
Your business is constantly changing. So too must business intelligence
that is meant to improve the future of your business. The business
worth of mortgage revenue
process of data mining is more than just simply creating models. There Situation
are a number of steps in the business process of creating predictive
Standard Life is one of the world’s leading mutual
intelligence — including data access and transformation, the applica-
tion of multiple models and the generation of recommendations based financial services companies. Its mutual status is
on model predictions. Clementine Solution Publisher deploys the entire a key factor in its success. This status brings many
data mining business process for “dynamic” predictive intelligence — benefits, the chief one being that there are no
meaning the intelligence is adaptable to the changes in your business. shareholders to satisfy, only customers. All the
firm’s actions are therefore driven by the need to
Decision makers “close the loop” of solving problems by sharing
benefit its customers.
results with analysts to improve the solution on a continual basis.
Once a data mining solution is deployed, the analysts monitor the
effectiveness of the solution on new data from the field. Then, they Critical issue
quickly and easily refine the data mining process to increase the Standard Life Bank launched its Freestyle Mortgage
solution’s effectiveness. Clementine Solution Publisher is the only product in January 1999. In recent months, a number
data mining product that empowers organizations to quickly update of similar products have appeared from rival providers
the entire data mining process — keeping the solution on track by
making it essential that Standard Life Bank both
re-deploying solutions to every point of decision making.
consolidate and continue to expand its share of the
Deployment of the entire data mining process mortgage market. A major part of the project was to
keeps costs low develop models that could identify customer charac-
Clementine Solution Publisher’s ability to export all processing steps teristics relevant to any mortgage product. Donald
means your organization realizes significant time and cost savings for MacDonald, customer data analyst at Standard Life
deployment over the long run. Clementine enables you to export more
further explains, “Our vision was to increase both
of your solution automatically, so you cost-effectively deliver data mining
results — and update them as your business changes. If only a model the speed at which we build our models and the
were exported, you would have to spend the time and money to sophistication of those same models. This ultimately
reprogram the solution manually. The costs of updating would pile leads to improved customer communications as well
up over time and reduce your ability to respond quickly to changes as greater returns on the bottom line.”
in your business.
Solution
With Clementine, Standard Life created a predictive
Â
model for the remortgage offer showing the types of
These days, people want to be clients attracted to this product. The model allowed
the bank to focus its efforts on the best prospects
able to use the results of their for the remortgage product and create scores for
each customer. These scores enabled them to achieve
analysis on the front lines of more targeted direct mail and to score prospects
with similar characteristics accessing the Web site.
their business, not just read
Results
about them in a dusty report. ■ Achieved with the model, a nine times greater
Clementine Solution Publisher response than that achieved by the control group
■ Secured £33million (approx. $47 million) worth of
enables you to turn the findings mortgage application revenue
Ê
from predictive models into
action.
– David Pihlens
Managing Partner
Commetrix 5
Minimize time
to solution with the most productive
data mining workbench available
Data mining with Clementine is a business process designed to Clementine’s wide range of data visualization techniques also
minimize the time it takes to find solutions to business problems. accelerate progress toward a solution by helping you understand
Clementine’s powerful visual interface — combined with time-saving key relationships in your data and guiding the way to the best results.
process support tools — enables you to use business expertise to Explore multi-dimensional data with 3-D, panel, animation and other
quickly interact with your data and discover solutions in the shortest types of data visualization. Spot characteristics and patterns at a
amount of time possible. Clementine supports the full data mining glance with Clementine’s interactive graphs. Then “query by mouse”
process, including data access, transformations, modeling, evaluation to explore these patterns by selecting subsets of data or deriving new
and deployment. Clementine not only supports the entire data mining variables on the fly from discoveries made within the graph.
process from beginning to end, it supports the industry standard
process — CRISP-DM. Unparalleled analytics produce the best
results possible
Quickly discover solutions using business Clementine enables you to interactively mine your data with a
expertise comprehensive set of powerful modeling techniques to help you
The interactive, visual approach to data mining is the key to Clementine’s find the best result in the shortest amount of time. Effective data
ability to minimize time to solution. You search for a solution to your mining takes more than one algorithm or technique. Different
business problem by creating and interacting with a stream — a visual business problems and different data types requires different
map of the entire data mining business process — which your data techniques. To find the best solution to your problem, you need
flows through. This visual approach makes it easy to see every step in a range of techniques to choose from. Clementine offers an unparalleled
the process clearly — and enables you to use business expertise to breadth and depth of techniques. Other optional analytics — which
quickly explore hunches or ideas by interacting with the stream. easily “plug into” Clementine — are also available from outside
vendors through the Clementine Plus Partner Program. With this
open design, you can even plug your own applications into Clementine
and use a single, productive workbench for all your analysis.
Clementine’s visual approach makes it easy to see and interact with every step in
the business process of data mining — from accessing data wherever it resides to
deploying results to the point of decision making.
6
Computer retailer Sofmap increased sales by 18 percent
Situation Solution
Sofmap Company, Ltd., Tokyo, is one of Japan’s top Sofmap used SPSS Inc.’s Clementine data mining
personal computer and software retailers with 40 solution to build an engine that recommends
retail stores located throughout the country. appropriate products based on customers’ profile,
which is based on information gathered during
Critical issue the online registration process and from past
Sofmap managers believed that many of their transactions.
customers had difficulty making hardware and
software purchasing decisions, which was hindering Results
online sales. ■ Page views increased 67 percent per month after
recommendation engine went live
■ Profits tripled, as sales increased 18 percent vs.
the same period of the previous year
Find the best performing models quickly tailored to your business problem. Clementine also gives you process
The ease with which you can build and combine predictive models in support tools to manage your data mining projects by mapping them
Clementine means that you find the best performing models quickly. directly to the CRISP-DM process. The CRISP-DM help system and
Easily evaluate multiple models with lift, gains, profit, response and project tools help you organize streams, graphs and other output into
other model evaluation graphs in a one-step process, shortening project CRISP-DM phases.
time. All of Clementine’s advanced techniques work together to quickly
provide the best answer to your business problems. You can build and Scalability accelerates data mining
test numerous models to immediately see which model produces the and leverages IT investments
best result. Or even combine models by using the results of one model You don’t need expensive new hardware and software investments
as input into another model. These “meta-models” consider the initial with complicated configurations to experience scalable data mining.
model’s decisions and can improve results substantially. Clementine’s approach to scalability accelerates data mining and
leverages your existing IT investment by pushing processing back
Process support tools jumpstart projects into the database with in-database mining.
and improve results
Clementine provides process support tools for getting started quickly Data mining is a discovery-driven process — when you see a relationship
and evaluating your results. Clementine Application Templates (CATs) that sparks your curiosity, you need enterprise-strength scalability to
give you a head start for tackling common problems with application- interactively explore and analyze the pattern. These discoveries are a
specific best practices. By using pre-built streams of complex but necessary step to building higher-performing models. Clementine’s
common operations, you can expedite the time needed to see results. approach to scalability supports this interactivity. Most data mining
CATs combine the power and flexibility of a horizontal workbench tools focus on scaling narrow steps, which affect only the speed of a
with application-specific templates to deliver high-performing solutions part of the process. Instead, Clementine focuses on scaling as many
steps as possible to give you faster results overall.
Ê – Daniele Micci-Barreca
Director of Data Mining
ClearCommerce
7
Specifications
Clementine Application Templates Modeling algorithms System requirements
■ CRM CAT† ■ Prediction and classification: Clementine client:
■ Web mining CAT† – Neural networks (Multi-Layer ■ Hardware: Pentium-compatible processor or higher
■ Telco CAT† Perceptron, Radial Basis Function) and a monitor with 1024 x 768 resolution or higher
■ Crime CAT* – Decision trees and rule induction (C5.0, C&RT) (support for 65,536 colors is recommended). A
■ Fraud CAT* – Linear regression, logistic regression, multinomial CD-ROM drive for installation is also required.
■ Microarray CAT* logistic regression ■ Operating system: Windows 95, Windows 98,
– Native access to database management systems – GRI, Apriori and Web visualization – Additional option: Clementine Application
including Oracle, SQL Server, DB2; additional ■ Data reduction: Template for Web Mining: 115MB
access to any ODBC-compliant data source – Factor analysis, principle components – Additional option: Clementine Application
®
– Import delimited and fixed-width text, SPSS analysis Template for Telecommunications: 60MB
®
and SAS 6, 7, 8 files ■ Combine models for greater accuracy ■ Minimum RAM: 128MB
Data preparation for scalability plus at least twice the drive space of the amount
■ Generate subsets of data automatically from ■ Minimized network traffic via intelligent field of data to be processed.
graphs and tables projection ■ Minimum RAM: 128MB
■ Manipulate data with complete record and field Deployment with Clementine Clementine Solution Publisher Runtime:
■ Hardware: Pentium-compatible processor or
operations, including: Solution Publisher
■ Automated export of all operations, including: higher for Windows, SPARC for Solaris, HP
– Field filtering, naming, derivation and
– Data access Workstation for HP/UX or IBM RS/6000 for AIX.
value replacement
– Data manipulations A CD-ROM drive for installation is also required.
– Record selection, sampling, merging and ■ Operating system: Windows 2000 or Windows NT
concatenation, sorting, aggregation and balancing – Model scoring, including combinations of models
– Post-processing 4.0 with Service Pack 6 or higher; Solaris 2.6, 7 or 8;
– Specialized manipulations for showing the
■ Runtime environment for executing image file on HP/UX 11.0 or 11i; AIX 4
“history” of values and converting set variables ■ Minimum free drive space: 4MB for installation;
into flag variables target platforms
■ Easy update of solutions through small image file plus at least twice the drive space of the amount
of data to be processed.
■ Minimum RAM: 128MB
SPSS is a registered trademark and the other SPSS products named are trademarks of SPSS Inc. All other names are trademarks of their respective owners. Printed in the U.S.A. © Copyright 2002 SPSS Inc. CLM7BRO-0802