Professional Documents
Culture Documents
“Data Mining for Business Analytics: Concepts, Techniques, and Applications with JMP Pro® hits the ‘sweet spot’ in terms
Stephens • Patel
Shmueli • Bruce
of balancing the technical and applied aspects of data mining. The content and technical level of the book work
beautifully for a variety of students ranging from undergraduates to MBAs to those in applied graduate programs.”
— Allison Jones-Farmer, Van Andel Professor of Business Analytics & Director of the Center for Analytics and Data Science,
Department of Information Systems & Analytics, Farmer School of Business, Miami University
Featuring hands-on applications with JMP Pro®, a statistical package from the SAS Institute, the book
BUSINESS ANALYTICS
uses engaging, real-world examples to build a theoretical and practical understanding of key data
mining methods, especially predictive models for classification and prediction. Topics include data
visualization, dimension reduction techniques, clustering, linear and logistic regression, classification
and regression trees, discriminant analysis, naive Bayes, neural networks, uplift modeling, ensemble
Data Mining for Business Analytics: Concepts, Techniques, and Applications with JMP Pro® also includes:
Detailed summaries that supply an outline of key topics at the beginning of each chapter
CONCEPTS, TECHNIQUES, AND APPLICATIONS WITH JMP PRO®
End-of-chapter examples and exercises that allow readers to expand their comprehension of the
presented material
Data-rich case studies to illustrate various applications of data mining techniques
A companion website with over two dozen data sets, exercises and case study solutions, and slides
for instructors
Data Mining for Business Analytics: Concepts, Techniques, and Applications with JMP Pro® is an excellent
textbook for advanced undergraduate and graduate-level courses on data mining, predictive
analytics, and business analytics. The book is also a one-of-a-kind resource for data scientists, analysts,
researchers, and practitioners working with analytics in the fields of management, finance, marketing,
information technology, healthcare, education, and any other data-rich field.
Galit Shmueli, PhD, is Distinguished Professor at National Tsing Hua University’s Institute of
Service Science. She has designed and instructed data mining courses since 2004 at University
of Maryland, Statistics.com, Indian School of Business, and National Tsing Hua University, Taiwan.
Professor Shmueli is known for her research and teaching in business analytics, with a focus on
statistical and data mining methods in information systems and healthcare. She has authored
over 70 journal articles, books, textbooks, and book chapters, including Data Mining for Business
Analytics: Concepts, Techniques, and Applications with XLMiner®, Third Edition, also published by Wiley.
Peter C. Bruce is President and Founder of the Institute for Statistics Education at www.statistics.com
He has written multiple journal articles and is the developer of Resampling Stats software. He is the
author of Introductory Statistics and Analytics: A Resampling Perspective and co-author of Data Mining
for Business Analytics: Concepts, Techniques, and Applications with XLMiner ®, Third Edition, both published
by Wiley.
Mia Stephens is Academic Ambassador at JMP®, a division of SAS Institute. Prior to joining SAS, she
was an adjunct professor of statistics at the University of New Hampshire and a founding member
of the North Haven Group LLC, a statistical training and consulting company. She is the co-author of
three other books, including Visual Six Sigma: Making Data Analysis Lean, Second Edition, also published
by Wiley.
Nitin R. Patel, PhD, is Chairman and cofounder of Cytel, Inc., based in Cambridge, Massachusetts. A
Fellow of the American Statistical Association, Dr. Patel has also served as a Visiting Professor at the
Massachusetts Institute of Technology and at Harvard University. He is a Fellow of the Computer Society
of India and was a professor at the Indian Institute of Management, Ahmedabad, for 15 years. He is
co-author of Data Mining for Business Analytics: Concepts, Techniques, and Applications with XLMiner®,
Third Edition, also published by Wiley. Galit Shmueli • Peter C. Bruce • Mia L. Stephens • Nitin R. Patel
Subscribe to our free Statistics
eNewsletter at wiley.com/enewsletter www.jmp.com/dataminingbook
www.wiley.com/statistics
0.625 inches
FOREWORD xvii
PREFACE xix
ACKNOWLEDGMENTS xxi
PART IPRELThflNARffiS
1 Introduction 3
1.1 What Is Business Analytics? 3
Who Uses Predictive Analytics? 4
1.2 What Is Data Mining? 5
1.3 Data Mining and Related Terms 5
1.4 Big Data 6
1.5 Data Science 7
1.6 Why Are There So Many Different Methods? 7
1.7 Terminology and Notation 8
1.8 Roadmap to This Book 10
Order of Topics 11
Using JMP Pro, Statistical Discovery Software from SAS 11
vii
viii CONI'ENTS
Predictive Analytics 16
Data Reduction and Dimension Reduction 16
Data Exploration and Visnalization 16
Supervised and Unsupervised Learning 16
2.3 The Steps in Data Mining 17
2.4 Preliminary Steps 19
Organization of Datasets 19
Sampling from a Database 19
Oversampling Rare Events in Classification Tasks 19
Preprocessing and Cleaning the Data 20
Changing Modeling Types in JMP 20
Standardizing Data in JMP 25
2.5 Predictive Power and Overfitting 25
Creation and Use of Data Pattitions 25
Pattitioning Data for Crossvalidation in JMP Pro 27
Overfitting 27
2.6 Bttilding a Predictive Model with JMP Pro 29
Predicting Home Valnes in a Boston Neighborhood 29
Modeling Process 30
Setting the Random Seed in JMP 34
2.7 Using JMP Pro for Data Mining 38
2.8 Automating Data Mining Solutions 40
Data Mining Software Tools: the State of the Market by Herb Edelstein 41
Problems 44
3 Data Visnalization 51
3.1 Uses of Data Visualization 51
3.2 Data Examples 52
Example 1: Boston Housing Data 53
Example 2: Ridership on Amtrak Trains 53
3.3 Basic Charts: Bar Charts, Line Graphs, and Scatterplots 54
Using The JMP Graph Builder 54
Distribution Plots: Boxplots and Histograms 56
Tools for Data Visualization in lMP 59
Heatmaps (Color Maps and Cell Plots): Visualizing Correlations
and Missing Values 59
3.4 Multidimensional Visualization 61
Adding Variables: Color, Size, Shape, Multiple Panels, and
Auimation 62
Manipulations: Rescaling, Aggregation and Hierarchies, Zooming,
Filtering 65
Reference: Trend Lines and Labels 68
Adding Trendlines in the Graph Bttilder 69
Scaling Up: Large Datasets 70
CONTENTS ix
4 Dimension Reduction 81
4.1 Introduction 81
4.2 Curse of Dimensionality 82
4.3 Practical Considerations 82
Example 1: House Prices in Boston 82
4.4 Data Summaries 83
SUIII1llaI)' Statistics 83
Tabulating Data (Pivot Tables) 85
4.5 Correlation Analysis 87
4.6 Reducing the Number of Categories in Categorical Variables 87
4.7 Converting a Categorical Variable to a Continuous Variable 90
4.8 Principal Components Analysis 90
Example 2: Breakfast Cereals 91
Principal Components 95
Normalizing the Data 97
Using Principal Components for Classification and Prediction 100
4.9 Dimension Reduction Using Regression Models 100
4.10 Dimension Reduction Using Classification and Regression Trees 100
Problems 101
11 Neura1Nets 245
11.1 lntroduction 245
11.2 Concept and Structure of a Neural Network 246
CON1ENTS xiii
𝛼
xvi CONTENTS
®
®
®
®
®
Predictive Analytics: Supervised Learning
Forecasting Analytics
ACKNOWLEDGMENTS
The authors thank the many people who assisted us iu improving the first edition and im-
proving it further iu the second edition, and now iu this JMP edition. Anthony Babiuec, who
has been usiug drafts of this book for years iu his Data Miniug courses at Statistics.com,
provided us with detailed and expert corrections. The Statistics.com team has also provided
valuable checkiug, trouble-shooting, and critiqniug: Kuber Deokar, Instructional Opera-
tions Supervisor, and Shweta Jadhav and Dhanashree Vishwasrao, Assistant Teachers. We
also thank the many students who have used and commented on earlier editions of this text
at Statistics.com.
Similarly, Dan Toy and John Elder IV greeted our project with enthusiasm and provided
detailed and useful comments on earlier drafts. Boaz Shmueli and Raquelle Azran gave
detailed editorial comments and suggestions on the first two editions; Noa Shmueli provided
edits on the new edition; Bruce McCullough and Adam Hughes did the same for the first
edition. Ravi Bapna, who used an early draft in a Data Miniug course at the Indian School
of Busiuess, provided iuvaluable comments and helpful suggestions. Useful comments and
feedback have also come from the many iustructors, too numerous to mention, who have
used the book iu their classes.
From the Smith School of Business at the University of Maryland, colleagues Shrivard-
han Lele, Wolfgang Jank, and Paul Zantek provided practical advice and comments. We
thank Robert Wiudle, and MBA students Timothy Roach, Pablo Macouzet, and Nathan
Birckhead for iuvaluable datasets. We also thank MBA students Rob Whitener and Daniel
Curtis for the heatmap and map charts. And we thank the many MBA students for fruitful
discussions and interesting data miniug projects that have helped shape and improve the
book.
This book would not have seen the light of day without the nurturing support of the faculty
at the Sloan School of Management at MIT. Our special thanks to Dimitris Bertsimas, James
Orlin, Robert Freund, Roy Welsch, Gordon Kaufmann, and Gabriel Bitran. As teaching
assistants for the data miniug course at Sloan, Adam Mersereau gave detailed comments
on the notes and cases that were the genesis of this book, Romy Shioda helped with the
®
®
PART I
PRELIMINARIES
Another random document with
no related content on Scribd:
CHAPTER I
INTRODUCTION
But in spite of all this gaiety, Mr. Taft was making very satisfactory
progress in his career. As a law reporter he showed his growing
interest in the public welfare by meeting certain elements in
Cincinnati politics with vigorous denunciation. There was a man
named Tom Campbell, a clever criminal lawyer, who had something
more than a suspicion against him of bribery and corruption of both
witnesses and juries, and he had succeeded in organising a political
machine that was running the town according to his directions.
Campbell was counsel for the defence in what was known as the
Hoffman case and was strongly suspected of tampering with the jury,
and Mr. Taft in reporting the case, took special pains to bring out all
the fine points in the lawyer’s character and methods, telling the
truth as he saw it.
This brought him into association with Mr. Miller Outcalt, the
Assistant Prosecuting Attorney, who represented the State in the
Hoffman case, and when Mr. Outcalt succeeded by election to the
position of prosecuting attorney he offered the place of assistant to
Mr. Taft, although he had been at the bar not more than seven
months. Mr. Taft served in this office for fourteen months and the
experience he had in the rough-and-ready practice in criminal trials,
in preparing cases for trial, in examining witnesses, in making
arguments to the court and in summing up to the jury, was the most
valuable experience he could possibly have in fitting him for trial
work at the bar.
But this experience was shortened by a circumstance not of his
seeking. Major Benjamin Butterworth was the Congressman from
one of the Cincinnati districts in President Arthur’s administration,
and the President being anxious to relieve the Collector of Internal
Revenue, called on Major Butterworth to suggest the name of
another man. Major Butter worth had been for a long time a warm
friend of Mr. Taft, thought he had a good family name and was too
young in politics to have many political enemies, so he suggested him
and wrote to urge him to accept the appointment which the
President immediately offered to him. He accepted the place and
held it for a year, but it proved a serious interruption in his legal
career. He resigned as soon as it was possible and began practice
with Major H. P. Lloyd who had been his father’s partner before he
went to Vienna.
Mr. Taft went abroad in the summer of 1883 to visit Judge and
Mrs. Taft in Vienna, and it was about this time, when we had all
spent several years in frivolities, that several of us became very
serious-minded and decided that we must have something by way of
occupation more satisfying than dancing and amateur theatricals. I
secured a position as school teacher and taught for two years, first at
Madame Fredin’s and then at White and Sykes, both private schools
out on Walnut Hills. Then, with two of my intimate friends, I decided
to start a “salon.” We called it a “salon” because we planned to
receive a company who were to engage in what we considered
brilliant discussion of topics intellectual and economic, and we
decided that our gathering should include only specially invited
guests. Among these were the two Taft brothers, Will and Horace,
and other men common friends of us all.
In view of the fact that two marriages resulted from this salon, Mr.
Taft has suggested ulterior motives on the part of those who got it up,
but there was no truth in the charge. We were simply bent on
“improving our minds” in the most congenial atmosphere we could
create, and if our discussions at the salon usually turned upon
subjects of immediate personal interest, to the neglect of the abstruse
topics we had selected for debate, it was because those subjects were
just then claiming the attention of the whole community.
Cincinnati, thanks to the activities of Tom Campbell and his
followers, was then in a tangle of political mismanagement of a
particularly vicious character, and our little circle developed a civic
spirit which kept us alive to local interests to the exclusion, for the
time being, of everything else. Mr. Taft was intimately connected
with the reform movement, and in all its phases, through comedy
and tragedy, disappointment and elation, we fought it out at our
salon meetings with such high feeling and enthusiasm that its history
became the history of our lives during that period.
Then came the famous Berner case. This was in 1884. Berner had
committed a deliberate murder of an unusually appalling nature and
with robbery as the motive, and there was great excitement about it.
Campbell became his counsel and, in a trial which held the attention
of the community while it lasted, he succeeded in getting the man off
for manslaughter when the unanimous opinion was that he should
have been hanged. Nobody could see how an honest jury could have
rendered any other verdict. There was intense indignation
throughout the city and a meeting was called to denounce Campbell
as an embracer of juries and a suborner of perjury.
On the evening when the meeting to denounce Campbell was
called we were having a session of the salon and our whole
discussion was of the possible developments which might grow out of
the infamous Berner trial. We were greatly excited about it. I
remember the evening distinctly because of the terrible things that
happened. We were disturbed by a great commotion in the street and
we sallied forth in a body to see what it was all about.
The mass meeting was held at Music Hall and was presided over
by Dr. Kemper, a very effective speaker. The crowd was angry and
quickly passed the condemnatory resolutions which were framed.
But with all the indignation and resentment everything might have
been carried out quite calmly had not the match been applied to the
powder. Just as the meeting was breaking up somebody shouted:
“Let’s go down to the jail and take Berner out!”
It was an appeal to the mob spirit which responds so readily in an
angry crowd; they went; and of course the worst elements
immediately came to the top. They attacked the jail, which was in the
rear of the court house, but were held back until the militia, which
had been instantly summoned, arrived. Then they went around to
the front and set fire to the court house. With the streets packed with
raging humanity it was not possible to fight the fire and the building
was completely destroyed.
The militia charged the mob and this inspired somebody with the
idea of raiding a gun store and seizing arms and ammunition with
which to make a resistance. The idea caught on and spread rapidly.
One place attacked was Powell’s gun shop near Fourth and Main. But
Powell, either forewarned or foreseeing some such development, had
quietly made preparations to meet it. He lighted up the front of the
store as brightly as he could, then, with two or three other men who
were expert shots, he put himself behind a barricade in the rear. The
mob came on and as the ringleaders broke into the shop they were
picked off by the men behind the barricade and killed in their tracks.
Four or five of them went down in a heap and the crowd behind
them, not expecting such a reception, instantly was brought to its
senses. This was in April, 1884.
Such an outbreak was a disgrace to the city of Cincinnati, but it
had the effect of bringing the Campbell controversy to a head. A bar
committee of ten men, of which both my father and Mr. Taft were
members, was formed to see what could be done to rid the
community of the evil reputation it had acquired. This committee
made a thorough investigation of Campbell’s character and record,
prepared charges against him and, with my father as chairman,
presented them, in June, 1884, to the district court of three judges,
and asked a hearing and Campbell’s disbarment if the charges were
proved.
MEMBERS OF THE SALON. MR. TAFT IN THE CENTER, WITH
THE AUTHOR AT HIS RIGHT
The trip was full of interest to us both. We spent the greater part of
the summer in England and saw the sights of London and the
cathedral towns in great detail. Our only trip on the Continent was
through Holland to Paris. I remember that in Amsterdam I bought
some old and rather large Delft plates. They wouldn’t go into any
trunk we had, so I had them carefully packed in a wicker hamper and
this article became thereafter a part of our hand luggage, and was the
occasion for a decided disagreement between my husband and me as
to what the true object of travel was. He used to say that he “toted
that blamed thing all around Europe and after all it arrived in
Cincinnati with its contents in small pieces.” Which was true. He had
“toted” it all around Europe, but when we arrived in New York I
entrusted it to an express company with the result that when we
opened it we found its contents in such a condition that only an
accomplished porcelain mender could put a sufficient number of
pieces together to make what my husband always afterward referred
to as “the memento of our first unpleasantness.”
Our trip from Cincinnati to Cincinnati took just one hundred days
and cost us just one thousand dollars, or five dollars a day each. I
venture to say that could not be done nowadays, even by as prudent a
pair as we were.
During a subsequent trip abroad, two years later, I was able to
indulge my desire to hear music. We went to Beyreuth, to the
Wagner festival, and heard Parsifal and The Meistersingers
gloriously rendered; after which we went to Munich and attended
operas and concerts until Mr. Taft rebelled. He said that he enjoyed a
certain amount of music just as much as anybody, but that he did
want to get something more out of European travel than a nightly
opera and a daily symphony.
So—we went to Italy and saw Rome and Florence in true
Baedecker style. When we arrived in Rome we opened our Baedecker
and read that there was almost no foundation for Rome’s awful
reputation as an unhealthy place. “Rome is a very healthy place,” said
Baedecker, “at all times of the year except the first two weeks in
August, when a visit there is attended with risk.” We had arrived for
the first two weeks in August!
When we came home from our wedding trip we found that our
house was not yet completed, so we went to stay with Judge and Mrs.
Taft for a month at the old house in Mt. Auburn. It was a nice old
place, with about three acres of ground, but the air around it was just
about as sooty as if it had been located down under the factory
chimneys. Mt. Auburn is on a sort of promontory which juts out into
the city; it is on a level with the tops of the smoke stacks and it
catches all the soot that the air can carry that far.
Judge and Mrs. Taft had come home from their European mission
in time for our wedding. Judge Taft had been ill in St. Petersburg and
had given his family a great deal of anxiety, but he was now settled
down to the business of quiet recuperation and the enjoyment of
well-earned rest.
My husband’s father was “gentle” beyond anything I ever knew. He
was a man of tremendous firmness of purpose and just as set in his
views as any one well could be, but he was one of the most lovable
men that ever lived because he had a wide tolerance and a strangely
“understanding sympathy” for everybody. He had a great many
friends, and to know him was to know why this was so.
Mr. Taft’s mother, though more formal, was also very kindly and
made my visit to her home as a bride full of pleasure. The two, the
father and mother, had created a family atmosphere in which the
children breathed in the highest ideals, and were stimulated to
sustained and strenuous intellectual and moral effort in order to
conform to the family standard. There was marked serenity in the
circle of which Judge and Mrs. Taft were the heads. They had an
abiding confidence in the future of their children which strongly
influenced the latter to justify it. They both had strong minds,
intellectual tastes, wide culture and catholic sympathies.
Not long after we arrived my husband came to me one day with an
air of great seriousness, not to say of conciliation and said:
“Nellie, Father has got himself into rather a difficulty and I hope I
can rely on you to help him out—not make it too hard for him, you
know,—make him feel as comfortable about it as you can. The truth
is he used to have a messenger at the War Department in
Washington whom he was very fond of. He was a bright man—
colored, of course—and he was very devoted to Father. Now this man
called on Father down town to-day. He’s here on a private car and
Father says he’s made a great success as a porter. Father got to
talking to him, and there were lots of things they wanted to talk
about, and besides the man said he would like very much to see
Mother,—and Father, who was just about ready to come home to
lunch said—right on the spur of the moment—you understand he
didn’t think anything about it—he said to this man, ‘Come on home
and have lunch with us.’ He’s downstairs now. Father came to me
and said he had just realised that it was something of a difficulty and
that he was sorry. He said that he could take care of Mother if I could
take care of you. So I hope you won’t mind.”
As soon as I could control my merriment caused by this halting
and very careful explanation, I went down to luncheon. I didn’t mind
and Will’s mother didn’t mind, but the expression on the face of
Jackson, the negro butler, was almost too much for my gravity. I will
say that the porter had excellent manners and the luncheon passed
off without excitement.
We made a short visit at my mother’s on Pike Street before we
moved into our new house on McMillan Street; but we began the
year of 1887 under our own roof which, though it was mortgaged,
was to us, for the time being, most satisfactory.