Using JMP® Student Edition

For Windows and Macintosh

The User’s Guide to Statistics with JMP® Student Edition

The correct bibliographic citation for this manual is as follows: SAS Institute Inc. 2009. Using JMP® Student Edition. Cary, NC: SAS Institute Inc.
Using JMP® Student Edition

Copyright © 2009, SAS Institute Inc., Cary, NC, USA ISBN 978-1-60764-190-2 All rights reserved. Produced in the United States of America. For a hard-copy book: No part of this publication may be reproduced, stored in a retrieval system, or transmitted, in any form or by any means, electronic, mechanical, photocopying, or otherwise, without the prior written permission of the publisher, SAS Institute Inc. For a Web download or e-book: Your use of this publication shall be governed by the terms established by the vendor at the time you acquire this publication. U.S. Government Restricted Rights Notice: Use, duplication, or disclosure of this software and related documentation by the U.S. government is subject to the Agreement with SAS Institute and the restrictions set forth in FAR 52.227-19, Commercial Computer Software-Restricted Rights (June 1987). SAS Institute Inc., SAS Campus Drive, Cary, North Carolina 27513. 1st printing, April 2009 SAS® Publishing provides a complete selection of books and electronic products to help customers use SAS software to its fullest potential. For more information about our e-books, e-learning products, CDs, and hard-copy books, visit the SAS Publishing Web site at support.sas.com/publishing or call 1-800-727-3228. SAS® and all other SAS Institute Inc. product or service names are registered trademarks or trademarks of SAS Institute Inc. in the USA and other countries. ® indicates USA registration. Other brand and product names are registered trademarks or trademarks of their respective companies. For more information about this or any other JMP product, contact:
SAS INSTITUTE INC. SAS Campus Drive Cary, NC 27513 USA

For permission to use this work, contact us:
www.jmp.com/se fax: 919.677.4444 phone: 919.677.8000

Technical Support is provided by the publisher of the work that JMP-SE is bundled with, and to registered instructors.

Table of Contents
1 Getting Started with JMP Student Edition
Prerequisites For This Book . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Computer and Operating System . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Statistics . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Learning JMP Student Edition with its Documentation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Conventions Used in This Book . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Starting JMP Student Edition . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . JMP Student Edition Toolbars . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Windows . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Macintosh . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . First Session . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Finding Means, Medians, and Standard Deviations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Where to Go from Here . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11 11 11 11 12 12 13 13 15 15 16 19

2

The Distribution Platform
Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . About the Data . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Launching the Platform . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Using Histograms . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Testing a Mean . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Normality . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Testing Probabilities . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Annotating Results . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . The Modeling Type of Variables . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Continuous Variable Graphs and Reports . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Display Options . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Histogram Options . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Normal Quantile Plots . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Outlier Box Plot . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Quantile Box Plots . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Stem and Leaf Plots . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . CDF Plot . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Fit Distribution . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Categorical Variable Graphs and Reports . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Statistical Tests . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Testing a Mean . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 21 21 22 22 24 25 26 28 29 30 30 30 31 31 32 32 33 33 34 34 35

Testing a Standard Deviation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Testing Categorical Probabilities . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Confidence Intervals . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Saving Information . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Whole-Platform Options . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Capability Analysis . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

35 36 36 37 38 39

3

The Fit Y by X Platform
Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Launching the Platform . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Computing a t-test . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Pooled t test . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Selecting and Marking Points . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Analysis of Variance (ANOVA) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Comparison Circles . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Fitting Lines . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Correlation Coefficient . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Disclosure Icon . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Residuals . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Two-Way Contingency Tables . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Logistic Regression . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . The Formula Editor . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Scatterplots—The Continuous by Continuous Case . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . The Summary of Fit Table . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . The Lack of Fit Table . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Analysis of Variance Table . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Parameter Estimates Table . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Other Fitting Commands . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . One Way ANOVA—The Continuous by Categorical Case . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Script Submenu . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Contingency Analysis—The Categorical by Categorical Case . . . . . . . . . . . . . . . . . . . . . . . . . . . . Logistic Regression—The Categorical by Continuous Case . . . . . . . . . . . . . . . . . . . . . . . . . . . . . The Whole Model Test Table . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Platform Options . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 45 45 46 47 48 50 50 51 53 53 53 54 55 56 58 59 60 60 60 61 61 66 66 67 68 69

4

The Matched Pairs Platform
Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 71 Preparing the Data . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 71 Launching the Platform . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 72

. . . . . . . . . . . . . . 110 Process Information . . . . . . . . . . . . . . . . . . . . . . . Roles . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .The Matched Pairs Launch Dialog . . . . . . . . . . . . . 105 7 Control Charts Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 104 All Possible Regressions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 104 Make Model . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Run Model . . . . . . . . . . . . . . . . . Regression Reports . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 75 5 The Fit Model Platform Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Row Diagnostics . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Save Commands . . . . . . . . . . . . . . . . . . . . . . . 73 Interpreting the Matched Pairs Plot . . . . . . 107 The Control Chart Launch Dialog . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 111 . . . . . . . . Exploring the Estimates . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Examining Results . . . . . . . . . . Least Squares Means . . . . . . . . . . . . . . . Leverage Plots . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Macros . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Fitting Personalities . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Emphasis Choices . . . . . . . . . . . . . . . . . . . Model Effects . . . . . . . . . . . . . . . . . . . . . . . . 79 79 80 81 82 82 84 85 85 86 86 87 88 88 88 89 91 93 93 95 96 6 Stepwise Regression Introduction . . 102 Stepwise Regression Control Panel . . . . . . . . . . . . . . . Linear Contrasts . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Fit Model Report Items . . . . . . . . . . . . . . . . . . . . . . Launching the Platform . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Re-running an Analysis . . . . . . . . . . . . . Effect Details . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 73 The Matched Pairs Scatterplot . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Setting Titles . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 103 Step History Table . . . . . . . . . . . . . . . . 99 The Stepwise Window . . . . . . . . . . The Fit Model Dialog Box . . . . . 102 Current Estimates Table . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

. . . . . Launch Options for Cusum Charts . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . p. . . . . . . . . . and S. . . . . . . Example . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . The Time Series Graph . . . . . . . . . . . . . . . . . Individual Measurement Charts . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Excluded. . . . . . . . . . . . . . . . . . . Using Specified Statistics . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Parameters . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Number of Forecast Periods . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Run Charts . . . . . . . . . . . . . . . . . . . . . . . . . . . .Charts . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . R-. . . . . 113 114 114 115 115 116 118 119 120 122 124 124 125 125 126 127 129 130 131 132 133 133 135 136 137 8 Time Series Introduction . . . . Time Series Commands . . . . . . . . . Autocorrelation . . . . . . . . . . . . . . . . . . . Model Summary Table . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Tests for Special Causes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Model Comparison Table . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Tailoring the Horizontal Axis . . . . .and np-Charts . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 143 147 147 148 149 149 149 150 150 151 151 152 . . . . . . . . . . . . . . . . . . . . . c-Charts . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Shewhart Control Charts . . . . . . . . . . . . . . . . . . . Cusum Chart Options . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . The Time Series Platform . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Shewhart Control Charts for Attributes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Display Options . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Westgard Rules . . . . . . . . . . . Cumulative Sum (Cusum) Charts . . . . . . . . . . . . . . . . Shewhart Control Charts for Variables . u-Charts . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Western Electric Rules . . and Deleted Samples . . . . . . . . . . . . . . . . . . . . . . . . . . . . Single Chart Options . . . . . . . . . . . . . . . . Parameter Estimates Table . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Graph . . . Modeling Reports . . . . . . . . . . . . . . . . . . . . . Phases . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .Chart Type Information . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Window Options . . . . . . . . . . . . . . . . Hidden. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . XBar-. . . . . . . Partial Autocorrelation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

. . . . . . . . Smoothing Models . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Correlations Multivariate . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Results from Platforms . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . ARIMA Model . . . . . . Nonparametric Correlations . . . . Importing Text Files . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Launch the Platform and Select Options . . . . . . . Windows . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Exporting. . . Importing Data . . . . . . . . . . . . . . Residuals . Double (Brown) Exponential Smoothing . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Smoothing Model Dialog . . . . . . . . . Computations and Statistical Details . . . . Using the Chart Platform . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Damped-Trend Linear Exponential Smoothing . . . . . .Forecast Plot . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Scatterplot Matrix . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Model Report Options . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 171 171 173 175 175 176 177 179 180 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Nonparametric Measures of Association . . . . . . . . . . . . . . . . . 153 153 153 154 154 155 156 157 157 158 158 158 159 9 Correlations and Multivariate Techniques Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Linear (Holt) Exponential Smoothing . . . . . . . . . . . . . . . . . . . . . . . . . . . . Pearson Product-Moment Correlation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 161 163 163 163 164 166 167 167 168 169 169 169 170 10 Importing. . . . . . . . . . . . . . . Importing Microsoft Excel Files . . . . . . . . . . . . . . . . . . . . . . . . Using the Overlay Plot Platform . . . . . . . . . . . . . . . . . . Seasonal Exponential Smoothing . . . . . . . . . . . . . . . . . . . . Inverse Correlation Matrix . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Simple Statistics . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Winters Method (Additive) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Macintosh . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . and Charting Data Introduction . . . . Covariance Matrix . . . . . . . . . . . . . . . . . Pairwise Correlations . . . . . . . . . . . . . . . . . . . . . . . . . . Simple Exponential Smoothing . . . . Inverse Correlations and Partial Correlations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Iteration History . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

. . . . . . . . Entering Factors . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Entering Responses and Factors . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 226 . . . . . . Single-Chart Options . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Continuing the Analysis. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Platform Options . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Continuing the Analysis . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . The Overlay Plot Platform . . . Platform Options . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Choosing a Design . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Viewing the Table . . . . . . . . . . . Frame Options . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 225 One-Sample and Two-Sample Means . . . . . . . . . . Specifying Output Options . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Specifying Axial Value (Central Composite Designs Only) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Choosing a Design . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Entering Responses . . . . . . . . . . . . . . . . . . . . . . . . 216 218 219 219 220 221 221 222 14 Prospective Power and Sample Size Prospective Power Analysis . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Level Options . . . . . . . . . . . . Creating a Response Surface Design . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .The Chart Platform . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Entering Responses and Factors . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 181 182 183 184 185 185 186 186 11 Full Factorial Designs Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Specifying Output Options . . . . . . . . . Selecting Output Options . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Creating a Screening Design . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 199 201 202 203 203 207 210 211 211 13 Response Surface Designs Introduction . . . . . . . . . . . . . . . . . . . . . . 189 195 195 196 197 12 Screening Designs Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . If Needed . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Making the Table . . . . . . . . . . . . . . . . . . . . . . . . . Single-Plot Options . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Displaying and Modifying the Design . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Creating a Factorial Design . . . . . . . . . . . . . . . . . . Viewing the Design Table . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

. . Two-Sample Means . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . k-Sample Means . . . . . 228 229 230 231 232 233 235 236 Index JMP-SE . . . . . . . . . . . . . . . . . . . 239 Notices Technology License Notices . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 251 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Counts per Unit . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Sigma Quality Level . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Power and Sample Size Animation for a Single Sample . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . One-Sample and Two-Sample Proportions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .Single-Sample Mean . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . One-Sample Variance . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

.

standard menus. point-and-click format. or it can be used to search for a specific topic. Statistics Since JMP Student Edition is specially-made for the beginning statistics student. select the help tool (see Figure 1. all the data tables. All of the statistics are accessible in a familiar. like those in all introductory statistics texts. JMP Student Edition is easy to learn and easy to use. whenever they present themselves. The specific prerequisites are as follows: Computer and Operating System In this manual. We hope that this visualization makes learning statistics more fun and easier than it has ever been before. It can be read like a book. minimal knowledge about computers and statistics is necessary. This book shows how to accomplish simple statistical tasks. graphs. since it contains a complete table of contents. allowing for interactive exploration of patterns and outliers. familiarity with standard computer operations and operating system terminology is assumed. In addition. .1) and click anywhere inside a data table or report. Learning JMP Student Edition with its Documentation JMP Student Edition includes an extensive online help system. Prerequisites For This Book To use JMP Student Edition.1 Getting Started with JMP Student Edition Welcome to JMP Student Edition—the version of SAS Institute’s award-winning JMP Statistical Discovery software tailor-made for the introductory statistics student. See the reference books for the operating system and computer for more information on these topics. especially use of the mouse. it requires no formal statistics knowledge. Knowledge of opening. and saving files should also exist before reading this guide. To use it. In addition. and charts are dynamically linked together. closing. and the statistical concepts are supported with both graphs and appropriate numerical results. and commands. JMP Student Edition opens help specific to the item you clicked. JMP Student Edition is equipped with context-sensitive help.

special typefaces are used to designate commands. Starting JMP Student Edition JMP Student Edition can be started in two ways: • Double-click the JMP Student Edition icon • Double-click a JMP Student Edition data set or script.jmp sample data set using a specified platform or function.2). Each “Introduction” section explores the Denim. If the JMP Starter is not automatically opened. .1 The JMP Student Edition Help Tool Help Tool Conventions Used in This Book Throughout this manual. In fact. • Menu items. JMP Student Edition begins by opening a special navigation window. • Variables under study are arranged in columns in the data spreadsheet. • File names are opened and saved to disk or network folders. menu items. or other unique features. all the “Introduction” sections could be read from each chapter before reading the rest of the material in the book. and report titles are usually set by JMP Student Edition and are not alterable by the user. They are separate from the rest of the material in the chapter. Select View > JMP Starter. They are designated by a mouse on the left. By default. called the JMP Starter (see Figure 1. which is intended primarily as a reference. • The notation File > Open means to select the Open command from the File menu. so the words variable and column are often used interchangeably.12 1 Getting Started with JMP Student Edition Conventions Used in This Book Figure 1. • New or important words are emphasized. • Sections titled “Introduction” provide a hands-on approach to learning the basics of JMP Student Edition. Certain paragraphs are meant to be carried out while reading the text. buttons.

2 The JMP Starter Window This window provides quick and easy access to all the menu commands of JMP Student Edition. There are nine tabs groups that partition the commands based on their function: • The File group contains commands related to opening and closing several types of files. Windows Toolbars that duplicate the JMP Starter’s commands include the File/Edit toolbar (Figure 1. • The Control group contains commands for control charts. • The DOE group contains commands for designing an experiment. • The Measure group contains tools for capability analysis.1 Getting Started with JMP Student Edition JMP Student Edition Toolbars 13 1 Introduction to JMP Student Edition Figure 1. they are also presented in the JMP Starter in a logical. the Analyze toolbar (Figure 1. Although these commands are accessible through menus and toolbars. the Graph toolbar (Figure 1. • The Tables tab contains commands used to manipulate data tables. • The Basic group contains commands that perform analyses for one-variable and two-variable situations.6). JMP Student Edition Toolbars An alternative way of accessing JMP Student Edition commands is by using toolbars.5). • The Model group contains commands for matched pairs (a special two-variable situation) and multivariate models. • The Graph group contains commands for charts and 3D graphics. organized way.4). • The Survival group contains reliability and survival commands. and the . the Tools toolbar (Figure 1.3).

Each of these commands is explained fully in later chapters. as well as user-customizable toolbars. There is also a Data Files toolbar.5 The Analyze Toolbar Distribution Survival/ Fit Reliability Fit Y By X Model Multivariate Matched Time Series Pairs Figure 1.7). used to switch between open data tables.8) .7 The Tables Toolbar Subset Summary Stack Sort Split Some of these toolbars are not displayed by default. Figure 1. To activate toolbars that are not showing.4 The Tools Toolbar CrossArrow Hand Simple hair Lines Shapes Help Brush Selection Zoom Polygon Scroller Lasso Annotate Figure 1.14 1 Getting Started with JMP Student Edition JMP Student Edition Toolbars Tables toolbar (Figure 1. • Select View > Toolbars > Show Toolbars to open the Show Toolbars window (Figure 1.6 The Graph Toolbar Overlay Pareto Plot Bar and Plot Pie Spinning Charts Plot Figure 1.3 The File/Edit Toolbar Run New Script Cut Data Open Copy Save Table New Paste Print Script Figure 1.

. The data should appear like the listing in Figure 1. but are all available to be added to a single toolbar. or click the Open button on the File/Edit toolbar.jmp and click Open. Control-click on the toolbar area of a window. First Session This section is a guide through a few simple steps that demonstrate opening a data table.9.8 Show Toolbars Window Macintosh On the Macintosh. select File > Open. To open a data table. To see the definitions of each button on the toolbar.1 Getting Started with JMP Student Edition First Session 15 1 Introduction to JMP Student Edition Toolbars that are checked become visible. toolbars are not set in groups. Those that are unchecked are hidden. or to add and subtract buttons from the toolbar. Figure 1. and closing a data table. drag buttons onto the toolbar to add them. select Open Data Table from the JMP Starter. Select the file Denim. From the window that appears. requesting an analysis.

16

1 Getting Started with JMP Student Edition First Session

Figure 1.9 Partial Listing of the Denim Data File

This data set contains data on the starch content of processed denim. In this example, we examine the data for the Starch Content (%) variable and answer the following questions: • What is the mean of the data? • What is its median? • What is its standard deviation? • Also, produce a histogram of the data.

Finding Means, Medians, and Standard Deviations
To answer these questions, use the Distribution platform. Select Analyze > Distribution. This brings up the launch dialog as seen in Figure 1.10.

1 Getting Started with JMP Student Edition First Session

17 1 Introduction to JMP Student Edition

Figure 1.10 The Distribution Dialog

Select the variable Starch Content (%), then click the Y, Columns button. This step tells JMP Student Edition the variable to analyze. Since Starch Content (%) is the only variable of interest, we are finished with this dialog. Click OK. The report is presented in its default vertical format. However, some people prefer a horizontal layout for the Distribution report. To change the layout to horizontal, Click on the red triangle next to the word Starch Content (%) in the report (see Figure 1.11). Figure 1.11 Red Triangles Reveal Popup Menus

Popup Menus

All of these red triangles reveal popup menus when they are clicked. Watch closely for them—they reveal further options and explorations available during the data exploration process. The menu next to Starch Content (%) shows the options for this single variable, although there are cases (seen later) where the Distribution platform operates on several variables. Options available to all the variables in the report are in the menu next to the word Distributions. Select Display Options > Horizontal Layout to see the report in Figure 1.12.

18

1 Getting Started with JMP Student Edition First Session

Figure 1.12 The Starch Content Distribution Report

The answers to the four questions are all in this report. Read off the mean (25.516634), the median (24.349), and the standard deviation (9.6568876). The histogram is shown on the left. If a printed copy of this report is needed, Select File > Print. Alternatively, this output may be included in a lab report written using a word processor. To move the report into another program, use the cut and paste features of JMP Student Edition: Select the Selection tool, which looks like a fat plus.

Selection tool

Hold down the Shift key and click on each part of the report that needs to be copied. In Figure 1.13, all the text columns and the histogram have been selected. None of the headings have been, nor has the box plot. Note that the histogram’s axis is selected separately from the histogram itself. Figure 1.13 Selection of Report Parts

Select Edit > Copy.

1 Getting Started with JMP Student Edition First Session

19 1 Introduction to JMP Student Edition

In the word processor, select Edit > Paste. Now that the analysis is completed, close JMP Student Edition. Select File > Exit.

Where to Go from Here
This simple example has shown all the steps needed to complete a JMP Student Edition analysis. From here, feel free to explore any of the sample data files that came with JMP Student Edition, explore the online help, or continue reading this book.

15. Introduction Open the data file Denim. after the initial washing process. . see “First Session. and is referred to in each introductory section of this book. However. the time it takes for some of the starch present in the jeans to wear away and wash out. • Caustic Soda is a chemical dissolved in the wash water that chemically destroys the starch. with each roll identified in the Lot Number column. When blue jeans are manufactured. This experiment used three such treatments in differently-sized wash loads. two measurements were taken: one to quantify the starch content of the fabric (measuring stiffness. creating stiffness and stability in the fabric. recorded as a percentage of weight) and one as a count of destroyed threads (measuring wear-and-tear. recorded in the Method column. In addition. denim manufacturers subject the fabric to a variety of treatments to remove some of the starch. The measured thread wear has been converted into an ordinal variable in the Thread Wear column by using the Formula Editor. These abrasive pebbles are the source of the so-called stone-washed jeans. After treating the jeans.” p. • Pumice Stone are a physical abrasive that is added to the wash water that literally pounds the starch out.jmp. in actuality. For information on opening a file. In an effort to minimize the amount of time needed to break in a new pair of jeans. are as follows: • Alpha Amalyze is an enzyme added to the wash water that eats the starch. they usually contain a fair amount of starch. displays graphs. some customers say that jeans have a “breaking in” period before they become truly comfortable. and computes hypothesis tests for these variables. recorded in the Thread Wear Measured column). It calculates summary statistics. This breaking in period is. most people find this stiffness undesirable—in fact. Whether or not the fabric was sand blasted is recorded in the Sand Blasted? column.2 The Distribution Platform Single-variable statistics are the domain of JMP Student Edition’s Distribution platform. The samples came from several different rolls of fabric. The three different treatments. About the Data This file contains information from an experiment with blue jeans. some jeans are sand blasted.

suppose the bar widths and positions of the Starch Content (%) histogram need modifying. 30 and “Categorical Variable Graphs and Reports. To change them. . Histograms and textual information on all the variables now appear. 34. see “Continuous Variable Graphs and Reports. the variables are examined one at a time. Two of the histograms from this report are used later in this chapter. For details on these reports. Click the Y.” p.22 2 The Distribution Platform Introduction Launching the Platform In this example. Figure 2. it is desirable to change these settings.” p. Move the mouse horizontally (assuming the histogram is in its default vertical layout) to change the bar widths of the histogram. and are seen in Figure 2.2 Hand Tool Hand Tool Position the hand tool over the Starch Content (%) histogram and press the mouse button.2) from the Tools toolbar.9 on page 29. Columns button. For example. This brings up the Distribution platform launch dialog.1 Distribution Launch Dialog Select the variables Method and Starch Content (%) from the list on the left by clicking on the first variable name. holding down the Control (Windows) or Option (Macintosh) key. Select the hand tool (Figure 2. Select Analyze > Distribution from the menu bar. Click OK. Figure 2. Many descriptive statistics can be read directly off the text reports accompanying these histograms. Sometimes. and clicking on the second variable name. Using Histograms Histograms appear with bar widths and positions calculated internally by JMP Student Edition.

To get. select Window > Denim-Distribution. or labels—that persist in all of JMP Student Edition’s active plots. colors. Try to determine if one of the methods results in lower starch content than the others. Data rows are highlighted in the data table so that they can be assigned row states—specific markers. select Window > Denim. For example. paying attention to the starch content that gets highlighted with each one. a 95% confidence interval on the levels of Method and Starch Content (%). its selection status ripples through all of JMP Student Edition’s open windows. Click on the histogram corresponding to Alpha Amalyze in the Method histogram. Look at the corresponding points that are highlighted in the Starch Content (%) histogram. It is often useful to have confidence intervals on the means or levels in these histograms.2 The Distribution Platform Introduction 23 2 Distribution: Single-Variable Statistics Move the mouse vertically to change the position of the bars. as are the bars in the other histograms for all the data points that have Alpha Amalyze as their method. To bring the Distribution report to the front. Notice that the corresponding rows in the data table are also highlighted. These histograms are also useful in looking at some relationships among the variables. Whenever a row is selected in any plot. To bring the data table to the front. Click in the histogram bars for Caustic Soda and Pumice Stone. for example. Highlight and explore the other wash methods. The bar for Alpha Amalyze is highlighted. .

select Test Mean from the drop-down list next to Starch Content (%).24 2 The Distribution Platform Introduction Select Confidence Interval > . Click OK. suppose that prior research claims that the mean starch content of Alpha Amalyze-washed denim is 20%. • Make separate histograms for each of the three levels of the Method variable. with the corresponding level indicated in the title bar of the histogram. Three histograms should appear. Bring up the Distribution launch dialog by again selecting Analyze > Distribution from the menu bar.95 from the drop-down menu next to the variable names in the histograms’ title bar. Select Starch Content (%) in the list of variables and click the Y. Testing a Mean Continuing the analysis. In the Method=Alpha Amalyze section. To test that the mean of the Alpha Amalyze denim has a mean of 20%. • Test the mean using the Alpha Amalyze histogram. Select Method in the list of variables and click the By button. two steps are required. To accomplish these two steps. Figure 2. Columns button.3 Test Mean .

31. examine the p-value listed beside Prob > |t|.2 The Distribution Platform Introduction 25 2 Distribution: Single-Variable Statistics Since the hypothesized mean is 20%. Normality Many statistical tests make an assumption that the data is approximately normally distributed. Click OK. This is a nonparametric test that is not usually covered in an introductory course. so leave the other entry field blank. The results of the test are appended to the Distribution report.5740. a z-test would be performed. In this case. This tells JMP Student Edition to compute a t-test of the mean. The online help contains further information on these topics. the t-test is two tailed. Also.” p. To produce a Normal Quantile Plot. Therefore.5 Normal Quantile Plot . Complete details of the Normal Quantile Plot are in the section “Normal Quantile Plots. Type 20 in the entry field for Specify Hypothesized Mean. which in this case is a non-significant 0. leave the box for the Wilcoxon Signed-Rank test unchecked. Figure 2. Although there are usually more important things to worry about than the exact normality of the data. since the percentage could be higher or lower than 20%. If the standard deviation had been known and entered. Figure 2.4 Test Mean Dialog The true standard deviation is not known. JMP Student Edition provides a quick way of assessing normality through the Normal Quantile Plot. Select Normal Quantile Plot from the drop-down list next to one of the variable’s name.

then performing a goodness-of-fit test. Normal. select Window > Denim. You can also test for Normality by fitting a Normal distribution. . yet it is tedious to select the same command many times. a Normal Quantile plot is appended to every histogram. JMP Student Edition therefore provides a way to “broadcast” a command throughout a report. Hold down the Control (Windows) or (Macintosh) key and again select Normal Quantile Plot from the drop-down list next to the variable’s name. a distribution of the Thread Wear variable is necessary. If not. To test this assumption.jmp. This produces a report showing the parameters of the distribution. Small p-values indicate a non-normal distribution. along with a goodness-of-fit statistic testing the null hypothesis that the distribution is. Many times. using the Control (Windows) or (Macintosh) key. a command needs to be sent to all the variables in the report. select Goodness of Fit from the fitted distribution report. in fact.26 2 The Distribution Platform Introduction Scroll down the report to see that this command only added a Normal Quantile Plot for one variable in the report. When the report appears. This shortcut works for most commands in drop-down menus. Testing Probabilities Another question that could be asked about this data is whether the three levels of Thread Wear occur with equal frequency. Make sure the original Denim data table is the front window. This time. Select Fit Distribution > Normal from the platform drop-down list.

in Figure 2.2 The Distribution Platform Introduction 27 2 Distribution: Single-Variable Statistics Request a Distribution of Thread Wear (not Thread Wear Measured). rescale hypothesis is selected.6 Test Probabilities Results . Make sure Fix omitted at estimated values. An addition to the report appears. select Test Probabilities from the drop-down list next to Thread Wear in the title bar of the histogram. Enter 1 into each Hypoth Prob entry field in the Test Probabilities section of the report. A screen shot of the addition appears later in this chapter. JMP Student Edition automatically scales the numbers entered into the entry fields so that they sum to one. Click Done. This allows an easy way to test for equal probabilities—simply enter 1 in each entry field.16 on page 36. After the histogram appears. Figure 2.

Figure 2. use the Annotate tool from the Tools toolbar.3333” in the blank space to the right of the Test Probabilities table. shown to the right.7. Move the mouse over the lower right corner until the cursor turns into a double arrow. The annotation turns yellow. Click and drag the corner to resize the annotation. Click somewhere outside the annotation box. Control-click (Windows) or together. Select Tag Line. Type “These results are highly significant” in the box that appears.8. . you may wish to annotate the tested probabilities from above to state that the results of the test were significant. Annotating Results If these results are to be cut and pasted into a word processing program (for a lab report. For example.001). Click and drag inside the annotation box to reposition it with the tag line anchored. so it is safe to say that the three levels occur with different probabilities. Click next to the top “0.7 The Annotate Tool Annotate Tool Select the Annotate tool. Figure 2. there may be some annotation necessary.28 2 The Distribution Platform Introduction The results of the test are listed in the column labeled Prob>Chisq. The initial click point is used in the Tag Line option below. as shown in Figure 2. To add comments to any JMP Student Edition output. or to turn in as a homework assignment). This test shows some highly significant results (p<0.8 Final Annotation -click (Macintosh) and drag to move the annotation and tag line The final annotation is shown in Figure 2. Right-click in the yellow annotation to bring up a formatting menu.

It produces histograms in any case. In the Denim. The Distribution platform shows exactly this behavior. with values “low”. . the age of people is often recorded on an ordinal scale — seldom do people report their age as 24. For example. and “severe”. although the data are not necessarily numerical. Identical platforms often result in different reports and graphs because the variables analyzed were of differing types.9 Distribution Graphs Histograms Outlier Box Plot • Categorical variables show a histogram. temperature measurements are often on a continuous scale.jmp. but are instead combined within their respective reports as a single graphic. There is an implicit order in the measuring scale. To paste a report with annotations into another document. Thread Wear. • Ordinal variables are measured on a discrete scale. In the example data set Denim. Figure 2. for example. For example. There is no order in the scale. are represented as a nominal variable in JMP Student Edition. choose Edit > Paste Special and choose one of the graphic formats. The modeling type of a variable determines what analyses JMP Student Edition performs. • Nominal variables simply name data. “moderate”.461 years.2 The Distribution Platform The Modeling Type of Variables 29 2 Distribution: Single-Variable Statistics Annotations are not preserved in standard (RTF) cut and paste. Ordinal and Nominal variables are often referred to collectively as categorical variables. People’s names.jmp data set. Variables can have one of three modeling types: • Continuous variables are numeric and measured on a continuous scale. The Modeling Type of Variables JMP Student Edition bases its reports on the modeling type of the variables it analyzes. limited only by the exactness of the measuring instrument.jmp data set. yet it is obvious that some ages are older than others. Method is a nominal variable in the Denim. but appends other graphs based on variable types. the variable Size of Load is a continuous variable. is an ordinal variable.

standard deviation. such as the mean. kurtosis. skewness. number of data points. confidence interval on the mean. standard deviation. • Horizontal Layout arranges text reports to the right of their corresponding graphs and shows the histogram as a horizontal bar chart. • More Moments adds to the Moments table the variable’s sum. . Figure 2.10 Continuous Variable Popup Menu Display Options The Display Options menu contains the following items: • Quantiles shows or hides the Quantiles table. and quantiles. JMP Student Edition produces graphs and text reports to give information from the analysis. variance. The text reports for continuous variables summarize typical univariate statistics. 31. and data set size.10) shows the options available for continuous variables.” p. Selecting this option again returns the report to a vertical layout. Histogram Options The Histogram Options menu contains the following items: • Histogram shows or hides the histogram. standard error of the mean. • Moments shows or hides the Moments table. This table displays the mean. Outlier plots are discussed in the section “Outlier Box Plot. constructed to show possible outliers in continuous variables. Continuous Variable Graphs and Reports Initially.30 2 The Distribution Platform Continuous Variable Graphs and Reports • Continuous variables have an outlier box plot. upper and lower 95% confidence limits for the mean. The popup menu for continuous variables (Figure 2. and the coefficient of variation.

Normal Quantile Plots The Normal Quantile Plot option adds a graph to the report that is used to visualize the extent to which the variable is normally distributed. This makes the identification of points with extreme values. • The Density Axis is the length of the bars in the histogram. The standard error bar automatically adjusts to reflect the histogram’s bar widths and positions when you change them using the hand tool. . Outliers are often identified as points that fall above the upper quartile + 1. Outlier Box Plot The Outlier Box Plot is a schematic that shows the dispersion of a variable. Any combination of these axes can be added to categorical or continuous histograms. The difference between the quartiles is the interquartile range. relatively easy. As the length of the bars is changed with the hand tool. • Prob Axis adds an axis that shows the proportion of each value represented by histogram bars. the data are approximately normal. the Count and Prob axes change. extending from both ends of the box. The dashed lines in the outlier box plot are sometimes called whiskers. The whiskers extend to the outermost data point that falls within the distances computed for judging outliers. also called the quartiles. the normal quantile plot is approximately a diagonal straight line. The line across the middle of the box identifies the median sample value.2 The Distribution Platform Continuous Variable Graphs and Reports 31 2 Distribution: Single-Variable Statistics • Std Err Bars draws the standard error bar on each level of the histogram. but the Density axis remains constant.5×(interquartile range) or below the lower quartile – 1. or q-q plot. and the means diamond indicates the sample mean and 95% confidence interval. sometimes called outliers. The Normal Quantile plot also shows confidence bounds. This kind of plot is sometimes also called a quantile-quantile plot. If a variable is normal. If the data fall within these confidence bounds. The ends of the box are the 25th and 75th quantiles. • Count Axis adds an axis that shows the frequency of each value represented by the histogram bars.5×(interquartile range).

Values are reconstructed by using the legend at the bottom of the plot. Each line of the plot has a stem value that is the leading digit of a range of column values. Figure 2. Quantiles are values that divide a distribution into two groups — where the pth quantile is larger than p% of the values. but have greater spacing at the other end (as in this picture). the quantiles are approximately equidistant from each other.11. .11 Stem and Leaf Plot Stem and leaf plots have similar interactive capabilities to JMP Student Edition’s graphics plots. the Quantile Box Plot is useful for seeing normality in a graphical way. The leaf values are made from the remaining digits of the values. They remain useful because they show the actual data at the same time as the shape of the data. Like the Normal Quantile Plot. Quantile Box Plots The Quantile Box Plot command shows additional quantiles (sometimes called percentiles) on the axis of the histogram. half the data are below the 50th percentile (median).32 2 The Distribution Platform Continuous Variable Graphs and Reports The red bracket along the edge of the box identifies the shortest half. Stem and Leaf Plots The Stem and Leaf command constructs a plot that is essentially a variation on the histogram. the smallest length that contains 50% of the data. the distribution is skewed toward the end with more spacing. It was developed for tallying data in the days when computer printouts were neither graphical nor easy to produce. in that they highlight corresponding data points in the data table when they are selected in the plot. For example. For example. if one exists). This is useful when determining the shape of underlying distributions. the third line of the table reveals that there are data points with values 40 and 41. Note that the quantile box plot is not the same as the outlier box plot from page 31. If a distribution is normal. The data values can be reconstructed by joining the stem and leaf (and multiplying by the scale factor. if the quantile marks are grouped closely at one end. In the example pictured in Figure 2.

Weibull) to the data.2 The Distribution Platform Continuous Variable Graphs and Reports 33 2 Distribution: Single-Variable Statistics CDF Plot The CDF Plot command plots a cumulative distribution function step plot using the observed data (with weights or frequencies if specified). you can select among several options. After fitting. including a Goodness of Fit test. A CDF plot (Figure 2.12 CDF Plot for Size of Load Fit Distribution The Fit Distribution menu allows you to fit certain distributions (Normal. Lognormal. Consult a statistics text for a definition of a density function. Figure 2. .12) estimates the area under the density curve up to each data point.

and the probability associated with each variable. Right-click on the table to bring up a popup menu (Figure 2.14) Select Columns to reveal a popup menu of all possible columns for the table.13). including: . Select the column to be shown or hidden. the count (sometimes called the frequency) of each level. Figure 2.15) work the same as those for continuous variables (see “Continuous Variable Graphs and Reports. 30. Tables that are currently shown have a check mark beside them.15 Drop-down Menu for Categorical Variables Statistical Tests JMP Student Edition contains numerous statistical tests for single variables. The standard error of these probabilities (StdErr Prob) and the cumulative probabilities (Cum Prob) for the data are also computed. Figure 2. To see them. This probability is simply the ratio of each level’s count to the total count.13 Frequencies Table This table lists the levels of a categorical variable.” p. but are not initially shown in the results table.34 2 The Distribution Platform Categorical Variable Graphs and Reports Categorical Variable Graphs and Reports The only text report that appears by default in categorical distribution reports is a frequencies table (Figure 2.) Figure 2.14 Table Popup Menu The options listed in the drop-down menu for categorical variables (Figure 2.

the Test Standard Deviation table is appended to the bottom of the reports for that variable. The Test Mean command calculates and displays the following statistics: • t Test (or z test) lists the value of the test statistic and the p-values for the two-sided and one-sided alternatives. The value of this probability is 1 – Prob>t. After clicking OK. The test assumes nothing about the normality of the distribution.2 The Distribution Platform Statistical Tests 35 2 Distribution: Single-Variable Statistics • a test of the mean of a continuous variable • a test of the standard deviation of a continuous variable • a test of the probabilities of a categorical variable These tests are all accessed through the popup menu next to the variable’s name at the top of the report. a z-test is computed. Testing a Mean The Test Mean command prompts for a test value to compare to the sample mean. The test assumes the distribution is normal. This is the p-value for observed significance of the two-tailed t-test. the Test Mean table is appended to the bottom of the reports for that variable. Each time the mean is tested. Otherwise. a new table is appended to the text report. the sample standard deviation is used to compute a t-statistic. The value of this probability is half of Prob > |t|. only that it is symmetric. The probability values given in the Test Mean table are defined: • Prob > |t| is the probability of obtaining a greater absolute t value by chance alone when the sample mean is not different from the hypothesized value. • Prob < t is the probability of obtaining a t value less than the computed sample t ratio by chance alone when the sample mean is not the hypothesized value. This is the p-value for observed significance of a one-tailed t-test. This is the p-value for observed significance of a one-tailed t-test. • Prob > t is the probability of obtaining a t value greater than the computed sample t ratio by chance alone when the sample mean is not the hypothesized value. the nonparametric Wilcoxon signed-rank test can be requested. After clicking OK. The Test Standard Deviation table shows the computed Chi Square statistic that tests whether the hypothesized standard deviation is the same as the computed sample standard deviation. Each time a standard deviation is tested. Testing a Standard Deviation The Test Std Dev command requests a test value for statistical comparison to the sample standard deviation. If a value is entered for the standard deviation. a new Test Mean table is appended to the text report. The Test Std Dev command can be used repeatedly to test different values. and the probabilities associated with that Chi Square value: . Use the Test Mean command repeatedly to test different values. • Signed-Rank lists the value of the Wilcoxon signed-rank statistic followed by the p-values for the two-sided and one-sided alternatives. Optionally.

• Prob<ChiSq is the probability of obtaining a Chi Square value less than the computed sample Chi Square by chance alone when the sample standard deviation is not the hypothesized value. the easiest way to test that all the probabilities are equal is to enter a one in each field. Therefore. where hypothesized probabilities are entered. The Likelihood Ratio and Pearson Chi Square tests are calculated for those probabilities. This is the p-value for observed significance of the two-tailed t-test. leave the levels that are not involved blank. • Prob>ChiSq is the probability of obtaining a Chi Square value greater than the computed sample Chi Square by chance alone when the sample standard deviation is not the hypothesized value. JMP Student Edition substitutes estimated probabilities for those left blank. To test a subset of the probabilities. The radio buttons on the dialog allow a choice between rescaling hypothesized values to sum to one or using the entered value without rescaling. Testing Categorical Probabilities The Test Probabilities option displays the dialog shown in Figure 2. This is the p-value for observed significance of a one-tailed t-test. .17).16 Test Probabilities Test Probabilities can scale the hypothesized values so that the probabilities sum to one. Figure 2. This is the p-value for observed significance of a one-tailed t-test.36 2 The Distribution Platform Confidence Intervals • Prob>|ChiSq| is the probability of obtaining a greater absolute Chi Square value by chance alone when the sample standard deviation is not different from the hypothesized value. confidence intervals for a mean are automatically displayed in the Moments table (Figure 2.16. Confidence Intervals For continuous variables.

19 Confidence Intervals for Categorical Variables Saving Information To save information computed from continuous response variables. Select the desired level to see. select Confidence Interval > Other and enter the desired level. Figure 2. 0.19. An example is shown in Figure 2. A dialog box appears that lets you specify which confidence interval to display. Each command generates a new column in the current data table named by appending the response column name (denoted colname in the following definitions) to the saved statistic’s name. This enables the same statistic to be saved multiple times under different circumstances. Figure 2. Select Confidence Interval from the popup menu next to the variable name in the title bar.95. select Confidence Interval > 0. use the Save menu commands. To obtain a confidence interval alpha that is not listed on the Confidence Interval menu.17 The Moments Table for Continuous Variables To calculate confidence intervals for a specific value of α. If the Save com- . The Save commands can be used repeatedly. for example.95.2 The Distribution Platform Saving Information 37 2 Distribution: Single-Variable Statistics Figure 2.18 Confidence Intervals for Continuous Variables To obtain the equivalent table for categorical variables. such as before and after combining histogram bars.

to create unique column names. If a value is unique. • Ranks creates a new column called Ranked colname that contains a ranking for each of the corresponding column’s values. and so forth. if it is saved with the current data table. This column is similar to the empirical cumulative distribution function. starting at 1. the column name for the statistic is named colname1. • Level Midpoints creates a new column. For N non-missing scores. • Standardized creates a new column. called Prob colname. which is then divided by the column standard deviation). Whole-Platform Options Each statistical platform has a popup menu in the outermost outline level next to the platform name. called Level colname. This contains the original column’s standardized values (each value in the column has had the column mean subtracted. The histogram bars are numbered from low to high. If the script is saved to a file. called N-Quantile colname. the probability score of a value is computed as the averaged rank of that value divided by N+1. they are assigned consecutive ranks in order of their occurrence in the spreadsheet. If a value occurs k times. • Stack lets you orient all the output in the report window as either portrait or landscape. you can edit it. colname2. If there are duplicates in the column. called Midpoint colname. it is available to run the next time you open the table. . called RankAvgd colname. • Normal Quantiles creates a new column. maximum. When there are By-Groups the script includes JSL Where clauses that identify the By-Group levels. and intervals so that the distributions are easily compared. • Prob Scores creates a new column. • Script lets you rerun or save the JSL script that produced the platform results. its averaged rank is the same as the rank. The whole-platform options for the Distribution platform include the following: • Uniform Scaling scales all axes with the same minimum. called Std colname. beginning with 1. The JSL generated by Save Script for All Objects is the same as Save Script to Script Window if there are no By-Groups. The Save menu contains the following commands: • Level Numbers creates a new column. This option applies to reports for all response variables when selected.38 2 The Distribution Platform Whole-Platform Options mand is used multiple times. Options and commands in this menu affect all text reports and graphs on the platform. The midpoint value for each observation is computed by adding half its level width to its lower level bound. its average rank is computed as the sum of its value’s ranks divided by k. These normal scores are Van Der Waerden approximations to the expected order statistics for the normal distribution. The level number of each observation corresponds to the histogram bar that contains the observation. • Ranks averaged creates a new column.

The capability study measures the conformance of a process to given specification limits. grouped by a column or a fixed sample size. which is especially useful when there are By-Groups.21 Capability Analysis Dialog Box All capability analyses use the same formulas. The Dialog box shown in Figure 2.21 appears. and has sigma computed as . This is the option used for Ppk statistics.2 The Distribution Platform Capability Analysis 39 2 Distribution: Single-Variable Statistics Figure 2. select Capability Analysis from the popup menu on the outline bar for the variable of interest. Capability Analyses can calculate capability indices using several different short-term estimates for σ. A dialog prompts you for Lower Spec Limit. allowing specification of long-term or one or more short-term sigmas. Capability Analysis The Capability Analysis option gives a capability analysis for quality control applications.20 The Script Menu • Data Table Window gives a view of the underlying data table. You only have to enter one of the three values. These options for sigma can be explained as: • Long-term uses the overall sigma. Upper Spec Limit. Figure 2. the process standard deviation. After requesting a Distribution. and Target. Optionally. Only those fields you enter are part of the resulting Capability Analysis table. The difference between the options lies in how sigma is computed. you can enter a known value for sigma.

if r is the number of subgroups and each ith subgroup is defined by the order of the data. user-specified and is therefore not computed.40 2 The Distribution Platform Capability Analysis n σ = ∑ i=1 • Specified Sigma allows the user to enter a specific. . Grouped by fixed subgroup size and is commonly referred to as the Root Mean Square Error or RMSE. Sigma is. they appear on the histogram when opened at a later time.22. This option is found using File > Preferences. When you click OK. 2 i=1 ----------------------------------n–r–1 (Note that this is the same formula for Short Term. 2 i=1 ----------------------------------n–r–1 • Short Term. sigma is computed as n ∑ σ = ( x ij – x ) i. choose Spec Limits from the Save command menu. Grouped by fixed subgroup size computes σ using the following formula. the platform appends a Capability Analysis table. To do this. with r equal to the number of subgroups. This is the option used for control chart-generated capability analyses.) Note: There is a preference for Distribution called Ppk Capability Labeling that will label the long-term capability output with Ppk labels. where the sigma used in the chart is entered (in the dialog) as the specified sigma. • Short Term. sigma is computed as n ∑ σ = ( x ij – x ) i. The specification limits can be stored and automatically retrieved as a column property. Grouped by Column brings up a column list dialog from which you choose the grouping column. at the bottom of the text reports. like the one in Figure 2. obviously. You can remove and redo a Capability Analysis as many times as you want. In this case. When you save the specification limits. known sigma used for computing capability ( xi – x ) ------------------n–1 2 analyses. In this case.

• The Value column lists the values you specified for each limit and the target • %Actual is the observed percent of data falling outside the specification limits. Sigma Quality = Normal Quantile ⎛ 1 – Expected ( # defects )⎞ + 1. 41. Table 2.” p. Upper Spec Limit. describes these indices and gives computational formulas.2 The Distribution Platform Capability Analysis 41 2 Distribution: Single-Variable Statistics Figure 2.03 Sigma process.5 -----------------------------------------------⎠ ⎝ n For example.000. They are Lower Spec Limit.1 Capability Index Names and Computations Index Index Name Computation CP process capability ratio. if there are 3 defects in n=1. The above and below columns do not sum to the total column because Sigma Quality uses values from the Normal distribution.22 The Capability Analysis Table The Capability Analysis table is organized into two parts.1 “Capability Index Names and Computations. It also lists the percent and PPM for areas outside the spec limits.000 observations. The upper part of the table shows these quantities: • The Specification column lists the names of items for which values are shown. This Sigma Quality measurement is frequently used in Six Sigma methods. Table 2. The lower portion of the Capability Analysis table lists five basic process capability indexes.03.The PPM column (parts per million) is the Percent column multiplied by 10. and is also referred to as the process sigma. and their upper and lower Confidence Intervals. the formula yields 6. Cp (USL – LSL)/6s where USL is the upper spec limit LSL is the lower spec limit . and is therefore not additive. or a 6. their values.000. and Spec Target.

Cpk Expected Let Value ˆ E ⎛ C pk⎞ ⎝ ⎠ min(CPL.42 2 The Distribution Platform Capability Analysis Table 2. CPU) n ⎛ µ – USL + LSL⎞ ------------------------.d n – 2 -. the variance is d .1 Capability Index Names and Computations CIs for Lower CI 2 χ1 – α CP on CP w CP --------------------------n–1 ------------. Then the expected value is n–2 Γ ⎛ -----------⎞ 2 1 n – 1 ⎝ 2 ⎠⎛ 2 -.-------------------.⎛ n – 1⎞ 36 ⎝ n – 3⎠ 9 n ⎝ n – 3⎠ 2 –c 2 -.----------. n – 1 (continued) Upper CI on CP χ1 – (1 – α) 2 w ---------------------------------------n–1 ------------------------.⎠ ⎝ 2 c = -----------------------------------------------σ denote the noncentrality parameter and d = USL – LSL ------------------------σ represent the specification limit in σ units.exp ⎛ ⎛ – c -⎞ – 2c ( 1 – 2Φ ( – c ) )⎞ ⎞ ------⎝ ⎝⎝ 2 ⎠ ⎠⎠ π 2 2 Lower CI Upper CI ˆ ˆ E ⎛ C pk⎞ – k Var ( C pk ) ⎝ ⎠ ˆ ˆ E ⎛ C pk⎞ + k Var ( C pk ) ⎝ ⎠ . n – 1 - CPK (PPK for AIAG) CIs for CPK process capability index.⎛ n – 1⎞ – --------..----------d .--------------.exp ⎛ ⎛ – c -⎞ – 2c ( 1 – 2Φ ( – c ) )⎞ ⎞ ------⎝⎝ 2 ⎠ ⎠⎠ 6 2n ⎛ n – 1⎞ ⎝ π Γ ----------⎝ 2 ⎠ Variance Var ⎛ C pk⎞ ⎝ ⎠ Using c and d from above.exp ⎛ -------⎞ + c ( 1 – 2Φ ( – c ) ) ⎝ 2 ⎠ π 2 n–2 2 Γ ⎛ -----------⎞ ⎝ 2 ⎠ 2 1 ------------------n–1 n – 1⎞ -------------------⎛ -----------+ (1 + c ) – ⎝ 9 n(n – 3) 72n ⎠ ⎛ n – 1⎞ Γ ----------⎝ 2 ⎠ 2 ⎛ d n – 2 -.

2 The Distribution Platform Capability Analysis 43 2 Distribution: Single-Variable Statistics Table 2. γ - CPL CPU process (mean – LSL)/3s where s is the estimated standard deviation capability ratio case of one-sided lower specification process (USL . (1990). a capability index of 1. At the bottom of the report.mean)/3s capability ratio of one-sided upper specification In Japan. Z represents (according to the AIAG Statistical Process Control manual) the number of standard deviation units from the process average to a value of . γ 2 CPM -------------------γ 2 Upper CI on CPM χ1 – (1 – α) 2 CPM -------------------------------γ ------------------------. Exact 100(1 – α)% lower and upper confidence limits for CPL are computed using a generalization of the method of Chou et al. who point out that the 100(1 – α) lower confidence limit for CPU (denoted CPULCL) satisfies the equation Pr { T n – 1 ( δ = 3 n ) ( CPULCL ≥ 3CPL n ) } = 1 – α where Tn–1(δ) has a non-central t-distribution with n – 1 degrees of freedom and noncentrality parameter δ. Exact 100(1 – α)% lower and upper confidence limits for CPU are also computed using a generalization of the method of Chou et al. who point out that the 100(1 – α) lower confidence limit for CPL (denoted by CPLLCL) satisfies the equation Pr { T n – 1 ( δ = 3 n )CPLLCL ≤ 3CPL n } = 1 – α where Tn–1(δ) has a non-central t-distribution with n – 1 degrees of freedom and noncentrality parameter δ. For a normal distribution. this gives an expected number of nonconforming units of about 6 per 100.33 is considered to be the minimum acceptable. Cpm CIs for Lower CI 2 χ1 – α CPM on CPM ------------. Z statistics are reported.000.. (1990). USL – target ) --------------------------------------------------------------------------capability 2 2 3 s + ( mean – target ) index.1 Capability Index Names and Computations (continued) CPM process min ( target – LSL.

Cum Prob(Z LSL) P(USL) = Prob(X > USL) = 1 . . If you give a target value. described in the next section. a capability analysis is automatically generated by using the quantile values and target you specified.Cum Prob(Z USL) Note: You can also do a non-normal capability analysis through Fit Distribution options. After you fit a distribution. you have the option to generate quantiles and a target value for the fitted distribution.P(USL)) where P(LSL) = Prob(X < LSL) = 1 . Z USL = (USL-Xbar)/sigma = 3 * CPU Z LSL = (Xbar-LSL)/sigma = 3 * CPL Z Bench = Inverse Cumulative Prob(1 . Z USL is the distance to the upper specification limit and Z LSL is the distance to the lower specification limit.P(LSL) .44 2 The Distribution Platform Capability Analysis interest such as an engineering specification. When used in capability assessment.

Details about this data are found in Chapter 2. This single platform produces one way ANOVA. the variables are examined in pairs.1 Fit Y By X Launch Dialog Notice the word “Contextual” in the title bar. Introduction After starting JMP Student Edition.” p. Select Analyze > Fit Y By X from the menu bar. 21. “The Distribution Platform” in the section “About the Data. and contingency table analysis—most of the two-variable analyses seen in an introductory statistics course. the Fit Y by X platform is the choice to make. scatterplots. Open the data file Denim.3 The Fit Y by X Platform Any time two variables need to be compared. More . This brings up the Fit Y By X platform launch dialog as seen in Figure 3.1 Figure 3. It is there because this dialog launches other platforms depending on the modeling types (continuous or categorical) of the variables in the analysis. Launching the Platform In this introduction section.jmp.

Computing a t-test As a simple example. Three plots appear as in Figure 3. Select Starch Content (%) from the list of columns and click the Y.3 t-test Results Some things should be noticed about this report. Is the starch content different for the two levels of Sand Blasted? This is a typical situation examined with a two-sample t-test. Response button. this example consists of three analyses.46 3 The Fit Y by X Platform Introduction information on modeling types is found in “The Modeling Type of Variables. Select t test from the drop-down menu in the plot’s title bar. . All three analyses are requested at the same time to illustrate some of JMP Student Edition’s interactive capabilities. then click Sand Blasted?.2 Fit Y by X Results On the far left and far right. Note that these dialog boxes respond to dragging as well as button clicks. click on Method.2. To conduct the t-test. hold down the Shift key. and Sand Blasted? are the X variables. Figure 3. a situation leading to fitting lines and curves. The t-test report appears in the outline beneath the plot labeled t test. To select all three X variables. Method. Figure 3. dot plots of each level of a nominal variable are plotted side by side. 29. a situation leading to one-way ANOVAs. These analyses would be equally valid if performed separately. Size of Load (lbs). In the middle plot. as in the next step. Factor button. Initially.” p. Drag these highlighted variables to the box to the right of the X. Click OK. JMP Student Edition produces a scatterplot of two continuous variables. with Starch Content (%) as the Y variable in all of them. examine the plot on the far right. relating starch content to whether the fabric was sand blasted or not.

2. with two different methods. All of these are discussed later in this tutorial in “Analysis of Variance (anova). 71 in the “The Matched Pairs Platform” chapter. Denim washed with Alpha Amalyze appears to have a lower starch content than denim washed with . Here. Method. and other text tables. In essence. JMP Student Edition has tested the same hypothesis twice. Notice that this is the same value as listed in the new Analysis of Variance report in the column labeled Prob>F. of Starch Content (%) vs. select the Means/Anova/Pooled t command. Select the Means/Anova/Pooled t command from the report’s drop-down menu. not a matched pairs t-test.3 The Fit Y by X Platform Introduction 47 3 Fit Y by X: Comparing T wo Variables • There is a statement on the second line of the t-test report that says “Assuming unequal variances”. In fact.” p. examine the plot on the left side in the report in Figure 3. select Means/Anova/t test to remove the pooled t report. • This is a two sample t-test. From the drop-down menu in the Oneway Analysis title bar. 50. • The plot gets embellished with means diamonds.” p. Details on matched pairs are found in “The Matched Pairs Platform. the before-and-after measurements of a patient taking an experimental medication). If you want the pooled version (where the variances are assumed to be equal). the square of the t statistic (listed under t-Test) is equal to the value of the F statistic (listed in the ANOVA table as F Ratio). note that the p-value for the unpooled t-test is listed beside Prob>|t| in the original t test report. This test is also known as the unpooled t-test. use the Matched Pairs platform. Pooled t test Now. and both methods agree (as they always should!). If the data from the two groups have a natural pairing (for example.

Text reports appear below the plot. examining the mean. as shown in Figure 3. but also for selecting results in other plots. Details on these additions are found in “One Way anova—The Continuous by Categorical Case. or the online help. select Quantiles. which are produced as follows.” p. Box plots are superimposed on the plot. The “captured” points become selected. For more specificity. 61. it is helpful to look at text reports of these results. see the section titled “Quantile Box Plots. From the same drop-down menu. Figure 3.48 3 The Fit Y by X Platform Introduction Caustic Soda or Pumice Stone. standard deviation. Selecting and Marking Points This plot is not only useful for computing results. . and box plots are superimposed on the plot. giving clues to the underlying distribution of each level.” p. and quantiles for the three levels of the Method variable. mean error bars and standard deviation lines appear on the plot. These additions can also be removed. select Quantiles and Means and Std Dev again. median.4 Lasso Tool Lasso Tool The Lasso tool is used to draw curves around points. In the same drop-down menu. From the drop-down menu in the Oneway Analysis title bar.4. In addition to the new text reports. Select the Lasso tool from the Tools toolbar. 32. select Means and Std Dev. For details on box plots.

5 Selecting Points with the Lasso Tool JMP Student Edition briefly flashes how many points are contained in the selection region (32 in this case) and selects the points. Release the mouse button. The Alpha Amalyze points turn red in all the plots. select Color or Mark by Column. Set Marker by Value. suppose that plots are needed that clearly distinguish the three levels of the Method variable. and Make Window with Legend checkboxes are checked. Select the small triangle from the markers palette. there is an easier way to change the colors and markers of points in a plot if there is a certain column that divides up the data. Make sure that the Set Color by Value. In the resulting dialog box. To make these points distinctive. as shown in Figure 3. . Again. To mark all the data at once. With the points selected. right-click inside the plot.3 The Fit Y by X Platform Introduction 49 3 Fit Y by X: Comparing T wo Variables While holding down the mouse button. Unique colors and markers are assigned to each level of the Method variable in all the plots. select Row Colors from the popup menu. From the Rows menu. select Method from the list of variables. The Lasso tool works with all plots that show individual points. like scatterplots and leverage plots. In fact. For example. Notice that these points are highlighted in all the plots. and choose a red color from the color palette. and in the data table. The Alpha Amalyze points change to the triangle in all the plots. right-click inside the plot and select Row Markers. Figure 3. drag the Lasso tool completely around the points for Alpha Amalyze.5. assign them a unique color and marker.

6 Means Diamonds Illustrated Group Mean Overlap Marks 95% CI Examining the ANOVA table shows that Method is a highly significant predictor of starch content. and means diamonds appear on the plot. Select Compare Means > Each Pair. The horizontal line across each diamond represents the group mean. levels that are not significantly different from it turn red. When a circle is clicked. An ANOVA table appears beneath the plots. it turns red. the t test is not appropriate. Comparison Circles To show comparison circles. at least one level of the Method variable has a significantly higher or lower starch content than the others. Overlap marks are drawn above and below the group mean. Click on the bottom comparison circle.50 3 The Fit Y by X Platform Introduction Analysis of Variance (ANOVA) Is knowledge of the wash method useful in predicting starch content of the denim? The statistical test to answer this question is called a one-way ANOVA. JMP Student Edition uses comparison circles to explore this. overlapping overlap marks indicate that the two group means are not significantly different at the 95% confidence level. The vertical span of each diamond represents the 95% confidence interval for each group. as shown by the schematic in Figure 3.6. Figure 3. A means diamond illustrates a sample mean and its 95% confidence interval. so is not available on the menu. Note that his command reads Means/Anova/t test when the categorical variable only has two levels. and are clickable. and is produced in the same way as the t-test above. In all other cases (like this one). corresponding to Alpha Amalyze. . Select Means/Anova from the drop-down menu next to Oneway Analysis. To see this. For groups with equal sample sizes. they show differences among levels of a variable. and levels that are significantly different from it turn gray. The obvious question is which levels are different from each other. Student’s t Complete details of comparison circles are on page 63. Put simply. In other words.

Figure 3.7 Starch Content Comparison Circles The display changes to the one shown in Figure 3.8 Fit Mean Results Platform Menu Fit Menu . To see this mean. Notice that the comparison circles change diameter when the α-level changes.8. Select Fit Mean from the drop-down list in the title bar of the plot. For example. it can be changed.2 is of two continuous variables.3 The Fit Y by X Platform Introduction 51 3 Fit Y by X: Comparing T wo Variables Figure 3. A good guess may simply be the mean starch content from all the data points. Notice that Fit Mean below the plot has its own drop-down menu. select Set α Level > . and Fit Mean appears in a legend below the plot. as shown in Figure 3. the α-level is 5%. a situation that allows fitting of lines and curves through least-squares regression.7. These comparison circles are based on the confidence interval around the mean. Click on the other two circles to discover their relationships. which is itself based on the α level. A line representing the mean appears on the plot. suppose you want to predict starch content based on the size of the wash load. Fitting Lines The middle plot of the report in Figure 3.10. This shows that Alpha Amalyze is significantly different from the other two wash methods. By default. From the popup menu in the Oneway title bar. However.

the dotted confidence interval around the linear fit does not contain the mean. then the fitted line is helpful. Values of the slope and intercept are also printed in the Parameter Estimates section of the report. there are numerical measures of the significance. One is the p-value associated with the slope of the line. The equation of the line. To fit a regression line to this data. Select Confid Curves Fit from the Linear Fit menu in the legend below the plot. the linear fit is statistically significant. It is statistically sound to use the fitted line in predictions. In this case. using the Confid Shaded Fit command.52 3 The Fit Y by X Platform Introduction A more interesting statistical question is whether a line or a curve is a better predictor of starch content than this simple mean. also found in the Parameter Estimates report. as well as several computed statistics. JMP Student Edition can draw confidence intervals for the fit around the fitted line. To do this comparison.9. significant by almost any standard. reinforcing the graphical results above. Aside from the graphical confidence-curve method detailed above. This line should be compared with the simple mean to see if it is helpful in prediction. . Therefore. Figure 3. A line is superimposed on the graph. If these confidence intervals do not contain the horizontal mean. the p-value is 0. Select Fit Line from the platform menu in the plot’s title bar.9 Fit Line Results There is also an option to produce shaded confidence curves. As seen in Figure 3.0045. are found in the Linear Fit report.

. Residuals One of the best diagnostic tools for a linear fit is its residuals.10 The Disclosure Icon Click here.11 Residual Plot The plot is getting cluttered. A plot of the residuals appears at the bottom of the report.. Another interesting question is whether a single line (like this model) is enough to describe starch content for all wash methods. Select Plot Residuals from the fit popup menu in the legend below the plot. It is interesting to note that its significance (p=0. To compute the value of r itself. For each of the fits below the plot. there are different residuals for each fit. but can be opened by clicking on the blue disclosure icon (Figure 3. the first step in interpretation of residuals is to see them plotted.95 from the platform menu on the plot’s title bar. A new report named Correlation appears at the bottom of the text reports. To see residuals for the linear fit. Of course. frequently denoted by r.3 The Fit Y by X Platform Introduction 53 3 Fit Y by X: Comparing T wo Variables Correlation Coefficient Another measure of fit is the correlation coefficient. ideally. request a density ellipse.10). and the same as the Prob>F value in the Analysis of Variance table. so residuals commands are found in the fit menus in the legends below the plots. or if a different line is needed for each level of the Method variable. The red horizontal line on the plot shows the mean of the residuals (which should.to reveal this report The correlation coefficient is listed under the word Correlation. Its value does not appear on any of the reports so far. Disclosure Icon Figure 3. Select Density Ellipse > .0045) is the same as that listed for the slope coefficient in the Parameter Estimates table. . which is not duplicated here. although the square of its value (r2) is listed beside RSquare in the Summary of Fit text report. In other .. Figure 3. so before continuing. select Remove Fit from the fit popup menu. However. be near zero).. remove the fits that are there now. All (good) introductory statistics textbooks discuss interpretation of residuals. It is initially closed.

Factor role. instruct JMP Student Edition to group its calculations for each level of the Method variable. request JMP Student Edition to fit a line as before. some initial investigation is easy with this platform. In the resulting dialog box. To generate a contingency table for this problem. Note that the mosaic plot is clickable. instead of the more compact (but possibly less accurate) reporting of the single line found previously. select Group By. both X and Y are categorical variables. Response role. From the platform menu on the plot’s title bar.54 3 The Fit Y by X Platform Introduction words. Click OK.” p. Two-Way Contingency Tables In the next example. Select Analyze > Fit Y by X from the menu bar.13) . red section in the mosaic plot in the bar above Alpha Amalyze (see Figure 3. a mosaic plot appears. Details of these displays are in the section “Contingency Analysis—The Categorical by Categorical Case. For example. select Method as the grouping variable. Figure 3. Since both variables are categorical. as shown in Figure 3. Select Fit Line from the Platform popup menu on the title bar above the plot. 66. Separate lines for each wash method appear on the plot. Assign Method to the X. First. Assign Thread Wear (not Thread Wear Measured) to the Y. Click in the lower. to select all rows washed in Alpha Amalyze with a low thread wear. is starch content related to load size in the same way when washed in Alpha Amalyze as when washed in Caustic Soda or Pumice Stone? Although this question is more in the realm of the Fit Model platform (detailed in “The Fit Model Platform” chapter). Now. The question is whether the method of washing denim has an effect on thread count. followed by a contingency table. The analysis uses contingency tables—orderly ways of arranging count data.12 Group By and Linear Fits The statistical question is whether these lines are different enough to warrant the extra trouble in reporting all three.12. like all plots in JMP Student Edition.

which is shifted horizontally to produce probabilities for the ordered categories. there is not enough evidence to say that the thread count of denim is affected by wash method. Factor. .3 The Fit Y by X Platform Introduction 55 3 Fit Y by X: Comparing T wo Variables Figure 3. There is a distinction between nominal and ordinal responses on this platform: • Nominal logistic regression estimates a set of curves to partition the attributed probability among the responses.76. moderate. Click OK. Select Analyze > Fit Y By X Assign Thread Wear as Y. and is recommended for ordered responses. The p-value for this test appears in the column labeled Prob>ChiSq.13 Mosaic Plot Click in this area to select Alpha Amalyze-washed jeans with low thread wear Just below the contingency plot are tests for the independence of the two variables. As an example. This has the effect of estimating a single logistic curve. The fitted model estimates probabilities attributed to each x value. There is not enough evidence to say that these two variables are not independent—in other words. • Ordinal logistic regression models the probability of being less than or equal to a given response. Logistic Regression The Logistic platform fits the probabilities for response categories to a continuous x predictor. This model is less general but more parsimonious. The report that appears shows the probability that the thread wear is low. The logistic platform is the nominal/ordinal by continuous personality of the Fit Y by X command. Response and Size of Load (%) as X. which in this case is the non-significant 0. or severe for each load size.

Select Formula from the menu that appears. A new column is needed to store the new values in. Its values are automatically recomputed whenever the values in these columns are edited. It is locked so that its data values cannot be edited individually.56 3 The Fit Y by X Platform Introduction The p-value of 0.0657 hints at a weak association between these two variables. the ordinal variable Thread Wear is computed with a formula that partitions the values of Thread Wear Measured into low. and an extensive list of easy-to-use features. Right-click in the heading of the column Thread Wear. A column whose values are computed by a formula is both linked and locked. Formulas serve a wide variety of purposes. . It is linked to (and dependent on) all other columns that are part of its formula. To see the formula. In the Denim. Example The essential features of the formula editor are best seen through an example. moderate. Click Cancel to return to the data table. from assigning simple values to computing complex calculations with parameters and conditional clauses. Suppose that it is necessary to calculate the logarithm of the values in the Starch Content (%) column (This calculation is common in real-world statistics). The Formula Editor A powerful (and often under used) feature of JMP Student Edition is its formula editor. There are two methods to create a new column. The Formula Editor window operates like a pocket calculator with buttons. and severe categories.jmp sample data table. They are especially useful when transforming data. displays.

are found in JMP Student Edition’s online help. add a formula to the column. Click on Transcendental from the Functions list. Change the title of the new column to Log Starch. the default title is highlighted and ready to be changed. . double-click in the area to the right of the last column in the data table. Click OK to close the Formula Editor and apply the formula. Now. Right-click in the heading of the column Log Starch. Note that Log is the natural logarithm (base e). When the formula editor appears. Alternatively. as well as complete documentation of all the formula editor functions. Select Formula from the menu that appears. Select Starch Content (%) from the columns list.3 The Fit Y by X Platform Introduction 57 3 Fit Y by X: Comparing T wo Variables From the main menu bar. Further examples. select Cols > New Column. then select Log from the resulting menu. Common (base 10) logs are computed using the Log10 function. When a new column is created.

its standard deviation.58 3 The Fit Y by X Platform Scatterplots—The Continuous by Continuous Case Scatterplots—The Continuous by Continuous Case If both the X and Y variables are continuous.The Fit Mean Fit table shows the value of the mean. where additional commands for each fit are accessed. its standard error. .15) has commands to save predicted values and residuals for the linear fit as new columns in the current data table.14) Figure 3. There are a number of options once the scatterplot appears. The Fit Mean command adds a horizontal line to the plot at the mean of the response variable (Y). a legend appears below the plot with its own drop-down menu.15 Fit Commands The Fit Line command adds a straight-line fit to the plot using least squares regression. It is often useful to first fit the mean as a reference line for other fits. Its drop-down menu (accessed as in Figure 3. and the sum of squared errors around the mean Figure 3. JMP Student Edition produces a bivariate analysis that initially shows a scatterplot.14 Bivariate Popup Menu Show Points alternately hides or displays the points in the plot. As with all the fitting commands in this platform. all accessed through the popup menu beside the variable name in the title bar (Figure 3.

three additional tables are appended to the report (see Figure 3.16).3 The Fit Y by X Platform Scatterplots—The Continuous by Continuous Case 59 3 Fit Y by X: Comparing T wo Variables If the confidence area around this line (produced through the Confid Curves Fit command) includes the horizontal line at the response mean. 95% confidence limits are plotted with the Confid Curves Fit and Confid Curves Indiv display options. if weighted variables are involved. see “The Lack of Fit Table. The Fit Polynomial option can be selected multiple times with different polynomial degrees for comparison. a measure of how well the line fits • The adjusted R2 (RSquare Adj). the curve is fit to the data points using least squares regression. an estimate of the standard deviation of the random error • The mean of the response variable • The number of observations (or. A Lack of Fit table also appears if there are replicates. As with the linear fit. The Fit Polynomial command fits a polynomial curve of the degree selected from the Fit Polynomial submenu. the sum of the weights). options can save predicted values and residuals as new columns in the current data table for each polynomial fit. 90 or the online help. After selecting the polynomial degree. Figure 3. used to compare models with different numbers of variables • The Root Mean Square Error. then the slope of the line of fit is not significantly different from zero at the 0.05 significance level.” p. . For details of the lack of fit test. Each time a linear or polynomial fit is chosen.16 Linear Fit Text Reports The Summary of Fit Table The summary of fit table shows the following information: • R2 (RSquare).

05 is interpreted as evidence that the parameter is significantly different from zero. If a parameter is a significant model effect. However. The intercept is a constant term in all models. • Estimate lists the parameter estimates of the linear model. It is the ratio of the parameter estimate to its standard error. given a true hypothesis. see “The Lack of Fit Table. The underlying hypothesis of the fit is that all the regression parameters (except the intercept) are zero. because it approximates the 0. Error. . • F Ratio is the model mean square divided by the error mean square. • DF records the associated degrees of freedom (DF) for each source of variation • Sum of Squares records an associated sum of squares (SS for short) for each source of variation. JMP Student Edition uses the following: • Source lists the three sources of variation. • Std Error lists the estimates of the standard errors of the parameter estimates. the F Ratio is usually higher than expected by chance alone. • Prob > F is the observed significance probability (p-value) of obtaining a greater F value by chance alone if the specified model fits no better than the overall response mean. For details of the lack of fit test. Often. a value below 0. A significant F statistic in this table indicates that a different model should be examined. They are used in constructing tests and confidence intervals. Parameter Estimates Table The terms in the Parameter Estimates table for a linear fit (seen previously in Figure 3. and C Total. It is the probability of getting.60 3 The Fit Y by X Platform Scatterplots—The Continuous by Continuous Case The Lack of Fit Table The Lack of Fit table shows a special diagnostic test that appears only when the data and the model provide the opportunity. Looking for a t ratio greater than 2 in absolute value is a common rule of thumb for judging significance. there may be some differences in the terminology for the ANOVA table’s parts. Analysis of Variance Table This table look similar to the ANOVA tables in most textbooks. • t Ratio lists the test statistics for the hypothesis that each parameter is zero.05 significance level. It is a test to see if a different form of the model would fit the data better. by chance alone. • Prob>|t| lists the observed significance probability calculated from each t ratio. as in corrected for the mean. 90 or the online help.” p. • Mean Square is a sum of squares divided by its associated degrees of freedom. The Parameter Estimates table displays the following: • Term lists the name of each parameter in the requested model. a t ratio greater (in absolute value) than the computed value. called Model. The “C” in C Total stands for corrected. These estimates are the coefficients in the linear model.16) are the intercept and the single X variable.

The fit for each level of the grouping variable is identified beneath the scatterplot. but in the continuous by continuous bivariate platform.3 The Fit Y by X Platform One Way anova—The Continuous by Categorical Case 61 3 Fit Y by X: Comparing T wo Variables Other Fitting Commands The Fit Special command displays a dialog with choices for transformations of both the Y and X variables. The Group By command in the fitting menu displays a dialog. determined by the probability chosen from the Density Ellipse submenu. Compare it to other fitted lines to see the concept of lack of fit. JMP Student Edition produces a one way ANOVA. and exponential.17). The Fit Each Value command fits a value to each unique X value. When a grouping variable is selected. Select the Group By command to remove (uncheck) the existing variable. The Group By command is checked in the fitting menu when a grouping variable is in effect. The regression report is shown with the transformed variables. square. The Other selection allows the specification of any probability greater than zero and less than or equal to one. initially displaying a plot that shows a vertical distribution of Y points for each X value. but an extra report shows measures of fit transformed in the original Y scale (if there was a Y transformation). There are a number of options once this scatterplot appears. The density ellipsoid is a good graphical indicator of the correlation between two variables. Then. The ellipsoid collapses diagonally as the correlation between the two variables approaches either 1 or –1. Fitting each value is like doing a one-way analysis of variance. select the Group By command again and respond to its dialog as before. The Density Ellipse command draws an ellipse that contains the specified mass of points. The ellipsoid is more circular (less diagonally oriented) if the two variables are uncorrelated. To change a grouping variable that is already in effect. with individual popup menus to save or remove fitting information. . One Way ANOVA—The Continuous by Categorical Case If the X variable is categorical and the Y variable is continuous. The fitted line is plotted on the original scale. The Density Ellipse table that accompanies each Density Ellipse fit shows the correlation coefficient (r) for the X and Y variables and the probability that the correlation between the variables is significant. so it appears as a curve on the plot. Transformations include log. reciprocal. the Fit Y by X platform computes a separate analysis for each level of the grouping variable. allowing selection of a classification (grouping) variable. square root. all accessed through the popup menu beside the variable name in the title bar (Figure 3. and overlays the regression curves or ellipses on the scatterplot.

Tukey HSD displays a table that shows the Tukey-Kramer HSD (honestly significant difference) comparisons of group means.17 One Way ANOVA Popup Menu The Quantiles command displays the Quantiles table. a t-test also shows. Dunnett’s displays a table showing Dunnett’s comparisons of group means with a control group. means. which lists the 0% (minimum). The Means/Anova/t test command fits means for each group and performs a one-way analysis of variance to test if there are differences among the means. and a table that lists group frequencies. a one-way analysis of variance table. • Each Pair. • With Best. This command also displays Means Dots. Error Bars. 25%. 50 for a detailed description of means diamonds. • All Pairs. See “Analysis of Variance (anova).62 3 The Fit Y by X Platform One Way anova—The Continuous by Categorical Case Figure 3. 90%. but uses standard deviations computed within each group rather than the pooled estimate of the standard deviation used to calculate the standard errors of the means. and Std Dev Lines display options. Compare Means has a submenu that provides the following four multiple comparison methods for comparing sets of group means.” p. 75%. This option automatically activates the Means Diamonds display option. . Student’s t displays a table with Student’s t statistics for all combinations of group means. All activate the Comparison Circles display option. 10%. If there are only two groups. Hsu’s MCB displays a table that shows Hsu’s MCB (Multiple Comparison with the Best) comparisons of group means to the best (maximum or minimum) group mean. Three tables are produced: a summary table. and standard errors computed with the pooled estimate of the error variance. 50% (median). • With Control. It also activates Box Plots from the Display Options menu. and 100% (maximum) quantiles for each group. The Means and Std Dev command fits means for each group.

3 The Fit Y by X Platform One Way anova—The Continuous by Categorical Case 63 3 Fit Y by X: Comparing T wo Variables Each multiple comparison test begins with a comparison circles plot.18 shows the alignment of comparison circles with the confidence intervals of their respective group means. Figure 3. The illustration in Figure 3. If the intersection angle is close to 90°.20). To deselect circles. or if they are nested.19 Angles in Comparison Circles angle greater than 90 degrees angle equal to 90 degrees angle less than 90 degrees not significantly different borderline significantly different significantly different . The highlighted circle appears with a thick solid line. click in the graph outside the circles. it is easy to verify whether the means are significantly different by clicking on the comparison circle. Figure 3. The outside angle of intersection tells whether group means are significantly different (see Figure 3. Circles for means that are significantly different either do not intersect or barely intersect.18 Alignment of Comparison Circles Compare each pair of group means visually by examining how the comparison circles intersect. thus highlighting it. Circles representing means that are not significantly different from the highlighted circle show with thin lines (see Figure 3. The plot follows with a table of means comparisons. If the circles intersect by an angle of more than 90°. so that the outside angle of intersection is less than 90°.19). Circles representing means that are significantly different show with a thick gray pattern. a visual representation of group mean comparisons. the means are not significantly different.

• Median rank scores are either 1 or 0 depending on whether a rank is above or below the median rank. The Wilcoxon test is the most powerful rank test for errors with logistic distributions.64 3 The Fit Y by X Platform One Way anova—The Continuous by Categorical Case Figure 3. which is displayed. the usual analysis of variance assumption of normality is not made. The Van der Waerden test is the most powerful rank test for errors with normal distributions. this plot shows both the differences in the means (vertical position) and the variances (slopes) for each level of the categorical X factor (Figure 3. • Van der Waerden rank scores are the ranks of the data divided by one plus the number of observations transformed to a normal score by applying the inverse of the normal distribution function. and Bartlett’s test. Along with the standard normality-assessing capabilities of the single-variable Normal Quantile Plot. When the variances across groups are not equal. and modifies the upper and lower confidence level values in reports. Changing the alpha level recalculates any confidence lim- its. so the standard ANOVA F test is not valid.20 Comparison Circles after Clicking The Nonparametric submenu allows computation of three nonparametric tests: the Wilcoxon. Nonparametric tests are useful to test whether group means or medians are located the same across groups. The Median test is the most powerful rank test for errors with doubly exponential distributions. adjusts the means diamonds on the plot if they are showing. called the Welch ANOVA. Normal Quantile Plot shows overlaid normal quantile plots for each level of the X variable. • Wilcoxon rank scores are the simple ranks of the data. Levene’s test. the usual analysis of variance assumptions are not satisfied. . Nonparametric tests use functions of the response variable ranks. or the specification of any level with the Other selection. However. The UnEqual Variances command tests for equality of group variances. the Brown-Forsythe test.21). There is a valid variant of the standard ANOVA. It uses (and reports) four different tests: O’Brien’s test. and van der Warden tests. Set Alpha Level has a submenu that allows a choice from the most common alpha levels. called rank scores. Median.

as shown in Figure 3. The plot shows quantiles computed within each level of the categorical X factor. • Probability Labels shows probabilities on the right axis of the Quantile by Actual plot and on the top axis of the Actual by Quantile plot. This is the centered response divided by the standard deviation within each level. Save has a submenu of commands to save the following quantities as new columns in the current data table: • Save Centered saves values computed as the response variable minus the mean of the response variable within each level of the factor variable. • Plot Quantile by Actual reverses the x. CDF plots the cumulative distribution function for all the groups in the Oneway report. • Save Standardized saves standardized values of the response variable computed within each level of the factor variable. • Mean CI Lines draws lines at the upper and lower 95% confidence levels for each group. • Means Diamonds draws Means Diamonds. • Mean Lines draws a line at the mean of each group. • Grand Mean draws the overall mean of the Y variable on the scatterplot.” p.and y-axes. • Box Plots shows outlier box plots for each group. • Line of Fit draws the straight diagonal reference lines for each level of the X variable. Complete details of means diamonds is found in “Analysis of Variance (anova).21.21 Normal Quantile Plot Slopes show standard deviations Separations show differences in means Normal Quantile Plot has these additional options: • Plot Actual by Quantile generates a quantile plot with the response variable on the y-axis and quantiles on the x-axis. Display Options allows addition or removal of plot elements. • Mean Error Bars identifies the mean of each group with a large marker and shows error bars one standard error above and below the mean.3 The Fit Y by X Platform One Way anova—The Continuous by Categorical Case 65 3 Fit Y by X: Comparing T wo Variables Figure 3. • Points shows data points on the scatterplot. • Std Dev Lines shows dotted lines one standard deviation above and below the mean of each group. . 50. • Save Normal Quantiles saves normal quantile values computed within each level of the categorical factor variable. • All Graphs shows or hides all graphs.

and a table of chi-square tests. • Data Table Window brings the data table to the front of the display. or from several variables in a single Distribution report— are in the same window. • Save Script to Report appends a script to the top of the report. • Points Jittered adds random horizontal jitter so that points that overlay on the same Y value can be seen.22 Contingency Analysis . • Points Spread spreads points over the width of the interval. JMP Student Edition produces a contingency analysis that initially shows a mosaic plot. • Save Script for All Objects is useful when several analyses — like those from a By group. Script Submenu The Script submenu contains commands related to saving a script to redo an analysis. and attaches it to the data table. • Connect Means connects the group means with a straight line. contingency table (sometimes referred to as a crosstabs table).66 3 The Fit Y by X Platform Contingency Analysis—The Categorical by Categorical Case • Comparison Circles show comparison circles computed for the multiple comparison method selected in the platform menu. Figure 3. • Save Script to Data Table generates a script that can redraw the report. • Save Script to Script Window produces a script that can re-create the report in a text window. Contingency Analysis—The Categorical by Categorical Case If both the X and Y variables are categorical. • Mean of Means • X-Axis Proportional makes spacing on the x-axis proportional to the sample size of each level. This script can then be edited or saved to an external file. • Redo Analysis repeats the analysis represented in the report. The resulting script generates all reports in the window.

The cumulative logistic probability plot gives a complete picture of what the logistic model is fitting.” p. 66.24 Contingency Table Popup Menu • Count is the cell frequency. At each x value. Mosaic Plot. • Row % is the percentage of each cell count to its row total • Col % is the percentage of each cell count to its column total • Expected is the expected frequency of each cell under the assumption of independence. Figure 3.3 The Fit Y by X Platform Logistic Regression—The Categorical by Continuous Case 67 3 Fit Y by X: Comparing T wo Variables The popup menu for contingency analyses contains items to turn parts of the report on and off. Script. It is computed as the product of the corresponding row total and column total.23 Contingency Popup Menu The contingency table itself has a popup menu to turn its cell contents on and off. margin total frequencies. • Total % is the percentage of cell counts and margin totals to the grand total. Figure 3. is explained in the section “Script Submenu. the probability scale in the y direction is divided up (partitioned) into probabilities for . and Tests all operate as toggles. divided by the grand total • Deviation is the observed (actual) cell frequency minus the expected cell frequency • Cell ChiSq is the Chi Square values computed for each cell as ( Observed – Expected ) -------------------------------------------------------Expected 2 Logistic Regression—The Categorical by Continuous Case If the Y variable is categorical and the X variable is continuous. Display Options > Horizontal Mosaic rotates the mosaic plot 90 degrees. JMP Student Edition produces a logistic analysis that initially shows a logistic plot and text reports. and grand total (total sample size). The final item. Contingency Table.

The negative sum of logs of the observed probabilities is called the negative log-likelihood (–LogLikelihood). This table is analogous to the Analysis of Variance table for a continuous response model. It is a specific likelihood-ratio Chi-square test that evaluates how well the categorical model fits the data. 66 for details on scripting options. and Script options. which are points crowded by the curves of fit. (See “Script Submenu.” p.25 Interpreting the Logistic Plot P(thread wear is severe) at load size 250 P(thread wear is moderate) at load size 250 P(thread wear is low) at load size 250 These three probabilities sum to one Markers for the data are drawn at their x-coordinate. You can see that the points tend to push the lines apart and make vertical space where they occur in numbers. and allow the curves to get close together where there is no data. The . The fit tries to avoid points attributed to have a small probability. There are only a couple of options in the Logistic drop-down menu (see Figure 3. Figure 3. with the y position jittered randomly within the range corresponding to the response category for that row.68 3 The Fit Y by X Platform Logistic Regression—The Categorical by Continuous Case each response category.26)—turning the plot on and off through the Logistic Plot command. The data pushes the curves because the criterion that is maximized is the product of the probabilities fitted by the model. with the total across all Y category probabilities sum to 1. The probabilities are measured as the vertical distance between the curves.) The Whole Model Test Table The Whole Model Test table shows if the model fits better than constant response probabilities.

the standard Script menu appears. The difference in the log-likelihood from the model fitted by the data and the model with equal probabilities is a Chi-square statistic.26. the p-value of the Chi-square statistic is 0. This test statistic examines the hypothesis that the x variable has no effect on the response. which is not statistically significant.3657. In Figure 3. Platform Options The menu of platform options is shown in Figure 3. Figure 3. They turn various plot elements on and off.3 The Fit Y by X Platform Logistic Regression—The Categorical by Continuous Case 69 3 Fit Y by X: Comparing T wo Variables negative log-likelihood for categorical data plays the same role as sums of squares in continuous data.25. In addition. and allow adjustment of the line color.26 Logistic Plot Popup Menu .

.

and the other not. Select Starch Content(%) from the list of columns and click Split Columns. In the forthcoming example analysis. select Sand Blasted? from the list of columns and click Split By. all of the original variables are not used. Select the Drop All radio button at the bottom of the Split dialog. Introduction After starting JMP Student Edition. However. type “Paired Denim” to name the new data table. Details about this data are found in “About the Data. and call for the Matched Pairs platform. The dialog should appear like the one in Figure 4. in the Denim file. A classic example is a before-and-after study of the effect of a medication. Data in this form are handled by the Matched Pairs platform. Preparing the Data This example examines the starch content of blue jeans. with one group having been sand blasted. In fact.jmp. 21 in the “The Distribution Platform” chapter. based on whether the denim was sand blasted or not. Select Lot Number and Method and click Group. To split the data in this way. At this point.4 The Matched Pairs Platform Some two-variable data have a natural pairing to them. the column needs to be split into two starch columns. Open the data file Denim. In the Output table name box. Select Tables > Split. the split command is set to make a new data table. To use the Matched Pairs platform. all the starch data is in the single column Starch Content (%). The examination is of jeans that come from the same lot. so they form a paired situation.” p. . the paired data must be in two columns. having split Starch Content (%) into two columns based on the value in the Sand Blasted? column. no variables other than the ones already in the dialog need to be retained. To drop the unnecessary variables. so they do not need to be included in this new table. Therefore. In the dialog that results.1.

with new columns no and yes containing starch information. Figure 4.2.2 Paired Denim Data Launching the Platform Select Analyze > Matched Pairs from the menu bar. This brings up the Matched Pairs platform launch dialog as shown in Figure 4.4. The data table appears as in Figure 4.72 4 The Matched Pairs Platform Introduction Figure 4. .1 Split Columns Dialog Click OK to create the data table.

the difference is significant The text reports below this plot show the same result—there is a difference in starch content of denim based on sand blasting. The Matched Pairs Scatterplot After it is launched. See the online help for an example using a grouping variable. The resulting report is easily interpretable.4 Matched Pairs Launch Dialog Optionally. the Matched Pairs platform displays a scatterplot and numerical results. and test both between and among the pairs. . The primary graph in the platform is a plot of the difference of the two responses on the y-axis. Select the no and yes variables from the columns list and click the Y.4 The Matched Pairs Platform Introduction 73 4 Paired Variables To launch the dialog. and the mean of the two responses on the x-axis. The Matched Pairs Launch Dialog The Matched Pairs platform launch dialog (Figure 4. a grouping variable can be entered in the X.4) requires at least two variables to be entered. as shown in Figure 4.3 Paired Denim Matched Pairs Report The red line is the difference given by the data The horizontal gray line represents zero The dotted lines are a 95% confidence interval on the difference. Paired Response button. This graph is the same as a scatterplot of the two original variables. Figure 4.3. Figure 4. These two variables are the values that are paired.002. If it doesn’t contain zero. select the two paired variables to be analyzed. with a p-value of 0. Grouping role to have JMP Student Edition estimate means for the groups.

Figure 4. • The mean difference is shown as the horizontal line.5 Comparison of Scatterplot and Matched Pairs Plot Notice the following in Figure 4. allows two plot options — plotting the difference by the mean. and the mean of the two variables. or plotting the difference by the row number. Figure 4. then the means are not significantly different at the 0. the difference is significant. By rescaling. this plot shows the difference between the two variables.05 level.6: • The 45° tilted square shows the frame of the scatterplot of the original columns. See “Script Submenu.5). and standard scripting items are available. shown in Figure 4.7. See the online help for more details of this transformation.6. Figure 4.74 4 The Matched Pairs Platform Introduction but turned 45° clockwise (see Figure 4.7 The Matched Pairs Men .” p. with the 95% confidence interval above and below. In the example shown in Figure 4.6. as in Figure 4. A 45° rotation turns the original coordinates into a difference and a sum. The square reference frame can also be toggled on and off. 66 for details on the Script submenu.6 The Matched Pairs Scatterplot 95% Confidence Interval Mean Difference Line where the two variables are equal Mean of Means The Matched Pairs menu. • The mean of the mean of pairs is shown by the vertical line. If the confidence region includes the horizontal line at zero.

9 Highly Significant Shift Down The Y2 score is consistently lower than Y1 across all subjects.8 No Change The distribution vertically is small and centered at zero. The change from Y1 to Y2 is not significant.4 The Matched Pairs Platform Interpreting the Matched Pairs Plot 75 4 Paired Variables Interpreting the Matched Pairs Plot There are many possibilities for making statements regarding the patterns to be discovered in the new. rotated coordinates. Figure 4. Figure 4. . The examples below show six different situations and their interpretations. This is the high-positive-correlation pattern that is the typical situation.

and is unusual. the mean is the same from Y1 to Y2. but the high Y1s are associated with low Y2s. This is a high-negative-correlation pattern. . Figure 4. the mean is the same from Y1 to Y2. and high variance of the mean of the two values within a subject. and low variance of the mean of the two values within a subject.11 No Average Shift. But Reverse Relationship This example shows a high variance of the difference. but individually.10 No average Shift. the high scores got higher and the low scores got lower. But Amplified Relationship This situation shows a low variance of the difference.76 4 The Matched Pairs Platform Interpreting the Matched Pairs Plot Figure 4. Overall. Overall. and vice-versa.

Positive Correlation.4 The Matched Pairs Platform Interpreting the Matched Pairs Plot 77 4 Paired Variables Figure 4. but the high scores drop a little. the mean is the same from Y1 to Y2. . but Damped Instead of Accelerated Overall.12 No Average Shift. and low scores increase a little.

.

In the list of columns. In this introduction. no powers. Open the data file Denim. click on Method. Introduction After starting JMP Student Edition. . This brings up the Fit Model dialog. Unlike the other platform launchers in JMP Student Edition.” p.1.jmp. fit a simple model with only main effects—no interactions. Standard least squares fitting. 21 in the “The Distribution Platform” chapter. the Fit Model dialog does not go away once the model is run. Click Run Model. To select all the model effects together. click on Sand Blasted?. Click Window in the top menu bar to see a list of open windows. and which are predictors of thread wear in denim. models are developed to determine which variables (if any) are predictors of the starch content of denim. are fit using this single platform. hold down the Shift key. Details about this data are found in “About the Data. which is illustrated in Figure 5. To see that it is still available. Launching the Platform Select Analyze > Fit Model from the menu bar. including stepwise procedures. select Starch Content (%) and click the Y button. To begin with.5 The Fit Model Platform General linear models—those that have more complicated forms than can be fit with simple linear regression—are fit with the Fit Model platform. then click the Add button.

. Title bars within the reports are also editable.80 5 The Fit Model Platform Introduction Figure 5. (Window > Set Report Title on Macintosh) In the resulting dialog box. To make identification easy. type “Main Effects Only”. change its title to something meaningful as follows. so it will need to be identified among the other windows. Make sure the report window is the front window. Double click on the title bar that says Response Starch Content (%). Again type “Main Effects Only” and press Enter.1 The Fit Model Dialog Setting Titles This report generated here is used later in this presentation. Select Window > Set Title.

There is a lot of information in this report. Figure 5. Leverage plots are detailed in “Leverage Plots. printed in several places.2 The Scroller Tool Scroller Tool To scroll using the Scroller tool. first check if the model as a whole is significant. look at the p values associated with each effect. Shift-clicking a tool keeps the tool active for multiple clicks. scroll the window until each effect is visible. or by using the Scroller tool. . Then. including just below each leverage plot. In this initial look at the data. Without Shift-clicking. and its p value indicates that the model is significant. Scrolling is accomplished using scroll bars along the edges of the window. all of it is documented in sections that follow.5 The Fit Model Platform Introduction 81 5 Fit Model Examining Results Now.” p. examine these results. The first leverage plot is for the entire model. 91. Move the Scroller tool over the results report. To examine the p values for the individual effects. and although only a portion is used in this example. Shift-click the Scroller tool in the Tools toolbar. JMP Student Edition reverts back to the arrow tool after other tools’ first use.

Remove the existing effects from the model and re-run the analysis with interactions by doing the following. show the values of the response (starch content in this case) for levels of a nominal effect. Select Method and Size of Load (lbs) in the list of columns and click the Cross button. (That is. the starch content for Caustic Soda is higher than Pumice Stone. the plot suggests that when controlled for the other effects in the model. in short movements. Double-click on each of the other variables. called LSMeans by JMP Student Edition. However. this analysis is fairly primitive. . Figure 5. Re-running an Analysis After moving around the report a bit and observing the p-values for each effect. so that the effect of each variable can be examined. add in an interaction effect. To see a plot of the LSMeans. Try moving the mouse while releasing the mouse button. The response values are adjusted for the other terms in the model.) This illustrates the “inertia” that the scroller tool imparts on reports.82 5 The Fit Model Platform Introduction Hold the mouse button and move the scroller tool to see the window move. Click on the Window menu and select the Fit Model dialog. By default. Now.3 LS Means Table and Plot In this case. it should be clear that they are all significant at the 0. which is in turn higher than Alpha Amalyze. Select LSMeans Plot from the drop-down menu on the title bar of an effect. repeatedly. We want to remove all the effects in this case. Least Squares Means Least squares means. since it does not consider any interactions among the variables. the LSMeans are displayed in this example model. There are two ways to remove effects from the model. Select one of the variables and click the Remove button located above the effects list. several short click-and-drags.05 level.

” p. remembering that the Control (Windows) or (Macintosh) key allows for multiple selections. Click the Macros button and select Full Factorial from the popup menu. Size of Load (lbs). . and Sand Blasted? in the model effects list. 86. The appropriate effects appear in the effects list. For now. These macros are completely discussed in “Macros. request a full factorial model—all main effects with all possible interactions. Select Method. Make sure Starch Content (%) is still in the Y role at the top of the launch dialog. JMP Student Edition provides some pre-defined macros to add popular effects combinations to models.5 The Fit Model Platform Introduction 83 5 Fit Model Since it is rather tedious to specify all main effects and all crossed effects one at a time.

Many models have to be tweaked after initial results are examined.4 Contrast Dialog Click the + button once next to Alpha Amalyze. For example. Another report appears.84 5 The Fit Model Platform Introduction Click Run Model. Figure 5. where details of the linear contrast are specified. then re-run the model to repeat the process until all effects are significant. Remove the Method*Size of Load*Sand Blasted effect. For example. A prudent model maker would. Select LS Means Contrast from the drop-down menu in the title bar of the Method variable. this time much larger. This iterative procedure may be repeated several times. .05 level. This is exactly the reason that the Fit Model dialog persists. noting that neither of the levels of the Method*Size of Load*Sand Blasted effect are significant at the 0. you see that the Method*Size of Load effect levels are not significant. even after clicking the Run Model button. Again bring the Fit Model dialog to the front using the Window menu. to test that the Alpha Amalyze wash method is significantly different from the Pumice Stone wash method. at this point. Linear Contrasts Another common task is to test that levels within an effect are different from each other. This is accomplished by using linear contrasts. remove them one at a time from the model.4) to the report. JMP Student Edition attaches a Contrast Dialog (Figure 5. examine the p values of each effect. Click Run Model. If you examine the new model results. so it can be removed from the model as well.

or. highlight it and click Remove.1 on page 80. Click Done. alternatively. If one of the variables is not significant. This test shows a highly significant p-value. an optional role that identifies a column whose values signify the importance of each row in the model . The Fit Model dialog is different from JMP Student Edition’s other launch dialogs in that it does not disappear after the model is launched. The roles a variable can take are: • Y. In JMP Student Edition. giving confirmation that Alpha Amalyze is significantly different from Pumice Stone in affecting the resulting starch content of denim. or ordinal logistic) is referred to as the fitting personality. To remove a variable from its role. The Fit Model Dialog Box Regardless of the model to be fit. the Fit Model dialog box is the first step. Roles To assign a variable to a role. select the variable name and click the appropriate button. This is where the roles of each variable get specified and the type of fit is selected. This facilitates experimentation with the model. it can be removed and the model re-run quickly. which identifies one or more response variables (the dependent variables) • Weight. the type of fit (standard least squares. nominal logistic.button once next to Pumice Stone. double-click on the variable’s name. The dialog box is completely illustrated in Figure 5.5 The Fit Model Platform The Fit Model Dialog Box 85 5 Fit Model Click the .

• When the outside effect appears in the Model Effects list. use Full Factorial. select it again. then B is said to be nested within A.5 Macros drop-down list The following models are available: Full Factorial To look at many crossed factors. A second degree factorial is a very common analysis. To add a simple regressor to the model. with selected variables A. select Factorial to Degree and enter the degree of interactions in the Degree box. -click When levels of an effect (call it B) only occur within a single level of an effect (call it A). . • Select the outside effects in the column selection list and click Add or Cross. A*C. and A is called the outside effect. such as in a factorial design. Macros Common models can be generated using the macros drop-down list. select the two variables to be crossed (use Control-click or for multiple selections) in the Select Columns list and click the Cross button. B*C. the Full Factorial selection places A. For example. an optional role that identifies a column whose values designate the frequency of rows in the analysis Model Effects Effects are added to the model by using the buttons in the Construct Model Effects section of the dialog. Factorial to Degree To create a limited factorial.86 5 The Fit Model Platform The Fit Model Dialog Box • Freq. and A*B*C in the Model Effects list. It creates the set of effects corresponding to all crossings of all variables selected in the columns list. A*B. C. To add a nested effect. select the variable name and click the Add button. Figure 5. B. • Select the nested variable in the column selection list and click Nest. and C. To add a crossed effect to a model. B.

See the JMP help if you are interested in Scheffe cubics. Polynomial to Degree Polynomial effects are a series of terms that are powers of a single variable. The Stepwise feature computes estimates that are the same as those of the Standard Least Squares . This is accomplished by fitting a collection of terms in a quadratic model. Fitting Personalities The three available personalities for model fitting are available in the Personality drop-down list.5 The Fit Model Platform The Fit Model Dialog Box 87 5 Fit Model Factorial Sorted The Factorial Sorted selection creates the same set of effects as Full Factorial. To specify a polynomial effect. shown in Figure 5. then select Response Surface Effect from the Attributes menu. The critical values for the surface are calculated from the parameter estimates and presented with a report on the shape of the surface. All main effects are listed first.6 Fitting Personalities • Standard Least Squares models one or more continuous responses in the usual way through fitting a linear model by least squares. Scheffe Cubic Cubic Models are an advanced topic not usually covered in an advanced course.6. Mixture Response Surface Mixture response surface variables are selected in the same way as Response Surface Effect variables. Figure 5. and so forth. Select Mixture Response Surface from the Attributes menu after selecting a variable name. followed by all two-way interactions. • Stepwise regression is an approach to selecting a subset of effects for a regression model. Response surface effects appear with an ampersand (&) appended to their name. but lists them in order of degree. Response Surface Response surface models find the values of the terms that produce a maximum or a minimum expected response. • click one or more variables in the column selection list • enter the degree of the polynomial in the Degree box • select the Polynomial to Degree command in the Macros popup menu. select the variable name. To specify a Response Surface effect. then all three-way interactions.

Fit Model Report Items When a model is fit with the Standard Least Squares or Stepwise personality. Emphasis Choices The Emphasis popup menu controls which plots and tables are initially shown in the analysis report: • Effect Leverage initially displays leverage and residual plots for the whole model.88 5 The Fit Model Platform Fit Model Report Items personality. Use the dialog to make changes to the model for additional fits. several reports appear based on the Emphasis selected in the Fit Model dialog. Request plots and reports from the report itself. only available in the professional version of JMP. Run Model The Run Model button submits the model to the fitting platform. but it facilitates searching and selecting among many models. followed by a scaled parameter report and the Prediction Profiler. The Stepwise personality allows only one continuous Y. • Minimal Report suppresses all plots. Select effect details and other statistical reports from the report itself. • Effect Screening shows whole-model information. Multiple categorical responses call for MANOVA or other advanced methods. but does not close the dialog window. or make changes to the data and then refit the same model. Any report that does not appear by default can be requested from the platform menu. .

An R2 of 0 means that the fit predicts the response no better than the overall response mean. ters. The Summary of Fit Table The Summary of Fit table appears first and shows the following numeric summaries of the response for the multiple regression model: Rsquare (R2) estimates the proportion of the variation in the response around the mean that can be attributed to terms in the model. Weights are used in weighted least squares — an advanced topic. called Model. . and C Total.5 The Fit Model Platform Fit Model Report Items 89 5 Fit Model Regression Reports Regression reports give textual information about the fit. and it is commonly denoted as s. this is the sum of that column’s values. It is important as a base model for prediction because all other models are compared to it. DF records an associated degrees of freedom for each source of variation. An R2 of 1 occurs when there is a perfect fit (the errors are all zero). Observations (or Sum of Weights) records the number of observations used in the fit. If there is a column assigned the role of weight. The Analysis of Variance Table The Analysis of Variance table shows the basic calculations for a linear model. so the C Total DF is always one less than the number of observations. rather than to random error. There is only one degree of freedom used (the estimate of the mean parameter) in the calculation of variation. • The Error DF is the difference between the C Total DF and the Model DF. Sum of Squares records an associated sum of squares for each source of variation. this is the same as the number of rows in the data table. Root Mean Square Error estimates the standard deviation of the random error. Each is the sum of squares of the differences between the fitted response and the actual response. Rsquare Adj adjusts R2 to make it more comparable over models with different numbers of parame- The Mean of Response is the overall mean of the response values. Error. The C Total degrees of freedom is for the simple mean model. • The Total (C Total) SS is the sum of squared distances of each response from the sample mean. If there are no missing values and no excluded rows. The variance measured around this mean is the Corrected Total (C Total) mean square in the Analysis of Variance table. The table compares the model to a model containing only the mean: Source lists the three sources of variation. Since adding terms to an existing model always increases R2. this adjustment compensates for adding terms to a model that already has terms in it. The total degrees of freedom are partitioned into the Model and Error terms: • The Model degrees of freedom is the number of parameters (except for the intercept) used to fit the model. It is also the square of the correlation between the actual and predicted response. It is the square root of the mean square for error in the corresponding analysis of variance table. It is a ratio of mean squares instead of sums of squares.

• The model is saturated. Sometimes. therefore.90 5 The Fit Model Platform Fit Model Report Items • The Error SS is the sum of squared differences between the fitted values and the actual values. The error for these exact replicates is called pure error. meaning that the model itself has a degree of freedom for each different X value. This is the portion of the sample error that cannot be explained or predicted no matter which form the model uses for the X variables. A Mean Square is a sum of squares divided by its associated degrees of freedom. It tests the hypothesis that all the regression parameters (except the intercept) are zero. Significance probabilities of 0. If there is a significant effect in the model. consider adding interaction terms. In these cases. The F Ratio is the model mean square divided by the error mean square. Pure Error. or if too few interaction effects exist in an analysis of variance model. The remaining portions of the Lack of Fit table are similar to those of the Analysis of Variance Table. The Lack of Fit Table The Lack of Fit table shows a special diagnostic test and appears only when the data and the model provide the opportunity. the F Ratio is higher than expected by chance alone. This sum of squares corresponds to the unexplained error (residual) after fitting the regression model. The difference between the residual error from the model and the pure error is called lack of fit error. Note that large values of Model SS and small values of Error SS lead to large F ratios and low p values— desirable if the goal is to declare that terms in the model are significantly different from zero. This computation converts the sum of squares to an average.” p. A lack of fit error can be significantly greater than pure error if a regressor is in the model with the wrong functional form. This occurs when observations are exact replicates of each other in terms of the X variables. it is possible to estimate the error variance independently of whether the right form of the model is the one under consideration.05 or less are often considered evidence that there is at least one significant regression factor in the model. 91. Prob>F is the probability of obtaining a greater F value by chance alone if the specified model fits no better than the overall response mean. Most practitioners check this F test first and make sure that it is significant before delving further into the details of the fit. There are two common situations where there is no lack of fit test: • There are no exactly replicated points with respect to the X data. This significance is also shown graphically by the whole-model leverage plot. if appropriate. and Total Error. the maximum R2 that can be achieved by using only the variables in the model. to try to better capture the functional form of a regressor. The only additional information is the Max RSq. The Lack of Fit table shows information about the error terms: Source lists the three sources of variation called Lack of Fit. . and therefore there are no degrees of freedom for pure error. there are no degrees of freedom for lack of fit. Note that the pure error DF is pooled from each group where there are multiple rows with the same values for each effect. described in “Leverage Plots.

Prob>F is the significance probability for the F ratio. Estimate lists the parameter estimates for each term. Nparm is the number of parameters associated with the effect. Continuous effects have 1 parameter. DF is the degrees of freedom for the effect test.8).0000. . The fitting platform produces a leverage plot for each effect in the model. It is the ratio of the estimate to its standard error. a larger F statistic would only occur due to random error.7. Values less than 0. These plots show point-by-point what the residual would be both with and without that effect in the model (See Figure 5. Note that if DF is zero.05 are often considered as significant evidence that the parameter is not zero. An example leverage plot is shown in Figure 5. Models with complex classification effects have a parameter for each anticipated degree of freedom. Simple regressors show as the name of the data table column. Probabilities less than 0.5 The Fit Model Platform Fit Model Report Items 91 5 Fit Model The Parameter Estimates Table The Parameter Estimates table shows the estimates of the parameters in the linear model and a t-test for the hypothesis that each parameter is zero.0005 appear as 0. It is the probability that if the null hypothesis is true. The Effect Test Table The Effect Test table shows the following information for each effect: Source lists the names of the effects in the model. Nested effects depend on how levels occur. The Parameter Estimates table shows these quantities: Term names the estimated parameter. The dummy variables are coded as 1. Crossed effects multiply the number of parameters for each term. the note Lost DFs appears to the right of the line in the report. F Ratio is the F statistic for testing that the effect is zero. It is the ratio of the mean square for the effect divided by the mean square for error. The first parameter is always the intercept. except for the last level. This is the two-tailed test against the alternatives in each direction. Sum of Squares is the sum of squares for the hypothesis that the listed effect is zero. Leverage Plots Leverage plots reveal the significance of an effect in the model. Nominal effects have one less parameter than the number of levels. t Ratio is a statistic that tests whether the true parameter is zero. Whenever DF is less than Nparm. Regressors that are dummy indicator variables constructed from nominal or ordinal effects are labeled with the names of the levels in brackets. Prob>|t| is the probability of getting an greater t statistic (in absolute value). given the hypothesis that the parameter is zero. no part of the effect is testable. This is the value used to construct t-tests and confidence intervals for the parameter. Simple continuous regressors have only one parameter. Std Error is the standard error. an estimate of the standard deviation of the distribution of the parameter estimate. which is coded as –1 across all the other dummy variables for that effect.

7 Example Leverage Plot from Denim Data In addition.8 General Leverage Plot residual residual constrained by hypothesis points farther out pull on the line of fit with greater leverage than the points near the middle Figure 5. Significance of the effect is seen by comparing the slope of the sloped line with that of the horizontal one.92 5 The Fit Model Platform Fit Model Report Items Figure 5. Figure 5. This Whole Model leverage plot dramatizes the test that all the parameters (except intercepts) in the model are zero. In general.9. the horizontal line on the plot represents what the values of the model would be if the effect was removed from the model. The sloped line represents the values of the model with the effect included.9 Significance of Effects Significant Borderline Not Significant confidence curve crosses horizontal line confidence curve asymptotic to horizontal line confidence curve does not cross horizontal line . as in Figure 5. This illustrates the same test reported in the Analysis of Variance report. there is a special leverage plot titled Whole Model that shows the actual values of the response plotted against the predicted values.

11. To construct a contrast. dealing with least squares means.” p.5 The Fit Model Platform Fit Model Report Items 93 5 Fit Model Effect Details In a Standard Least Squares analysis. which gets the value -1. the following effect details are available. For continuous effects. The other model factors are controlled —that is.11 Parameter Estimates Table The Expanded Estimates command shows the same information. Least squares means shows which levels produce higher or lower responses. For nominal effects. .) This command is enabled only for categorical effects. holding the other variables in the model constant. Figure 5. Expanded Estimates The standard Fit Model output includes a Parameter Estimates Table. the estimates are the coefficients of dummy variables whose value is 1 for all levels of the variable except the last. Figure 5. If possible. Exploring the Estimates The following commands allow you to further explore the estimated coefficients of the model. as seen in Figure 5. • The LS Means Student’s t command requests multiple comparison tests. • LS Means Contrast displays a dialog for specifying contrasts with respect to an effect.buttons beside the levels to be compared. but with a coefficient for each continuous variable and each level of other variables. • LS Means Plot plots the LSMeans for nominal and ordinal main effects and two-way interactions. Least squares means are also called adjusted means or population marginal means. 84 for an example of using contrasts. More detailed descriptions of each command are available in the online help. the dialog normalizes after each click to make the sum for a column zero and the absolute sum equal to two after each click. set to neutral values. (See “Linear Contrasts. Ordinal effects show coefficients for dummy variables that measure the difference at levels of the variable from the mean of all levels of the effect. designated LS Means by JMP Student Edition.10 Effect Details • LS Means Table shows predicted values from the specified model across the levels of a categorical effect. the estimates are the (estimated) coefficients of each term in the linear model. It adds to the plus or minus score proportionately. click the + and .

it is possible to test far more complicated null hypotheses than this. one. which displays the dialog shown in Figure 5. or that some parameters are equal to others. and are tested using JMP’s Custom Test command. or another value. To test a custom hypothesis. Custom Test In introductory statistics courses.13. Figure 5.94 5 The Fit Model Platform Fit Model Report Items Figure 5. shown here.13 Custom Test Launch You can enter a descriptive label for this test—useful if you are doing several tests The space beneath the Custom Test title bar is an editable area for entering a test name. These tests are known in statistics as a general linear hypotheses. frequently hypothesizing that a parameter is zero. select Custom Test from the Estimates popup menu. However. . For example.12 Expanded Estimates Table Compare the expanded estimates with the prediction formula for this model. it is reasonable to test that several parameters are zero. null hypotheses are often about one variable at a time.

When the test is specified. . consider using the contrast dialog. You would enter a 1 beside the “=” in this dialog. 91. One of the parameters is labeled “=”. A joint F test for all columns is at the bottom.5 The Fit Model Platform Fit Model Report Items 95 5 Fit Model Parameter lists the names of the model parameters. and other statistics for each test column in the dialog. The results are appended to the bottom of the dialog. which tests hypotheses about the least squares means.14. To the right of the list of parameters are columns of zeros corresponding to these parameters. Use the Add Column button to add as many columns to the test as needed. Click in these cells to enter a new hypothesized parameter value corresponding to the desired test. In the edit box to its right. the standard error. Correlation of Estimates The Correlation of Estimates option in the Estimates platform menu produces a correlation matrix for the all effects in a model. Sample output for a custom test (that the Size of Load coefficient is equal to 1) is shown in Figure 5. the function value of the parameters tested. When the custom test is done. Figure 5. Add Column adds another column of zeros so that several linear functions of the parameters can be jointly tested. instead of using a Custom test.14 Custom Test Output Note: For tests within a categorical effect.” p. you may be testing that a certain combination of the factors sums to 1. the report lists the test name. enter the value that you are testing the contrast against. click Done to see the test performed. For example. Row Diagnostics Leverage Plots (the Plot Actual by Predicted and Plot Effect Leverage commands) are covered previously in this chapter under “Leverage Plots.

• Plot Residual By Row displays the residual value by the row number of its observation. called StdErr Pred colname. • Durbin-Watson Test displays the Durbin-Watson statistic to test whether or not the errors have first-order autocorrelation. Note: If you hold down the Shift key and select Save Mean Confidence Interval. The autocorrelation of the residuals is also shown. This is useful for predicting values in new rows or for obtaining a picture of the fitted model. which are the observed response values minus predicted values. • Plot Residual By Predicted displays the residual values by the predicted values of Y. Individual Confidence Interval creates two new columns called Lower95% Indiv colname and Upper95% Indiv colname. Each selection generates one or more new columns in the current data table titled as shown. containing the standard errors of the predicted values. The new columns contain lower and upper 95% confidence limits for individual response values. you are prompted to enter an α-level for the computations. The new col- umn values are the residuals divided by their standard error. where colname is the name of the response variable: Prediction Formula creates a new column. The new columns contain the lower and upper 95% confidence limits for the line of fit. Std Error of Predicted creates a new column. .96 5 The Fit Model Platform Fit Model Report Items • Plot Actual by Predicted displays the observed values by the predicted values of Y. Studentized Residuals creates a new column called Studentized Resid colname. This Durbin-Watson table is only appropriate for time series data when you suspect that the errors are correlated across time. containing the predicted values computed by the specified model. The prediction formula can require considerable space if the model is large. Use the Column Info command and click the Edit Formula button to see the prediction formula. Residuals creates a new column called Residual colname containing the residuals. use the Save Predicted Values option. It differs from the Save Predicted Values column in that the prediction formula is saved with the new column. Save Commands The Save submenu offers the following choices. Predicted Values creates a new column called Predicted colname that contain the predicted val- ues computed by the specified model. This is the leverage plot for the whole model. you are prompted to enter an α-level for the computations. The Durbin-Watson table has a popup command that computes and displays the exact probability associated with the statistic. • Plot Effect Leverage produces a leverage plot for each effect in the model showing the point-by-point composition of the test for that effect. Note: If you hold down the Shift key and select Save Individual Confidence Interval. Mean Confidence Interval creates two new columns called Lower 95% Mean colname and Upper 95% Mean colname. called Pred Formula colname. You typically want to see the residual values scattered randomly about zero. If you do not need the formula with the column of predicted values.

StdErrResid colname. The columns are named as follows. containing the stan- dard errors of the individual predicted values. Std Error of Individual creates a new column. Effect Leverage Pairs creates a set of new columns that contain the values for each leverage plot. . called StdErr Indiv colname.5 The Fit Model Platform Fit Model Report Items 97 5 Fit Model Std Error of Residual creates a new column called. The new columns consist of an X and Y column for each effect in the model. containing the stan- dard errors of the residual values. If the response column name is R and the effects are X1 and X2. then the new column names are X Leverage of X1 for R X Leverage of X2 for R Y Leverage of X1 for R Y Leverage of X2 for R.

.

As an example. It is used when there is little theory to guide the selection of terms for a model and the modeler. by A. To identify such an equation. it would be more economical to find a formula that uses simpler measurements that evaluate fitness and predict oxygen uptake.jmp data table. measurements of age. but it facilitates searching and selecting among many models. Open the Fitness.6 on page 87). recognizing that the true significance probability could be nowhere near the reported one. in desperation.6 Stepwise Regression Stepwise regression is an approach to selecting a subset of effects for a regression model. The significance levels on the statistics for selected models violate the standard statistical assumptions because the model has been selected rather than tested within a fixed model. and pulse were taken for 31 participants who ran 1.1 shows a partial listing of the Fitness. weight. Figure 6. Introduction In JMP. The approach is somewhat controversial. Aerobic fitness can be evaluated using a special test that measures the oxygen uptake of a person running on a treadmill for a prescribed distance. The Stepwise platform lets you search through models with combinations of effects and choose the model you want. The Stepwise feature computes estimates that are the same as those of other least squares platforms.5 miles. stepwise regression is a personality of the Model Fitting platform—it is one of the selections in the Fitting Personality popup menu on the Model Specification dialog (see Figure 5.jmp data table in the Sample Data folder. However. The book Subset Selection in Regression. wants to use whatever seems to provide a good fit. J. you need to compare many different regression models. brings statistical sense to model selection statistics. On the positive side. Miller (1990). runtime. . To find a good oxygen uptake prediction equation. This chapter uses the term “significance probability” in a mechanical way to represent that the calculation would be valid in a fixed model. the approach has been of practical use for 30 years in helping to trim out models to predict many kinds of responses. This data shows results from an aerobic fitness study.

RunPulse. Select Stepwise from the Fitting Personality popup menu. and MaxPulse as Effects. Choose Oxy as the Y response Choose Weight. Click Run Model. Figure 6.100 6 Stepwise Regression Introduction Figure 6.1 The Fitness Data Table Note: For purposes of illustration. certain values of MaxPulse and RunPulse have been changed from data reported by Rawlings (1988.105). RstPulse. Runtime. .2 Model Specification Dialog for a Stepwise Model To do stepwise regression. p. Select Fit Model in the Analyze menu. In the Model Specification dialog.

3 Current Estimates Table After one step After all steps Now that we have selected the effects that contribute to explaining Oxy.6 Stepwise Regression Introduction 101 6 Stepwise Regression When the report appears. Click Run Model. Click the Step button. In this example. you are presented with a control panel. Backward (where effects are removed as they become noon-significant) or Mixed. After one step. To add the first detected effect. The bottom table in Figure 6. We now want to add significant effects. . we can make a model and examine its analysis.3 shows that all the terms have been added except RstPulse and Weight which are not significant at the Prob to Enter value of 0. Figure 6. the most significant term Runtime is entered into the model (top Current Estimates table in Figure 6. a combination of the two detailed below. so Leave the default Forward selection method. we to add any effects that are significant at the 0.25 level or better.25 as the Prob to Enter. A Fit Model dialog appears. We now have a choice of three stepwise methods: Forward (where effects are added as they become significant). To automatically add all the detected effects automatically (rather than manually with the Step button) Click Go to see the stepwise process run to completion. This example uses forward selection.25 specified in the Stepwise Regression Control Panel.3). Leave 0. This produces a report identical to those seen in “The Fit Model Platform” chapter. used to specify how effects should enter or exit the model. Click Make Model.

The following sections describe the components of these areas and tell how to use them. . begin or stop the selection process. Click the field to enter a value. Stepwise Regression Control Panel The Stepwise Regression Control Panel (Control Panel for short). shown next. the stepwise platform displays a window that shows three areas: • The Stepwise Regression Control panel.102 6 Stepwise Regression The Stepwise Window The Stepwise Window When launched. You use the Control Panel as follows: • Prob to Enter is the significance probability that must be attributed to a regressor term for it to be considered as a forward step and entered into the model. which is an interactive control panel for operating the platform • The Current Estimates table. • Direction accesses the popup menu shown here. which lets you choose how you want variables to enter the regression equation. • Forward brings in the regressor that most improves the fit. and create a model. given that term is significant at the level specified by Prob to Enter. has editable areas. which shows the current status of the specified model. and provides additional control features • The Step History table. determine the method of selecting effects. which lists the steps in the stepwise selection. buttons and a popup menu. Click the field to enter a value. • Prob to Leave is the significance probability that must be attributed to a regressor term in order for it to be considered as a backward step and removed from the model. You use these dialog features to limit regressor effect probabilities.

Current Estimates Table The Current Estimates table lets you enter. given that term is not significant at the level specified in Prob to Leave. You can click a term’s check box to manually bring an effect into or out of the model. • Make Model forms a model for the Model Specification Dialog from the model currently showing in the Current Estimates table. Buttons on the controls panel let you control the stepwise processing: • Go starts the selection process. The following quantities update continually during the fitting process and are used in determining the final model. remove. Lock does not permit a term that is checked to be entered or removed from the model. • Stop stops the background selection process. • Mixed alternates the forward and backward steps. It includes the most significant term that satisfies Prob to Enter and removes the least significant term satisfying Prob to Leave. Click an effect’s check box to change its lock status. Make Model can create new data table columns to contain terms that are needed for the model.) if the effect is not currently in the model. . The process continues to run in the background until the model is finished. It is missing (. In cases where there are nominal or ordinal terms. • Parameter lists the names of the regressor terms (effects). The platform begins with no terms in the model except for the intercept. • Step stops after each step of the stepwise process • Enter All enters all unlocked terms into the model. and lock in model effects. You use check boxes to define the stepwise regression process: • Lock locks a term in or out of the model. • Entered shows whether a term is currently in the model. It continues removing terms until the remaining terms are significant and then it changes to the forward direction.6 Stepwise Regression The Stepwise Window 103 6 Stepwise Regression • Backward removes the regressor that affects the fit the least. The intercept is permanently locked into the model. • Estimate is the current parameter estimate. • Remove All removes all terms from the model. as is shown here.

• SS is the reduction in the error (residual) SS if the term is entered into the model or the increase in the error SS if the term is removed from the model. A term has more than one degree of freedom if its entry into a model also forces other terms into the model. DFE. MSE are the sum of squares. If a term is restricted in some fashion. the model seen in the Current Estimates table appears in the Model Specification dialog. Like the “F Ratio.4 without a fitting personality selection.4. If you use Mallow’s Cp as a model selection criterion.104 6 Stepwise Regression The Stepwise Window • nDF is the number of degrees of freedom for a term. Step History Table As each step is taken. select the model where Cp approaches p. • AIC is Akaike’s Information Criterion. Make Model When you click Make Model. • “F Ratio” is the traditional test statistic to test that the term effect is zero. • RSquareAdj adjusts R2 to make it more comparable over models with different numbers of parameters by using the degrees of freedom in its computation. The adjusted R2 is useful in stepwise procedure because you are looking at many different models and want to adjust for the number of terms in the model.” it is in quotation marks because it is not to be trusted as a real significance probability. if you click Make Model after the backward selection in Figure 6. It is in quotation marks because it does not have an F-distribution for testing the term because the model was selected as it was fit. In this example. . it could have a reported SS of zero. • Cp is Mallow’s Cp criterion. the number of parameters in the model. the Model Specification dialog appears as shown in Figure 6. and mean square error (residual) of the current model. three or four variables appear to be a good choice for a regression model. • RSquare is the proportion of the variation in the response that can be attributed to terms in the model rather than to random error. • “Prob>F” is the significance level associated with the F-statistic. The Step History table for the Fitness data example shows the order in which the terms entered the model and shows the effect as reflected by R2 and Cp. degrees of freedom. the Step History table records the effect of adding a term to the model. It is the square of a t-ratio. For example. Statistics for the current model appear above the list of effects: • SSE.

Figure 6. Note that this report is for a three-variable model consisting of Runtime. It is accessible from the red triangle drop-down menu on the stepwise control panel (see Figure 6. .5 All Possible Models When selected.4 New Model Specification dialog from Forward Stepwise Procedure All Possible Regressions Stepwise includes an All Possible Models command. RunPulse.6 Stepwise Regression All Possible Regressions 105 6 Stepwise Regression Figure 6. and MaxPulse.6. all possible models of the regression parameters are run. resulting in the report seen in Figure 6.5).

More may be possible. The model with the highest R2 for each number of parameters is highlighted. but can strain computer memory (and human patience). Right-click (Control-click on the Macintosh) and select Columns > Cp from the menu that appears. .6 All Models Report The models are listed in decreasing order of the number of parameters they contain.106 6 Stepwise Regression All Possible Regressions Figure 6. Note: Mallow’s Cp statistic is computed. We suggest that no more than about 15 variables be used with this platform. To make it visible. but initially hidden in the table.

• A point outside the control limits signals the presence of a special cause of variation.7 Control Charts Control charts are a graphical and analytic tool for deciding whether a process is in a state of statistical control and for monitoring an in-control process. out of control point measurement axis UCL centerline LCL subgroup sample axis The following example uses the Coating. To replicate this example. • Graph > Control Chart subcommands create control charts that can be updated dynamically as samples are received and recorded or added to the data table. • The vertical axis of a control chart is scaled in the same units as the summary statistic. A subgroup sample of four is chosen.1. An X -chart and an R-chart for the process are shown in Figure 7. Introduction Control charts have the following characteristics: • Each point represents a summary statistic computed from a subgroup sample of measurements of a quality characteristic. • The center line on a Shewhart control chart indicates the average (expected) value of the summary statistic when the process is in statistical control. This monitoring process is often called quality control or QC.jmp data in the Quality Control sample data folder (taken from the ASTM Manual on Presentation of Data and Control Chart Analysis). labeled UCL and LCL. The quality characteristic of interest is the Weight column. • The horizontal axis of a control chart identifies the subgroup samples. give the range of variation to be expected in the summary statistic when the process is in statistical control. • The upper and lower control limits. . Choose the Graph > Control Chart > XBar command.

. Specify Weight as the Process variable. click the sample six summary point on either control chart. Click OK. Sample six indicates that the process is not in statistical control. The corresponding rows highlight in the data table. Since our example has four samples in each subgroup. To check the sample values. Change the Sample Size Constant from 5 to 4.108 7 Control Charts Introduction Note the selected chart types of XBar and R.

Specify the interval variable Weight as Y and the nominal variable Sample as X. Figure 7.2 show that the sixth sample has a small range of high values. change the modeling type of Sample to Nominal. The box plots in Figure 7.2 Quantiles Option in Fit Y By X Platform all values in sample six are high .7 Control Charts Introduction 109 7 Quality Control with Control Charts Figure 7. First.1 Variables Charts for Coating Data { You can use Fit Y by X for an alternative visualization of the data.

you see a Control Chart Launch dialog similar to the one in Figure 7. .4. Figure 7. Figure 7. you specify exactly how you want your charts created.) Initially.4 Control Chart Launch Dialog process information chart type information limits specification enter or remove known statistics Add capability analysis to report Through interaction with the Launch dialog.110 7 Control Charts The Control Chart Launch Dialog The Control Chart Launch Dialog When you select a Control Chart from the Graph > Control Chart menu (Figure 7. for measurement variable selection • chart type information • limits specification. the dialog shows three kinds of information: • process information. The following sections describe the panel elements.3 Control Chart Menu Specific information shown for each section varies according to the type of chart you request. (The exact controls vary depending on the type of chart you choose.3).

When the chart type is IR. specify measurements as the process. check the Sample Grouped by Sample Label radio button and enter the sample identifying column as the sample label. • For variables charts. c-. using the Coating. specify the defect count or defective proportion as the process. and u-charts). a Range Span text box appears. the samples are identified by their subgroup sample number. this variable is the subgroup sample size. If no sample label variable is specified. its values are used to label the horizontal axis. • If the sample subgroups are the same size. The illustration in Figure 7. Process selects variables for charting. • If the sample subgroups have an unequal number of rows or have missing values and you have a column identifying each sample. • For attribute charts. Sample Label enables you to specify a variable whose values label the horizontal axis and can also identify unequal subgroup sizes. In Variables charts. check the Sample Size Constant radio button and enter the size into the text box. The range span specifies the number of consecutive measurements from which the moving ranges are computed. If you entered a Sample Label variable. np-. For attribute charts (p-.5 shows an X -chart for a process with unequal subgroup sample sizes.7 Control Charts The Control Chart Launch Dialog 111 7 Quality Control with Control Charts Process Information The Launch dialog displays a list of columns in the current data table. . Here. it identifies the sample.jmp sample data from the Quality Control sample data folder. you specify the variables to be analyzed and the subgroup sample size.

phases might correspond to time periods during which a new process is brought into production and then put through successive changes. and resulting tests. a new sigma.112 7 Control Charts The Control Chart Launch Dialog Figure 7. . A phase is a group of consecutive observations in the data table. For example. zones. Phases generate.5 Variables Charts with Unequal Subgroup Sample Sizes Phase The Phase role enables you to specify a column identifying different phases. or sections. set of limits. for each level of the specified Phase variable.

R. and S checkboxes. and Median moving range charts. Moving average charts and cusum charts can be thought of as special kinds of variables charts. Moving Range. X . . • The Cusum chart is a special chart for means or individual measurements. r.7 Control Charts The Control Chart Launch Dialog 113 7 Quality Control with Control Charts Chart Type Information Shewhart control charts are broadly classified as variables charts and attribute charts. • The IR menu selection has checkbox options for the Individual Measurement.and s- IR CUSUM • XBar charts menu selection gives XBar.

01 or 0. Parameters You specify computations for control limits by entering a value for k (K Sigma) or by entering a probability for α(Alpha).6 K Sigma =3 (left) and K Sigma=4 (right) Control Limits Alpha specifies control limits (also called probability limits) in terms of the probability α that a single subgroup statistic exceeds its control limits. NP. click the Alpha radio button and enter the probability you want. This lets you enter historical statistics (statistics obtained from historical data) for the process variable. a tab with editable fields is appended to the bottom of the launch dialog. There must be a specification of either K Sigma or Alpha. The example here shows 1 as the standard deviation of the process variable and 20 as the mean measurement. The dialog default for K Sigma is 3. which shows as the center line. assuming that the process is in control. To specify alpha. and Run Charts.114 7 Control Charts The Control Chart Launch Dialog • P. The Control Chart platform uses those entries to construct control charts. which is three standard deviations. C.001 Using Specified Statistics If you click the Specify Stats (when available) button on the Control Chart Launch dialog. the number of sigmas. . The examples shown in Figure 7. To specify k. K Sigma specifies control limits at k sample standard errors above and below the expected value.6 compare the X -chart for the Coating. click K Sigma and enter a positive k value into the textbox.jmp data with control lines drawn with K Sigma = 3 and K Sigma = 4. and U charts. The usual choice for k is three. Figure 7. have no additional specifications. K Sigma allows specification of control limits in terms of a multiple of the sample standard error. Reasonable choices for α are 0.

After entering the specification limits and clicking OK. If you specify a label at an increment of eight. If you check the Capability option on the Control Chart launch dialog (see Figure 7. axis values. For example. it is labeled in the plot as µ0. gridline and reference lines to display on the x-axis.7 Control Charts Tailoring the Horizontal Axis 115 7 Quality Control with Control Charts Note: When the mean is user-specified.4). Its items vary with the type of chart you select. Tailoring the Horizontal Axis When you double-click the x-axis. which is the default option. with seven ticks between them.7. The standard deviation for the control chart selected is sent to the dialog and appears as a Specified Sigma value.JMP data lists eight measures a day for three days. as shown in the chart on the right. . the x-axis is labeled at every other tick. capability output appears in the same window next to the control chart. a dialog appears as the platform is launched asking for specification limits. number of ticks. • There is a menu of items on the chart type title bar with options that affect each chart individually. by default.7 Example of Labeled x-Axis Tick Marks Display Options Control Charts have popup menus that affect various parts of the platform: • The menu on the top-most title bar affects the whole platform window. Figure 7. the Pickles. the x-axis is labeled once for each day. In this example. the X Axis Specification dialog appears for you to specify the format. Sometimes this gives redundant labels. as shown to the left in Figure 7.

or right-click the chart space. Connect Points toggles between connecting and not connecting the points.116 7 Control Charts Display Options Single Chart Options The popup menu of chart options appears when you click the icon next to the chart name. . Box Plots superimposes box plots on the subgroup means plotted in a Mean chart. Needle connects plotted points to the center line with a vertical line segment. minimum. 75th percentile. The box plot shows the subgroup maximum. Show Points toggles between showing and not showing the points representing summary statistics. You can use this option to suppress the markers denoting subgroup means when the Box Plots option is in effect. Markers for subgroup means show unless you deselect the Show Points option. Initially. 25th percentile. and median. the points show. It is most appropriate for larger subgroup sample sizes (more than 10 samples in a subgroup). The Box Plots option is available only for X -charts. The control limits displayed apply only to the subgroup mean.

Line Width allows you to pick the width of the control lines.8 Box Plot Option and Needle Option for Airport. These spe- . Limits Color displays the JMP-SE color palette for you to choose the color of the line segments used in the upper and lower limits lines. Tests 1 to 4 apply to Mean. Tests 5 to 8 apply to Mean charts and Individual Measurement charts only. Show Control Limits toggles between showing and not showing the chart control limits and their legends. Show Center Line initially displays the center line in green.7 Control Charts Display Options 117 7 Quality Control with Control Charts Figure 7. Tests shows a submenu that enables you to choose which tests to mark on the chart when the test is positive. Medium. Deselecting Show Center Line removes the center line and its legend from the chart. or Thick. until the sample sizes become equal. Tests apply only for charts whose limits are 3σ limits. If tests do not apply to a chart.jmp Data Connect Color displays the JMP-SE color palette for you to choose the color of the line segments used to connect points. but will not appear for charts whose control limits vary due to unequal subgroup sample sizes. Individual and attribute charts. Tests apply. Options are Thin. Center Line Color displays the JMP-SE color palette for you to choose the color of the line segments used to draw the center line. the Tests option is dimmed.

K. c-. B. For example. you can run an OC curve directly from the QC tab on the JMP-SE Starter window.” p.jmp sample data. JMP-SE opens a new window containing the curve. 122. and u-charts. This test works on all charts with limits. Show Zones toggles between showing and not showing the zone lines with the tests for special causes. OC Curve gives Operating Characteristic (OC) curves for specific control charts. and C as shown here in the Mean plot for weight in the Coating. For more information on special causes tests. centered on the center line. and sample size. When you choose the OC Curve option from the control chart option list. Window Options The popup menu on the window title bar lists options that affect the report window. see “Tests for Special Causes” on page 119 later in this chapter. you could use this command. Alternatively. UCL. Sigma. See the text and chart in “Westgard Rules. using all the calculated values directly from the active control chart. The zones are labeled A. Select the chart on which you want the curve based.118 7 Control Charts Display Options cial tests are also referred to as the Western Electric rules. then a dialog prompts you for Target. p-. Control Chart tests use the zone lines as boundaries. regardless of the sample size being constant. and regardless of the size of k or the width of the limits. and wanted to flag any points beyond the limits of an r-chart. . You can check each chart to show or hide it. OC curves are defined in JMP-SE only for X -. Test Beyond Limits flags as a “*” any point that is beyond the limits. if you had unequal sample sizes. The example menu shown here appears if you request XBar and R at the same time. np-. Westgard Rules are detailed in a later section. The curve shows how the probability of accepting a lot changes with the quality of the sample. LCL. The seven zone lines are set one sigma apart.

UCL. When you select several tests for display and more than one test signals at a particular point. the last point in the test sequence is labeled with the test number. The right-hand chart appears if you use the Connect thru Missing option. Details are found in Figure 2. which is the default. You can select one or more tests for special causes with the options popup menu. Nelson (1984) developed the numbering notation used to identify special tests on control charts. where the sequence is the moving set of points evaluated for that particular test. and 10 missing with the points not connected.9 is a control chart with no missing points. The middle chart has samples 8.9 Example of Connect thru Missing Option Script has a submenu of commands available to all platforms that let you redo the analysis or save the JSL commands for the analysis to a window or a file. The following options show for all control charts except Run charts: Show Limits Legend shows or hides the Avg. Unavailable options show as grayed menu items. and LCL values to the right of the chart.20 on page 39. the label of the numerically lowest test specified appears beside the point. If a selected test is positive. Capability launches a capability analysis. Figure 7. The left-hand chart in Figure 7.7 Control Charts Tests for Special Causes 119 7 Quality Control with Control Charts The specific options that are available depend on the type of control chart you request. Tests for Special Causes The Tests option in the chart type popup menu displays a submenu for test selection. 9. Connect thru Missing connects points when some samples have missing values. .

each with a width of one standard deviation. See Nelson (1984. B. Tests 3. and 6 apply to the upper and lower halves of the chart separately. 2. • Any point lying on a line separating two zones lines is considered belonging to the outermost zone. 5. and 8 apply to the whole chart. the R-chart can be used to rule out increases in variation. or operators. For interpreting Test 1. 4. Tests 1. c-.120 7 Control Charts Tests for Special Causes Western Electric Rules Western Electric rules are implemented in the Tests submenu. 1985) for further recommendations on how to use these tests. • A point lies in Zone B or beyond if it lies beyond the line separating zones C and B.detects a shift in the process mean. np-.1 on page 120 lists and interprets the eight tests. Tests 1 through 8 apply to Mean ( X ) and individual measurement charts. 1985) Table 7. . and Figure 7. vendors. gle (upper or lower) side of Zone C or beyond Test 3 Six points in a row steadily detects a trend or drift in the process increasing or decreasing mean. Note: All Tests and zones require equal sample sizes in the subgroups of nonmissing data. C. Table 7. or a single aberration in the process. B.1 Description and Interpretation of Special Causes Tests Test 1 One point beyond Zone A detects a shift in the mean.10 illustrates the tests. Tests 1 through 4 can also apply to p-. Test 2 Nine points in a row in a sin. • The zones are labeled A. if it is more than one standard deviation from the centerline. 7. C. 3σ limits zones centerline Nelson (1984. The following rules apply to each test: • The area between the upper and lower limits is divided into six zones. and u-charts. Test 4 Fourteen points in a row detects systematic effects such as two alteralternating up and down nately used machines. Small trends will be signaled by this test before Test 1. That is. an increase in the standard deviation. A with zones C nearest the center line.

Test 7 Fifteen points in a row in detects stratification of subgroups when Zone C. Test 6 Four out of five points in detects a shift in the process mean. above and below the the observations in a single subgroup center line come from various sources with different means.1 Description and Interpretation of Special Causes Tests Test 5 Two out of three points in a detects a shift in the process average or row in Zone A or beyond increase in the standard deviation. Any two out of three points provide a positive test. Test 8 Eight points in a row on both detects stratification of subgroups when sides of the center line with the observations in one subgroup come none in Zones C from a single source. but subgroups come from different sources with different means. .7 Control Charts Tests for Special Causes 121 7 Quality Control with Control Charts Table 7. Any Zone B or beyond four out of five points provide a positive test.

1 2s refers to a test that one point is two standard deviations away from the mean.122 7 Control Charts Tests for Special Causes Figure 7. 1985) Westgard Rules Westgard rules are implemented under the Westgard Rules submenu of the Control Chart platform.10 Illustration of Special Causes Tests Test 1: One point beyond Zone A UCL A B C C B A Test 2: Nine points in a row in a single (upper or lower) side of Zone C or beyond UCL A B C C B A Avg 2 Avg LCL LCL 1 Test 3: Six points in a row steadily increasing or decreasing A 3 UCL B C C B A Test 4: Fourteen points in a row alternating up and down A B C C B A UCL Avg LCL Avg LCL 4 Test 5: Two out of three points in a row in Zone A or beyond UCL A 5 5 B C C B A Test 6: Four out of five points in a row in Zone B or beyond UCL A 6 B C C B A Avg 5 LCL Avg LCL Test 7: Fifteen points in a row in Zone C (above and below the centerline) UCL A B C C B A Test 8: Eight points in a row on both sides of the centerline with none in Zone C UCL A B C C B A 7 Avg LCL Avg 8 LCL Nelson (1984. . For example. The different tests are abbreviated with the decision rule for the particular test.

The rule is triggered when any one point goes beyond these limits. where control limits are set 2 standard deviations away from the mean. they can be computed without regard to constant sample size. -1s -2s -3s Rule 4s is triggered when one measure- ment in a group is two standard deviations above the mean and the next is two standard deviations below. -1s -2s -3s Rule 2 2s is triggered when two consecu+3s +3s +2s +1s -1s -2s -3s UCL Avg LCL UCL Avg LCL UCL Avg LCL UCL Avg LCL UCL Avg LCL UCL Avg LCL tive control measurements are farther than +2s +1s two standard deviations from the mean. Table 7. The rule is triggered when any one point goes beyond these limits. +3s +2s +1s -1s -2s -3s Rule 4 1s is triggered when four consecu.7 Control Charts Tests for Special Causes 123 7 Quality Control with Control Charts Because Westgard rules are based on sigma and not the zones.+3s +2s tive points are on one side of the mean. +1s -1s -2s -3s . Rule 1 3s refers to a rule common to +3s Levey-Jennings plots where the control limits are set 3 standard deviations away +2s +1s from the mean. -1s -2s -3s Rule 10 X is triggered when ten consecu.2 Westgard Rules Rule 1 2s is commonly used with Levey-Jennings plots.+3s +2s tive measurements are more than one +1s standard deviation from the mean.

it does not appear on the graph. and Deleted Samples Sample is excluded before creating the Sample is not included in the calculation of the limits. and the sample marker does not appear on the graph. and it chart. Hidden. the sample label will still appear on the axis. Hidden. Moving average charts are special kinds of control charts for variables. the sample label will still appear on the axis. Data set is subsetted with Sample Sample is not included in the calculation of the limits. Sample is excluded after creating the Sample is included in the calculation of the limits. and Deleted Samples Excluded. but does after creating the chart. not appear on the graph. Hidden. it appears on the graph.3 Excluded. but does chart. Shewhart Control Charts Shewhart control charts are broadly classified into control charts for variables and control charts for attributes. and before creating the chart. appears in the graph. and the limits will change.124 7 Control Charts Excluded. the sample will still appear on the chart. if the second observation in the sample is hidden. Data set is subsetted with Sample Sample is not included in the calculation of the limits. does not appear on the graph. but will not flag if a sample is hidden. while the first observation is not hidden. The Control Chart platform in JMP-SE implements a variety of control charts: . 2 An exception to the exclude/hide rule: Tests for Special Causes can flag if a sample is excluded. and deleted after creating chart. The sample marker will disappear from the graph. For example. but limits remain the same. not appear on the graph. The sample marker will disappear from the graph. Sample is hidden before creating the Sample is included in the calculation of the limits. not appear on the graph. Sample is hidden after creating the Sample is included in the calculation of the limits. Sample is both excluded and hidden Sample is not included in the calculation of the limits. the sample label will still be removed from the axis. Some additional notes: 1 Exclude and Hide operate only on the row state of the first observation in the sample. axis will not include a value for the sample. the graph will shift. Sample is both excluded and hidden Sample is included in the calculation of the limits. but does chart. Nothing will change on the output by excluding a sample with the graph open. but limits remain the same. but chart. The sample marker will disappear from the graph. and Deleted Samples The following table summarizes the effects of various conditions on samples and subgroups: Table 7. the deleted before creating chart.

and u-charts. a typical analysis shows both the process mean and its variability with a mean chart aligned above its corresponding R. in the Quality Control sample data folder. An X -chart and an S chart for the process are shown in Figure 7. This time the quality characteristic of interest is the Weight 2 column. the individual measurement chart shows above its corresponding moving range chart. Shewhart Control Charts for Variables Control charts for variables are classified according to the subgroup summary statistic plotted on the chart: • X -charts display subgroup means (averages) • R-charts display subgroup ranges (maximum – minimum) • S-charts display subgroup standard deviations. and S. p-. R-. and u. np-. IR-. • Select the chart types of XBar and S. • p-.or S-chart.s Example.Charts For quality characteristics measured on a continuous scale.jmp. is that they update dynamically as data is added or changed in the table. different from other platforms in JMP-SE. and S-charts. Or. X . c-. • Click OK. Sample as the Sample Label variable. • Choose the Graph > Control Chart > XBar command. • Specify Weight 2 as the Process variable. The IR selection gives two additional chart types: • Individual Measurement charts display individual measurements • Moving Range charts display moving ranges of two or more successive measurements. if you are charting individual measurements. c-.and S-charts with varying subgroup sizes This example uses the same data as example 1. • Phase Control Charts for X -. .7 Control Charts Shewhart Control Charts for Variables 125 7 Quality Control with Control Charts • X -. Coating. r-. • Run charts display data as a connected series of points.charts One feature special to Control Charts. • Individual and Moving Range charts. • Specify the column.11. XBar-. To replicate this example. R-. • The Sample Size option should automatically change to Sample Grouped by Sample Label. np-.

Although. 5. each sample has the same number of observations. Run Charts Run charts display a column of data as a connected series of points. If the samples change while the chart is open and they become equally sized. and 7 each have a missing value. so you may notice the chart has uneven limits.jmp. Note: Although they will turn on and appear checked. samples 1.11 X and S charts for Varying Subgroup Sizes Weight 2 has several missing values in the data.126 7 Control Charts Shewhart Control Charts for Variables Figure 7. and the zone and/or test option is selected. as neither are valid on charts with unequally sized samples. the zones and/or tests will be applied immediately and appear on the chart. The following example is a Run chart for the Weight variable from Coating. 3. no zones or tests will appear on the chart until all samples are equally sized. .

When Use Median is selected.The center line is determined by the Use Median setting of the platform drop-down. Example. a line is drawn through the center value of the column. Run charts can also plot the group means when a sample label is given. The data table records day. high acidity ruins an entire pickle vat. Individual Measurement Charts Individual Measurement Chart Type displays individual measurements. the median is used as the center line. . Moving Range Chart Type displays moving ranges of two or more successive measurements. Otherwise. Because moving ranges are correlated. Moving ranges are computed for the number of consecutive measurements you enter in the Range Span box. 2. The acidity in four vats is measured each day at 1.13 creates Individual Measurement and Moving Range charts with date labels on the horizontal axis. The default range span is 2. both the overall mean and median are saved. The dialog in Figure 7.12 Run Chart When you select the Show Center Line option in the Run Chart drop-down. and acidity measurements. Individual Measurement and Moving Range Charts The Pickles. the mean is used. time.7 Control Charts Shewhart Control Charts for Variables 127 7 Quality Control with Control Charts Figure 7. and 3 PM. either on the dialog or through a script. these charts should be interpreted with care. Individual Measurement charts are appropriate when only one measurement is available for each subgroup sample. Because the pickles are sensitive to acidity and produced in large vats. When saving limits to a file.jmp data in the Quality Control sample data folder contains the acid content for vats of pickles.

• Click OK. • Select both Individual Measurement and Moving Range chart types. • Specify Acid as the Process variable.128 7 Control Charts Shewhart Control Charts for Variables Figure 7.13 Launch Dialog for Individual Measurement and Moving Range Chart To complete this example. • Choose the Graph > Control Chart > IR command. • Specify Date as the Sample Label variable.14 monitor the acidity in each vat produced. The individual measurement and moving range charts shown in Figure 7. .

the limits on the Individuals chart use the Median Moving Range as the sigma. This data is often continuous. it is important when comparing charts to determine whether you have similar number of items in the subgroups between the charts. where the variable of interest is a discrete count of the number of defects or blemishes per subgroup. Since the counts are measured per subgroup. rather than the Average Moving Range.14 Individual Measurement and Moving Range Charts for Pickles Data Note: If you choose a Median Moving range chart. like variables charts. For discrete count data.4 Determining which Attribute Chart to use Each item is judged as either conforming or For each item.7 Control Charts Shewhart Control Charts for Attributes 129 7 Quality Control with Control Charts Figure 7. Attribute charts. Another type of data is count data. Shewhart Control Charts for Attributes In the previous types of charts. are classified according to the subgroup sample statistic plotted on the chart: Table 7. attribute charts are applicable. the number of defects is counted non-conforming The subgroups are a The subgroups vary in The subgroups are a The subgroups vary in constant size size constant size size np-chart p-chart c-Chart u-chart . measurement data was the process variable. as they are based on binomial and poisson models. and the charts are based on continuous theory.

jmp data. and an item is judged as either conforming or nonconforming. • Choose # defects as the Process variable.15 np-Chart Example. . we can specify a sample size variable. np-Charts The Washers. Since each subgroup for a p-chart consists of N items. • c-charts display the number of nonconformities (defects) in a subgroup sample that usually consists of one inspection unit. If a washer contained a finish defect. Thus. Figure 7. • np-charts display the number of nonconforming (defective) items in constant sized subgroup samples.and np-Charts Example. • u-charts display the number of nonconformities (defects) per unit in subgroup samples that can have a varying number of inspection units. follow these steps: • Choose the Graph > Control Chart > NP command. p-Charts Again. Since each subgroup for a np-chart consists of Ni items. which would allow for varying sample sizes. The example here illustrates an np-chart for the number of defects. it was deemed nonconforming or defective.jmp data in the Quality Control sample data folder contains defect counts of 15 lots of 400 galvanized washers. The washers were inspected for finish defects such as rough galvanization and exposed steel. and an item is judged as either conforming or nonconforming. the maximum number of nonconforming items in subgroup i is Ni. the maximum number of nonconforming items in a subgroup is N. • Click OK. p. To replicate this example. using the Washers.130 7 Control Charts Shewhart Control Charts for Attributes • p-charts display the proportion of nonconforming (defective) items in subgroup samples which can vary in size. the defect count represents how many washers were defective for each lot of size 400. • Change the Constant Size to 400.

• Click OK. Note:When you generate a u-chart. and select Capability. . The u-chart. is monitoring the number of brace defects per subgroup sample size. The upper and lower bounds vary according to the number of units inspected. A box of braces is one inspection unit. • Choose Lot as the Sample Label variable. Follow these steps or submit the JSL script below: • Choose the Graph > Control Chart > P command.16 p-Chart Note that although the points on the chart look the same as the np-chart. shown here. the y-axis. • Choose # defects as the Process variable. which can vary.7 Control Charts Shewhart Control Charts for Attributes 131 7 Quality Control with Control Charts Note: This data contains all constant sample sizes. JMP-SE launches the Poisson Fit in Distri- bution and gives a Poisson-specific capability analysis.jmp data in the Quality Control sample data folder records the defect count in boxes of automobile support braces. Avg and limits are all different since they are now based on proportions u-Charts The Braces. Figure 7. • Choose Lot Size as the Sample Size variable. The number of boxes inspected (per day) is the subgroup sample size. The chart shown here illustrates a p-chart for the proportion of defects.

• Choose # Defects as the Process variable. • Choose # defects as the Process variable. • Open the Shirts. Prior to shipment.132 7 Control Charts Shewhart Control Charts for Attributes Figure 7. Since the manufacturer is interested in the average number of flaws per shirt. However. Example 10. • Choose the Graph > Control Chart > C command. Note:When you generate a c-chart. follow these steps or submit the JSL below. c-charts can also be used to monitor the average number of defects per inspection unit. • Choose Date as the Sample Label. . each shirt is inspected for flaws. • Open the Braces. the number of flaws found in each box is divided by ten and then recorded. c-Charts c-charts are similar to u-charts in that they monitor the number of nonconformities in an entire subgroup. • Choose Box as the Sample Label.jmp data in the Quality Control sample data folder. • Choose Unit size as the Unit Size variable. • Click OK. follow these steps or submit the JSL below. made up of one or more units.17 u-Chart Example. they require constant subgroup sizes. To replicate this example. • Choose Box Size as the Sample Size. c-Charts for Noncomformities per Unit In this example. a clothing manufacturer ships shirts in boxes of ten.jmp data in the Quality Control sample data folder. and select Capability. • Choose the Graph > Control Chart > U command. JMP-SE launches the Poisson Fit in Distribu- tion and gives a Poisson-specific capability analysis. u-Charts To replicate this example.

• Choose DIAMETER as the Process. phases might correspond to time periods during which a new process is brought into production and then put through successive changes. zones. Presummarized. p-. Example Open Diameter.7 Control Charts Phases 133 7 Quality Control with Control Charts • Click OK. s-.18 c-Chart Phases A phase is a group of consecutive observations in the data table. Phases generate. np-. For example. both with the first prototype (phase 1) and the second prototype (phase 2). the phase variable is examined. and resulting tests. and Levey-Jennings charts. Saving to a limits file reveals the sigma and specific limits calculated for each phase. set of limits. DAY as the Sample Label.JMP. • Select Graph > Control Chart > XBar. a Phase variable button appears. On the dialog for X -. • Click OK. found in the Quality Control sample data folder. and Phase as the Phase. If a phase variable is specified. to identify to which phase each row belongs. a new sigma. Figure 7. u-. This data set contains the diameters taken for each day. c-. for each level of the specified Phase variable. row by row. IR-. . r-.

20 Phase Control Chart .134 7 Control Charts Phases Figure 7.19 Launch Dialog for Phases The resulting chart has different limits for each phase Figure 7.

two-sided charts allow specification in terms of error probabilities α and β. or two-sided to detect a shift in either direction. as in Figure 7. and the arms extended backward on the x-axis. Points crossing the lower arm signal an increasing process mean. As data are collected. you compare the points with limits that compose a V-mask. The point where the slope changes is the point where the shift occurs.21 Illustration of a V-Mask for a Two-Sided Cusum Chart upper arm vertex lower arm d h the rise in the arm corresponding to the distance (d) from origin to vertex 1 unit k. and points crossing the upper arm signal a downward shift. the rise in the arm corresponding to one sampling unit Shifts in the process mean are visually easy to detect on a cusum chart because they produce a change in the slope of the plotted points. There are major differences between cusum charts and other control (Shewhart) charts: • A Shewhart control chart plots points based on information from a single subgroup sample.21). each point is based on information from all samples taken up to and including the current subgroup. . Both charts can be specified in terms of geometric parameters (h and k described in Figure 7. In cusum charts. the limits can be either in the form of a V-mask or a horizontal decision interval.21. horizontal control limits define whether a point signals an out-of-control condition. JMP cusum charts can be one-sided. A condition is out-of-control if one or more of the points previously plotted crosses the upper or lower arm of the V-mask. Figure 7. the cumulative sum sequence is updated and the origin is relocated at the newest point. A V-mask is formed by plotting V-shaped limits.7 Control Charts Cumulative Sum (Cusum) Charts 135 7 Quality Control with Control Charts Cumulative Sum (Cusum) Charts Cumulative Sum (Cusum) charts display cumulative sums of subgroup or individual measurements from a target value. On a cusum chart. To interpret a two-sided Cusum chart. The origin of a V-mask is the most recently plotted point. Cusum charts are graphical and analytical tools for deciding whether a process is in a state of statistical control and for detecting a shift in the process mean. • On a Shewhart control chart. which detect a shift in one direction from a specified target mean.

If it is not checked. the limits are determined from average run length.136 7 Control Charts Cumulative Sum (Cusum) Charts • The control limits on a Shewhart control chart are commonly specified as 3σ limits. which lets you enter the process variable specifications. H is the decision interval. including appropriate options and specifications as shown here. Note: The following items pertain only to cusum charts: Two Sided requests a two-sided cusum scheme when checked. Data Units specifies that the cumulative sums be computed without standardizing the subgroup means or individual values so that the vertical axis of the cusum chart is scaled in the same units as the data. You also enter a value for the increase in the lower V-mask per unit change on the subgroup axis (See Figure 7. or from an economic design. Beta specifies the probability of failing to discover that the specified shift occurred. A cusum chart is more efficient for detecting small shifts in the process mean. Beta is the probability of a Type II error and is available only when you specify Alpha.26). . For a one-sided scheme. Lucas (1976) comments that a V-mask detects a 1σ shift about four times as fast as a Shewhart control chart. the Control Charts Launch dialog appears. Note: Data Units requires that the subgroup sample size be designated as constant. Specify Stats appends the panel shown here to the Control Charts Launch dialog. Launch Options for Cusum Charts When you choose Graph > Control Charts > Cusum. Choose H as a multiple of the standard error. On a cusum chart. When you click H. a decision interval is displayed. from error probabilities. If an H value is specified. the Beta entry box is labeled K. a one-sided scheme is used and no V-mask appears. H is the vertical distance h between the origin for the V-mask and the upper or lower arm of the V-mask for a two-sided scheme.

Delta is an alternative to the Shift option (described next).7 Control Charts Cumulative Sum (Cusum) Charts 137 7 Quality Control with Control Charts Target is the target mean (goal) for the process or population. Head Start specifies an initial value for the cumulative sum. Enter Head Start as a multiple of standard error. Shift is the minimum value you want to detect on either side of the target mean. respectively. and n is the (common) subgroup sample size. The relationship between Shift and Delta is given by ∆ δ = ----------------------(σ ⁄ ( n)) where δ represents Delta. By default. . You can use Sigma instead of the Alpha option on the Control Charts Launch dialog. Cusum Chart Options Cusum charts have these options (in addition to standard chart options). Connect Points connects the sample points with a straight line. ∆ represents the shift. You enter the shift value in the same units as the data. Show Points shows or hides the sample data points. σ0. Delta specifies the absolute value of the smallest shift to be detected as a multiple of the process standard deviation or of the standard error. σ represents the process standard deviation. The target mean must be scaled in the same units as the data. σ. and you interpret it as a shift in the mean of the sampling distribution of the subgroup mean. the Control Chart platform estimates sigma from the data. You can choose either Shift or Delta. for the process standard deviation. S0. for a one-sided cusum scheme (S0 is usually zero). depending on whether the shift is viewed as a shift in the population mean or as a shift in the sampling distribution of the subgroup mean. Sigma specifies a known standard deviation.

Subgroup samples of four cans are selected and weighed every hour for twelve hours. The process is set so that the average weight of a filled can. A two-sided cusum chart is used to detect shifts of at least one standard deviation in either direction from the target mean of 8. or center line. Previous analysis shows that the standard deviation of fill weights. • Choose the Graph > Control Chart > CUSUM command. • Click the H radio button and enter 2 into the text box. Center Line Color displays the JMP color palette for you to select a color for the center line. A machine fills 8-ounce cans of two-cycle engine oil additive. . is 8. is 0.jmp data table contains one value of weight along with its associated value of hour. • Specify weight as the Process variable. Each observation in the Oil1 Cusum.10 ounces. Show V Mask shows or hides the V-mask based on the parameters (statistics) specified on the Control Charts Launch dialog when Cusum is selected as the Chart Type.26) that summarizes the Cusum charting parameters.10 ounces. The Control Chart platform assumes that the data are sorted in increasing order. • Specify hour as the Sample Label. Show ARL displays the average run length (ARL) information.jmp file from the Quality Control sample data folder. Connect Color displays the JMP color palette for you to select a color for the connect line when the Connect Points option is in effect. Example 1. Two-Sided Cusum Chart with V-mask To see an example of a two-sided cusum chart. µ0. To create a Cusum chart for this example. The observations are sorted so that the values of hour are in increasing order. open the Oil1 Cusum.138 7 Control Charts Cumulative Sum (Cusum) Charts Mask Color displays the JMP color palette for you to select a line color for the V-mask.05 ounces. The filling process is believed to be in statistical control. Show Shift shows or hides the shift you entered. σ0. • Click the Two Sided check box if it is not already checked. Show Parameters displays a Parameters table (see Figure 7.

• Set Target to the average weight of 8. The finished dialog should look like the one in Figure 7. Figure 7. there is no evidence that a shift in the process has occurred.22. Because none of the points cross the arms of the V-mask.7 Control Charts Cumulative Sum (Cusum) Charts 139 7 Quality Control with Control Charts • Click Specify Stats to open the Known Statistics for CUSUM chart tab.05. • Set Sigma to the standard deviation of 0.23 appears.1.22 Dialog for Cusum Chart Example When you click OK. Figure 7.jmp Data You can interpret the chart by comparing the points with the V-mask whose right edge is centered at the most recent point (hour=12).23 Cusum Chart for Oil1 Cusum. . • Enter a Delta value of 1. the chart in Figure 7.

The center line and V-mask adjust to reflect the process condition at that point. • Choose the Graph > Control Chart > CUSUM command. The resulting output should look like the picture in Figure 7. . • Click Specify Stats to open the Known Statistics for CUSUM chart tab.05.140 7 Control Charts Cumulative Sum (Cusum) Charts A shift or out-of-control condition is signaled at a time t if one or more of the points plotted up to the time t cross an arm of the V-mask. and a downward shift is signaled by points crossing the upper arm. and not so concerned about under-filling. The cusum chart automatically updates when you add new samples. • Enter a Delta value of 1.24 is the previous chart with additional points.25. • Deselect the Two Sided check box. One-Sided Cusum Chart with no V-mask Consider the data used in Example 1. Anything 0. • Specify weight as the Process variable.25 ounces beyond the mean of 8. • Specify hour as the Sample Label.1.24 Updated Cusum Chart for the OIL Data Example 2. • Set Target to the average weight of 8. • Set Sigma to the standard deviation 0. To do this example. where the machine fills 8-ounce cans of engine oil.1 is considered a problem.25 into the text box. • Click the H radio button and enter 0. You can move the origin of the V-mask by using the hand to click a point. The Cusum chart in Figure 7. Consider also that the manufacturer is now concerned about significant over-filling in order to cut costs. Figure 7. An upward shift is signaled by points crossing the lower arm. The time at which the shift occurred corresponds to the time at which a distinct change is observed in the slope of the plotted points. A one-sided Cusum Chart can be used to identify data approaching or exceeding the side of interest.

An additional chart option.25). Figure 7. Show ARL. sometimes denoted ARL0.26. The average run length is the expected number of samples taken before an out-of-control condition is signaled: • ARL (Delta).7 Control Charts Cumulative Sum (Cusum) Charts 141 7 Quality Control with Control Charts Figure 7.25 One-Sided Cusum Chart for the OIL Data Notice that the decision interval or horizontal line is set at the H-value entered (0. is the in-control average run length for the specified parameters (Montogomery (1985)). Also note that no V-mask appears with One-Sided Cusum charts. adds the average run length (ARL) information to the report. The parameters report summarizes the charting parameters from the Known Statistics for CUSUM chart tab on the Control Chart Launch dialog. The Show Parameters option in the Cusum chart popup menu shows the Parameters report in Figure 7.26 Show Parameters and Show ARL Options . is the average run length for detecting a shift the size of the specified Delta • ARL(0). sometimes denoted ARL1.

.

jmp table found in the Time Series sample data folder (Box and Jenkins 1976). you can specify a time ID variable. Note: The Time Series Launch dialog requires that one or more continuous variables be assigned as the time series.1). sorted ascending. evenly spaced. If a time ID variable is specified. Time ID. which is used to label the time axis. This dialog allows you to specify the number of lags to use in computing the autocorrelations and partial autocorrelations. several forms of exponential smoothing and Winter’s method. A time series is a set y1. assign Passengers as Y.. In addition. autoregressive integrated moving-average. analyze. often called Box-Jenkins models • Smoothing Models. Time Series and Time as X. The time series variable is Passengers and the time ID is Time. .. These indicate how and to what degree each point in the series is correlated with earlier values in the series and can be used to identify the type of model appropriate for describing and predicting (forecasting) the evolution of the time series. It also lets you specify the number of future periods to forecast using each model fitted to the data. and without missing values. Figure 8. the platform displays graphs of the autocorrelations and partial autocorrelations of the series.8 Time Series The Time Series platform lets you explore. Introduction The data for the next examples are in the Seriesg. .1 Launch Dialog .yN of observations taken over a series of equally-spaced time periods. y2. For this example. The model types include • ARIMA. it must be continuous. and forecast univariate time series. Optionally. The analysis begins with a plot of the points in the time series. Select Analyze > Time Series to display the Time Series Launch dialog (Figure 8.

Time Series and Time as X. Again select Analyze > Time Series Assign Log Passengers as Y.2 Initial Time Series Report The graph shows that the series has an increasing spread over time. The series now has an acceptable appearance for modeling. increasing variances are transformed using logarithms. Time ID. . In general. A column containing a count of Log Passengers is already included in the table. its autocorrelation graph. and its partial autocorrelation graph. Figure 8. This should be accounted for before modeling the series.144 8 Time Series Introduction The first thing you see is a graph showing the time series.

3 Log Passenger Series Since the autocorrelation graph decreases slowly and steadily. JMP Student Edition estimates the model and displays a model summary. Select ARIMA from the platform menu. To try this model. parameter estimates. which is initially closed. Click Estimate. a reasonable guess for a model is an MA(1). however. and a forecast graph.8 Time Series Introduction 145 8 Time Series Figure 8. is the residuals. Enter a 1 beside q. . Moving Average Order. The most important graph. but the partial autocorrelation graph drops off drastically after lag 1.

Figure 8.146 8 Time Series Introduction Figure 8.5 MA(1)Model Results .4 Model Results Open the Residuals node to reveal a graph and autocorrelation plots for the model residuals.

where n is the number of observations. note that JMP is accumulating a list of models. The Forecast Periods box allows you to set the number of periods into the future that the fitted models are forecast. an MA(2). or row number if no time ID is specified (Figure 8. select it in the role box and click Remove. drag variables from the column selector list to one of the following role boxes: X. has features to resize the graph. Click Estimate. JMP uses 25 lags and 25 forecast periods The Time Series Graph The Time Series platform begins with a plot of each times series by the time ID. one or more time series variables. The selector list at the left of the dialog shows all columns in the current table. Moving Average Order. the MA(2) is a worse fit than the MA(1). can be run in the same way. select one or more columns in the column selector list and click a role button.6 Model Comparison Table Examine the R2 for the two models in this table. You set the number of lags for the autocorrelation and partial autocorrelation plots in the Autocorrelation Lags box. It must be more than one but less than the number of rows. click OK to see the analysis for each time series variable versus the time ID. Some reflection is necessary. By default. Time ID for the x-axis. Select ARIMA from the platform menu. To remove an unwanted variable from an assigned role. This is the maximum number of periods between points used in the com- putation of the correlations. The Time Series Platform First. Figure 8. Similar unsatisfactory results appear. However. like others in JMP. In fact. The plot.8 Time Series The Time Series Platform 147 8 Time Series The expected reduction in spikes did not occur. so an MA(1) is not an appropriate model. and label points. in the Model Comparison table. To cast a column into a role.1. highlight points with the cursor or brush tool. Enter a 2 beside q.7). Time Series for the y-axis. After assigning roles. Or. A commonly used rule of thumb for the maximum number of lags is n/4. one variable used for labeling the time axis Y. A second model. along with appropriate fit statistics. assign columns for analysis with the dialog in Figure 8. .

151). Time Series Commands The popup menu next to the time series name has the commands shown here. a summary of all the models is inserted just above the first model report (an example is shown in “Model Comparison Table. The result of each new fit is appended to the report. but can be hidden with commands from the platform popup menu on the Time Series title bar. discuss statistical results. These are typically used to determine the nature of the model to be fitted to the series.148 8 Time Series Time Series Commands Figure 8.5) of the time series are also shown. The following sections describe options and model fits. graphs of the autocorrelation and partial autocorrelation (Figure 8. The first three items in this menu control the descriptive and diagnostic graphs and tables. The platform popup menu. and cover additional platform features. also has fitting commands and options for displaying additional graphs and statistical tables. discussed next.” p. The ARIMA and Smoothing Model commands are for fitting various models to the data and producing forecasts.7 Time Series Plot of Seriesg (Airline Passenger) Data By default. After the first model has been fit. You can select the model fitting commands repeatedly. .

• Connecting Lines hides or displays the lines connecting the points in the time series graph. This graph describes the correlation between all the pairs of points in the time series with a given separation in time or lag. confidence curves show twice the large-lag standard error (± 2 standard errors). • Mean Line hides or displays a horizontal line in the time series graph that depicts the mean of the time series. The Graph command on the platform popup menu has a submenu of controls for the time series plot with the following commands. Partial Autocorrelation The Partial Autocorrelation command alternately hides or displays the graph of the sample partial autocorrelations. • Show Points hides or displays the points in the time series graph.8 Time Series Time Series Commands 149 8 Time Series Graph The Time Series platform begins by showing a time series plot. By definition.7.8. You can examine the autocorrelation and partial autocorrelations plots to determine whether the time series is stationary (meaning it has a fixed mean and standard deviation over time) and what model might be appropriate to fit the time series. The plot on the right in Figure 8. • Time Series Graph hides or displays the time series graph. The solid black lines represent ± 2 standard errors for approximate 95% confidence limits . like the one shown previously in Figure 8. The autocorrelation plot for the Seriesg data is shown on the left in Figure 8. In addition.8 shows the partial autocorrelation function for the Seriesg data. Autocorrelation The Autocorrelation command alternately hides or displays the autocorrelation graph of the sample. the first autocorrelation (lag 0) always has length 1. often called the sample autocorrelation function.

150 8 Time Series Modeling Reports Figure 8. All existing and future forecast results will show the new number of periods with this command. a report of the results of the fit and a forecast is added to the platform results. Modeling Reports The time series modeling commands are used to fit theoretical models to the series and use the fitted model to predict (forecast) future values of the series. . as discussed previously in their respective sections. and smoothing models begin with the following report tables. The initial value is set in the Time Series Launch dialog. seasonal ARIMA.8 Autocorrelation and Partial Correlation Plots Number of Forecast Periods The Number of Forecast Periods command displays a dialog for you to reset the number of periods into the future that the fitted models will forecast. The fit of each model begins with a dialog that lets you specify the details of the model being fit as well as how it will be fit. These commands also produce statistics and residuals that allow you to ascertain the adequacy of the model you have elected to use. You can select the modeling commands repeatedly. The ARIMA. Each general class of models has its own dialog. using a Kalman filter to compute the likelihood function. The models are fit by maximizing the likelihood function. Each time you select a model.

• Invertible indicates whether the moving average operator is invertible. The Model Comparison table shown above summarizes the ARIMA models (1. which summarizes the statistics of the fit. Schwartz’s Bayesian Criterion [SBC or BIC] are goodness of fit statistics. • Variance Estimate is the unconditional sum of squares (SSE) divided by the number of degrees of freedom.” p.8 Time Series Modeling Reports 151 8 Time Series Model Comparison Table The Model Comparison table summarizes the fit statistics for each model. n is the number of nonmissing observations and k is the number of fitted parameters in the model. 1). The numerical values in the table are drawn from the Model Summary table for each fitted model. and (1. • Sum of Squared Errors is the sum of the squares of the prediction errors. (0. Model Summary Table Each model fit generates a Model Summary table. That is. whether all the roots of θ ( z ) = 0 lie outside the unit circle. • DF is the number of degrees of freedom in the fit. 0). 154. • RSquare and RSquare Adj are also goodness of fit statistics. SSE / (n – k). Smaller values of these criteria indicate better fit. 154. This is the sample estimate of the variance of the random shocks at. This is a sample estimate of the standard deviation of at. • Stable indicates whether the autoregressive operator is stable. whether all the roots of φ ( z ) = 0 lie outside the unit circle. 1) respectively. n – k. where values closer to 1 indicate a better fit. Each row corresponds to a different model. 0. You can use it to compare several models fitted to the same time series. the random shocks • Akaike’s Information Criterion [AIC]. That is. . Note: The φ and θ operators are defined in the section “ARIMA Model. • –2LogLikelihood is minus two times the natural log of the likelihood function evaluated at the best-fit parameter estimates. Smaller values are better fits. described in the section “ARIMA Model. detailed in the online help. 0. In the formulae below. • Standard Deviation is the square root of the variance estimate. SSE.” p. 0.

” p. a t-ratio greater (in absolute value) than the computed value. which gives the estimates for the time series model parameters. They are used in constructing tests and confidence intervals. The Parameter Estimates table also gives the Constant Estimate. Some models contain an intercept or mean term. The definition of the constant estimate is given under “ARIMA Model. given a true hypothesis. Often.01) is interpreted as evidence that the parameter is significantly different from zero. • Std Error lists the estimates of the standard errors of the parameter estimates. • Factor (Seasonal ARIMA only) lists the factor of the model that contains the parameter. It is the probability of getting. for models that contain an intercept or mean term. This is only shown for multiplicative models. If the hypothesis is true. 154. • Estimate lists the parameter estimates of the time series model. They are described in the sections on specific time series models.05 significance level.152 8 Time Series Modeling Reports Parameter Estimates Table There is a Parameter Estimates table for each selected fit. Looking for a t-ratio greater than 2 in absolute value is a common rule of thumb for judging significance because it approximates the 0. These are described below for each model type. by chance alone. then this statistic has an approximate Student’s t-distribution. The definition of the constant estimate is given under the description of ARIMA models. • t Ratio lists the test statistics for the hypotheses that each parameter is zero. In those models. Each type of model has its own set of parameters.05 (or sometimes 0. . Factor 1 is nonseasonal and Factor 2 is seasonal. • Lag lists the degree of the lag or backshift operator that is applied to the term to which the parameter is multiplied. the related constant estimate is also shown. In the multiplicative seasonal models. • Prob>|t| lists the observed significance probability calculated from each t-ratio. It is the ratio of the parameter estimate to its standard error. a value below 0. The Parameter Estimates table has these terms: • Term lists the name of the parameter.

It is divided by a vertical line into two regions. Iteration History The model parameter estimation is an iterative procedure by which the log-likelihood is maximized by adjusting the estimates of the parameters. The iteration history for each model you request shows the value of the likelihood function for each iteration. These can be used to determine whether the fitted model is adequate to describe the data. You can control the number of forecast values by changing the setting of the Forecast Periods box in the platform launch dialog or by selecting Number of Forecast Periods from the Time Series drop-down menu. the points in the residual plot should be normally distributed about the zero line and the autocorrelation and partial autocorrelation of the residuals should not have any significant components for lags greater than zero. If it is. These are the actual values minus the one-step-ahead predicted values. Residuals The graphs under the residuals section of the output show the values of the residuals based on the fitted model. the autocorrelation and partial autocorrelation of these residuals are shown. To the left of the separating line the one-step-ahead forecasts are shown overlaid with the input data points.8 Time Series Modeling Reports 153 8 Time Series Forecast Plot Each model has its own Forecast plot. . This can be useful for diagnosing problems with the fitting procedure. Attempting to fit a model which is poorly suited to the data can result in a large number of iterations that fail to converge on an optimum value for the likelihood. The Forecast plot shows the values that the model predicts for the time series. In addition. The data and confidence intervals can be toggled on and off using the Show Points and Show Confidence Interval commands on the model’s popup menu. To the right of the line are the future values forecast by the model and the confidence intervals for the forecasts.

Show Confidence Interval hides or shows the confidence intervals in the forecast graph. 2 p 2 q The a t are assumed to be independent and normally distributed with mean zero and constant variance. The model can be rewritten as φ ( B )w t = δ + θ ( B )a t where the constant estimate δ is given by the relation . φ ( B ) and θ ( B ) . the autoregressive operator and the moving average operator and are written φ ( B ) = 1 – φ 1 B – φ 2 B – … – φ p B and θ ( B ) = 1 – θ 1 B – θ 2 B – … – θ q B a t are the sequence of random shocks. Save Columns creates a new data table with columns representing the results of the model. however. 148.” p. These displays are described in the section “Time Series Commands. The ARIMA command performs a maximum likelihood fit of the specified ARIMA model to the time series. ARIMA Model An AutoRegressive Integrated Moving Average (ARIMA) model predicts future values of a time series by a linear combination of its past values and a series of errors (also known as random shocks or innovations). respectively.154 8 Time Series ARIMA Model Model Report Options The title bar for each model you request has the popup menu shown to the right. For a response series { y i } . the general form for the ARIMA model is: φ ( B ) ( w t – µ ) = θ ( B )a t where t is the time index B is the backshift operator defined as B y t = y t – 1 w t = ( 1 – B ) y t is the response series after differencing d µ is the intercept or mean term. with the following options for that model: Show Points hides or shows the data points in the forecast graph. they are applied to the residual series (the one-step-ahead model predictions minus the input series). Residual Statistics controls which displays of residual statistics are shown for the model.

8 Time Series Smoothing Models δ = φ ( B )µ = µ – φ 1 µ – φ 2 µ – … – φ p µ . . You can check the Model Summary table to see if the resulting fitted model is stable and invertible. β t is the time-varying slope term. The results appear when you click Estimate. s ( t ) is one of the s time-varying seasonal terms. then model would be denoted MA(q). You might want to uncheck this box if the fitter is having difficulty finding the true optimum or if you want to speed up the fit. If the Constrain fit check box is checked. For example.d. An ARIMA model is commonly denoted ARIMA(p. Smoothing Models JMP offers a variety of smoothing techniques.q).d. The Confidence Intervals box allows you to set the confidence level between 0 and 1 for the forecast confidence bands. the fitting procedure will constrain the autoregressive parameters to always remain within the stable region and the moving average parameters within the invertible region. 155 8 Time Series The ARIMA command displays the Specify ARIMA Model dialog. which allows you to specify the ARIMA model you want to fit. a t are the random shocks. Smoothing models represent the evolution of a time series by the model: y t = µ t + β t + s ( t ) + a t where µ t is the time-varying mean term. or q are zero. if p and d are zero. If any of p. Use the Specify ARIMA Model dialog for the following three orders that can be specified for an ARIMA model: 1 2 3 4 The Autoregressive Order is the order (p) of the polynomial ϕ ( B ) operator. The Moving Average Order is the order (q) of the differencing operator θ ( B ) . The Differencing Order is the order (d) of the differencing operator. the corresponding letters are often dropped. The Intercept check box determines whether the intercept term µ will be part of the model.

Unconstrained allows the parameters to range freely. …. Stable Invertible constrains the parameters such that the equivalent ARIMA model is stable and invertible. The Confidence Intervals box allows you to set the confidence level for the forecast confidence bands. the level smoothing weight γ. They are α. You may not be able to specify the equivalent ARIMA model using the ARIMA command because some smoothing models intrinsically constrain the ARIMA model parameters in ways the ARIMA command will not allow. The estimators for these time-varying terms are L t smoothed level that estimates µ t T t is a smoothed trend that estimates β t S t – j for j = 0. The constraints are: Zero To One keeps the values of the smoothing weights in the range zero to one. While these parameters enter each model in a different way (or not at all). they have the common property that larger weights give more influence to recent data while smaller weights give less influence to recent data. the trend smoothing weight ϕ. Smoothing Model Dialog The Smoothing Model dialog appears in the report window when you select one of the smoothing model commands. Custom expands the dialog to allow you to set constraints on individual smoothing weights. . Each smoothing model defines a set of recursive smoothing equations that describes the evolution of these estimators. The dialog also lets you to specify what type of constraint you want to enforce on the smoothing weights during the fit. These ARIMA equivalents are used to estimate the smoothing weights and provide forecasts. Each smoothing model has an ARIMA model equivalent. The dialogs for seasonal smoothing models include a Periods Per Season box for setting the number of periods in a season. the trend damping weight δ. The smoothing equations are written in terms of model parameters called smoothing weights. s – 1 are the estimates of the s ( t ) . the seasonal smoothing weight.156 8 Time Series Smoothing Models Models without a trend have βt = 0 and nonseasonal models have s ( t ) = 0 . 1.

the value of the Trend weight is allowed to move within the range 0 to 1 while the Level weight is held at 0. The example shown here has the Level weight (α) fixed at a value of 0.3. In that case. or Unconstrained as determined by the setting of the popup menu next to the weight’s name. The moving average form of the model is ∞ yt = at + ∑ j–1 αa t – j Double (Brown) Exponential Smoothing The model for double exponential smoothing is y t = µ t + β 1 t + a t . 1. Fixed. In this case. none of the weights would be estimated from the data although forecasts and residuals would still be computed. is defined in terms of a single smoothing weight α. 1) model where ( 1 – B )y t = ( 1 – θ B )α t with θ = 1 – α .3 and the Trend weight (γ) bounded by 0 and 1. Note that you can specify all the smoothing weights in advance by using these custom constraints. 2 2 The moving average form of the model is . 1. The smoothing equation. Simple Exponential Smoothing The model for simple exponential smoothing is y t = µ t + α t . 1)(0. When you click Estimate. 1)1 model ( 1 – B ) y t = ( 1 – θ B ) a t where θ 1. The smoothing equations. This model is equivalent to an ARIMA(0. 1. defined in terms of a single smoothing weight α are L t = αy t + ( 1 – α )L t – 1 and T t = α ( L t – L t – 1 ) + ( 1 – α )T t – 1 .8 Time Series Smoothing Models 157 8 Time Series Each smoothing weight can be Bounded. 1 with θ = 1 – α . the results of the fit appear in place of the dialog. 1 = θ 2. This model is equivalent to an ARIMA(0. Lt = αyt + (1 – α)Lt-1.

2 The moving average form of the model is ∞ yt = αt + ∑ j=1 ⎛ α + αγϕ ( ϕ – 1 )⎞ ⎜ ---------------------------------------⎟ α t – j ϕ–1 ⎝ ⎠ j Seasonal Exponential Smoothing The model for seasonal exponential smoothing is y t = µ t + s ( t ) + a t . γ. 1. θ 2 = θ 2. 2) model where ( 1 – ϕ B ) ( 1 – B )y t = ( 1 – θ 1 B – θ 2 B )a t with θ 1 = 1 + ϕ – α – αγϕ and θ 2 = ( α – 1 )ϕ . 1. 1)(0. s . 1. 1 . The smoothing equations in terms of smoothing weights α and δ are L t = α ( y t – S t – s ) + ( 1 – α )L t – 1 and S t = δ ( y t – L t – s ) + ( 1 – δ )ϕS t – s This model is equivalent to a seasonal ARIMA(0. and θ 3 = – θ 1. 1 θ 2. s . The smoothing equations in terms of smoothing weights α.158 8 Time Series Smoothing Models ∞ yt = at + ∑ j=1 ( 2α + ( j – 1 )α )a t – j 2 Linear (Holt) Exponential Smoothing The model for linear exponential smoothing is y t = µ t + β t t + a t . The smoothing equations defined in terms of smoothing weights α and γ are L t = αy t + ( 1 – α ) ( L t – 1 + T t – 1 ) and T t = γ ( L t – L t – 1 ) + ( 1 – γ )T t – 1 This model is equivalent to an ARIMA(0. and ϕ are L t = αy t + ( 1 – α ) ( L t – 1 + ϕT t – 1 ) and T t = γ ( L t – L t – 1 ) + ( 1 – γ )ϕT t – 1 This model is equivalent to an ARIMA(1. 0)S model where we define θ 1 = θ 1. 2 2 The moving average form of the model is ∞ yt = at + ∑ j=1 ( α + jαγ )a t – j Damped-Trend Linear Exponential Smoothing The model for damped-trend linear exponential smoothing is y t = µ t + β t t + a t . s = θ 2. 2) model where ( 1 – B ) y t = ( 1 – θ B – θ 2 B )a t with θ = 2 – α – αγ and θ 2 = α – 1 . 2.

1. s+1)(0.8 Time Series Smoothing Models 159 8 Time Series so ( 1 – B ) ( 1 – B )y t = ( 1 – θ 1 B – θ 2 B – θ 3 B s 2 s+1 )a t with θ 1 = 1 – α . s+1 ⎛ ⎞ i ⎜1 – ( 1 – B ) ( 1 – B )y t = ⎜ ∑ θi B ⎟ at ⎟ ⎝ ⎠ i=1 2 . The smoothing equations in terms of weights α. and δ are Lt = α ( yt – St – s ) + ( 1 – α ) ( Lt – 1 + Tt – 1 ) S t = δ ( y t – L t ) + ( 1 – δ )S t – s . θ 2 = δ ( 1 – α ) . 0)s model The moving average form of the model is ∞ yt = at + ∑ j=1 Ψj at – j where ⎧ α + jαγ for jmods ≠ 0 ψ = ⎨ ⎩ α + jαγ + δ ( 1 – α ) forjmods = 0 . T t = γ ( L t – L t – 1 ) + ( 1 – γ )T t – 1 . and θ 3 = ( 1 – α ) ( δ – 1 ) . 1. and This model is equivalent to a seasonal ARIMA(0. γ. The moving average form of the model is ∞ yt = at + ∑ j=1 ψj at – j ⎧ α for jmods ≠ 0 ⎪ where ψ = ⎨ ⎪ ⎩ α + δ ( 1 – α ) forjmods = 0 Winters Method (Additive) The model for the additive version of Winter’s method is y t = µ t + β t t + s ( t ) + a t .

.

You can highlight points on any scatterplot in the scatterplot matrix. and pairwise correlations • a covariance matrix • nonparametric measures of association • simple statistics (such as mean and standard deviation) All plots and the current data table are linked. The platform begins by showing a standard correlation matrix.9 Correlations and Multivariate Techniques The Multivariate platform specializes in exploring how many variables relate to each other. When the report appears. partial. you see correlations and a scatterplot matrix. The points are highlighted on all other plots and are selected in the data table. The Multivariate platform popup menu gives the additional correlations options and other techniques for looking at multiple variables such as • a scatterplot matrix with normal density ellipses • inverse. Select Analyze > Multivariate to bring up the launch dialog. Open Solubility. or the outlier distance plot. Introduction For a short tour of the Multivariate platform. .jmp from the sample data folder.

the last four variables are similarly correlated. Select Principal Components > on Correlations from the platform’s drop-down list. shown here. In addition. indicates that there are two strong directions of variation. you can calculate several different kinds of correlations.785 and 0. This suggests that the variability in these six variables could be explained in fewer dimensions. The principal components report.162 9 Correlations and Multivariate Techniques Introduction From here. We can express 95% of the variation of these six dimensions in only two dimensions. corresponding to the eigenvalues of 4. including nonparametric correlations (se below for instances of each kind.945. . Principal Components would confirm this. Note that the first two variables (1-Octonol and Ether) are correlated with each other.

jmp sample data table. which is a matrix of correlation coefficients that summarizes the strength of the linear relationships between each pair of response (Y) variables. the diagonal inverse . If the multiple correlation is zero. the diagonal is 1/(1 – R2) for the fit of that variable by all the other variables. a standard correlation matrix and scatterplot matrix appears first. and Simple Statistics > Univariate. The platform popup menu shown here lists additional correlation options and other techniques for looking at multiple variables. shown at the top in the next figure. provides useful multivariate information. In the inverse correlation. The exceptions are in Pairwise Correlations. Many of the following examples use the Solubility. Inverse Correlations and Partial Correlations The inverse correlation matrix (Inverse Corr table). which exclude rows that are missing on either of the variables under consideration. The following sections describe the tables and plots offered by the Multivariate platform. In most of the following analysis options. which calculates its statistics column-by-column. a missing value in an observation causes the entire observation to be deleted. The diagonal elements of the matrix are a function of how closely the variable is a linear function of the other variables. Correlations Multivariate The Correlations Multivariate option gives the Correlations table. This correlation matrix only uses the observations that have nonmissing values for all variables in the analysis. without regard to missing values in other columns.9 Correlations and Multivariate Techniques Launch the Platform and Select Options 163 9 Correlations and Multivariate Statistics Launch the Platform and Select Options When you choose Analyze > Multivariate.

164 9 Correlations and Multivariate Techniques Launch the Platform and Select Options element is 1. a 95% bivariate normal density ellipse is imposed on each scatterplot. . By default. the variables are uncorrelated. If the variables are bivariate normally distributed. If the scatterplots are not showing. The cells of the scatterplot matrix are size-linked so that stretching a plot from any cell resizes all the scatterplot cells. a scatterplot for each pair of response variables displays in a matrix arrangement. as shown in Figure 9. this ellipse encloses approximately 95% of the points. The scatterplot matrix is shown by default.1. select Scatterplot Matrix from the platform popup menu. If the multiple correlation is 1. The correlation of the variables is seen by the collapsing of the ellipse along the diagonal axis. If the ellipse is fairly round and is not diagonally oriented. Scatterplot Matrix To help you visualize the correlations. then the inverse element becomes infinite and is reported missing. This is the negative of the inverse correlation matrix scaled to unit diagonal. The partial correlation table (Partial Corr table) shows the partial correlations of each pair of variables after adjusting for all the other variables.

1 Example of a Scatterplot Matrix Two clusters of correlations: The first two variables and the next four. These histograms can be specified as Horizontal or Vertical.9 Correlations and Multivariate Techniques Launch the Platform and Select Options 165 9 Correlations and Multivariate Statistics Figure 9. Show Correlations shows the correlation of each histogram in the upper left corner of each scatShow Histogram draws histograms in the diagonal of the scatterplot matrix. In addition. Density Ellipses toggles the display of the density ellipses on the scatterplots constructed by the α level that you choose. terplot. By default they are 95% ellipses. . The popup menu next on the Scatterplot Matrix title bar button lets you tailor the matrix with color and density ellipses and by setting the α-level. you can toggle the counts that label each bar with the Show Counts command.

For example. When you look for patterns in the whole scatterplot matrix with reordered columns. You can reorder the scatterplot matrix columns by dragging a diagonal (label) cell to another position on the diagonal. if you drag the cell of the column labeled 1-octanol diagonally down one cell.166 9 Correlations and Multivariate Techniques Launch the Platform and Select Options Show Correlations Show Counts Show Histogram (Horizontal in this case) Ellipse α lets you select from a submenu of standard α-levels or select the Other command and specifically set the α level for the density ellipses. . as illustrated previously by the two groups showing in Figure 9.2. you clearly see the variables cluster into groups based on their correlations.1. Ellipse Color lets you select from a palette of colors to change the color of the ellipses. Figure 9.2 Reorder Scatterplot Matrix Drag cell diagonally Covariance Matrix The Covariance Matrix command displays the covariance matrix for the analysis. the columns reorder as shown in Figure 9.

These statistics can be calculated in two ways that differ when there are missing values in the data table.3 Pairwise Correlations Report Simple Statistics The Simple Statistics submenu allows you to display simple statistics (mean. The count values differ if any pair has a missing value for either variable. standard deviation. These values match the ones that would be produced using the Distribution platform. as shown in Figure 9. .9 Correlations and Multivariate Techniques Launch the Platform and Select Options 167 9 Correlations and Multivariate Statistics Pairwise Correlations The Pairwise Correlations table lists the Pearson product-moment correlations for each pair of Y variables. Univariate Simple Statistics are calculated on each column. These are values produced by the Density Ellipse option on the Fit Y by X platform. regardless of values in other col- umns. and so on) for each column. using all available values. Figure 9. The Pairwise Correlations report also shows significance probabilities and compares the correlations with a bar chart.3.

The Nonparametric submenu offers these three nonparametric measures: Spearman’s Rho is a correlation coefficient computed on the ranks of the data values instead of on the values themselves. There is a correction for tied pairs (pairs of observations that have equal values of X or equal values of Y). A pair is concordant if the observation with the larger value of X also has the larger value of Y. for computational information. . These are the statistics that are used by the Multivariate platform to calculate correlations. A pair is discordant if the observation with the larger value of X has the smaller value of Y. Nonparametric Correlations When you select Nonparametric Correlations from the platform popup menu. Hoeffding’s D is a statistical scale that ranges from –0. with large positive values indicating dependence. the Nonparametric Measures of Association table is shown. See “Computations and Statistical Details. The Nonparametric Measures of Association report also shows significance probabilities for all measures and compares them with a bar chart similar to the one in Figure 9. 169. and detects more general departures from independence.The statistic approximates a weighted sum over observations of chi-square statistics for two-by-two classification tables.168 9 Correlations and Multivariate Techniques Launch the Platform and Select Options Multivariate Simple Statistics are calculated by dropping any row that has a missing value for any column in the analysis.” p.3.5 to 1. Kendall’s Tau is based on the number of concordant and discordant pairs of observations.

Average ranks are used in case of ties. n is the number of observations. If there is no linear relationship. Nonparametric Measures of Association For the Spearman. ∑ (x – x) ∑ (y – y) 2 2 If there is an exact linear relationship between two variables. it is denoted as r and computed as ∑ (x – x)(y – y) r = --------------------------------------------------------. Otherwise they are discordant. the correlation is 1 or –1. depending on whether the variables are positively or negatively related. Kendall’s τb Coefficients Kendall’s τb coefficients are based on the number of concordant and discordant pairs. If a weight variable is specified. A pair of rows for two variables is concordant if they agree in which variable is greater. the correlation tends toward zero. Kendall. . or Hoeffding correlations. and Kendall’s τb ranges from –1 to 1. The formula ∑ sgn ( x i – xj ) sgn ( y i – y j ) i<j τ b = --------------------------------------------------------------( T0 – T1 ) ( T0 – T2 ) computes Kendall’s τb where T0 = ( n ( n – 1 ) ) ⁄ 2 . For response variables X and Y..9 Correlations and Multivariate Techniques Computations and Statistical Details 169 9 Correlations and Multivariate Statistics Computations and Statistical Details Pearson Product-Moment Correlation The Pearson product-moment correlation coefficient measures the strength of the linear relationship between two variables. it is ignored. Computations are then performed on the ranks of the data values. or tied. and ∑ ( ( ui ) ( ui – 1 ) ) ⁄ 2 . T1 = T2 = ∑ ( ( ti ) ( ti – 1 ) ) ⁄ 2 . 0 if z = 0 . and –1 if z < 0 . Note that sgn ( z ) is equal to 1 if z > 0 . the data are first ranked. Spearman’s ρ (rho) Coefficients Spearman’s ρ correlation coefficient is computed on the ranks of the data using the formula for the Pearson’s correlation previously described. The ti (the ui) are the number of tied x (respectively y) values in the ith group of tied x (respectively y) values.

• The observations are then re-ranked according to the values of the second variable.5 and 1. . If a weight variable is specified. The diagonal elements of the inverse correlation matrix. the diagonal element is denoted r and is computed as r ii 1 = VIF i = --------------. a large rii indicates that the ith variable is highly correlated with any number of the other variables. 2 1 – Ri where Ri2 is the coefficient of variation from the model regressing the ith explanatory variable on the other explanatory variables. A point tied on both x and y contributes 1/4 to Qi.170 9 Correlations and Multivariate Techniques Computations and Statistical Details Computations proceed in the following way: • Observations are ranked in order according to the value of the first variable. D2 = Si ( Ri – 1 ) ( Si – 1 ) ( Si – 2 ) D3 = ( Ri – 1 ) ( Si – 2 ) ( Qi – 1 ) The Ri and Si are ranks of the x and y values. Inverse Correlation Matrix The inverse correlation matrix provides useful multivariate information. Hoeffding’s D Statistic The formula for Hoeffding’s D (1948) is ( n – 2 ) ( n – 3 )D 1 + D 2 – 2 ( n – 2 )D 3 D = 30 ⎛ ----------------------------------------------------------------------------------------⎞ where ⎝ n( n – 1)( n – 2 )(n – 3)( n – 4) ⎠ D1 = Si ( Qi – 1 ) ( Qi – 2 ) . contributes 1/2 to Qi if the other value is less than the corresponding value for the ith point. When there are no ties among observations. sometimes called the variance inflation factors (VIF). if the correlation matrix is ii denoted R and the inverse correlation matrix is denoted R-1. and the Qi (sometimes called bivariate ranks) are one plus the number of points that have both x and y values less than the ith points. i.. Note that the definition of R2 changes for no-intercept models. JMP uses the R2 from the uncorrected Sum of Squares. from the zero model. • The number of interchanges of the first variable is used to compute Kendall’s τb.e. with 1 indicating complete dependence. are a function of how closely the variable is a linear function of the other variables. but not on both. the D statistic has values between –0. For no-intercept and hidden-intercept models. from the mean model. Specifically. rather than the corrected sum of squares. it is ignored. A point that is tied on its x value or y value. Thus.

Using the Chart Platform To begin with. Open the data file Denim. Exporting. presentation managers. the Chart. Click on the Statistics button and select Mean from the drop-down list. 21 in the “The Distribution Platform” chapter. X Levels button. In addition to techniques for accomplishing these tasks. Select Graph > Chart from the menu bar. or web authoring tools. Click the Categories. Details about this data are found in “About the Data.10 Importing. some of the variables’ statistics are charted. Graphs and charts are usually used to summarize results. so this chapter also describes copying and pasting results from JMP Student Edition into word processors. shown later in this chapter. Select Starch Content (%) from the list of columns. Click on the Statistics button and select Max. This brings up a dialog like the one in Figure 10. produce a bar chart showing the mean starch content and the maximum thread wear for each of the wash methods. Select Method from the list of variables.” p. Select Thread Wear Measured from the list of columns. In this introduction. and Overlay Plot platforms are documented.7 on page 181.jmp. Therefore. Introduction After starting JMP Student Edition. No statistics package is useful unless its results can be communicated to others. and Charting Data This chapter shows you how to use JMP to interact with the rest of the world. . it illustrates ways of importing data from various formats into JMP for analysis.

A bar chart like the one in Figure 10. . Please note that merely because JMP Student Edition allows for a great many chart options to be changed. do not feel that they all must be. and Charting Data Introduction Click OK. Exporting. With Mean(Starch Content (%)) still selected: From the drop-down list in the title bar next to Chart. Right-Click on Mean(Starch Content (%)) in the legend to the right of the plot. and choose a color from the resulting palette. select Overlay Color. Figure 10. The bars for Mean(Starch Content (%)) change to a needle chart.1 appears.1 Bar Chart The Chart platform makes it easy to format this chart. The colors of the bars for starch content change to the selected color. select Y Options > Needle Plot. Simple charts are almost always the most effective.2. or even change to another chart type. as shown in Figure 10.172 10 Importing.

X Level role.10 Importing. Using the Overlay Plot Platform In this example. and Method to the Categories. Exporting. The entire chart changes to a line plot. charting options can be applied to individual levels as well as entire charts. . • A plot of starch content versus thread wear for each level of wash method. Therefore. From the drop-down list in the title bar next to Chart. Assign Thread Wear Measured and Starch Content (%) to the Y role. two charts are produced: • A plot of starch content and thread wear against wash method. select Y Options > Line Chart. Select Graph > Overlay Plot from the menu bar. and Charting Data Introduction 173 10 Graphs Figure 10.2 Half Needle Chart Click somewhere in the blank area above the legend to deselect all columns.

Exporting. select Y Options > Connect Points. From the drop-down list in the title bar next to Overlay Plot.3.3 Final Overlay Plot . To connect the points and produce the plot shown in Figure 10. and Charting Data Introduction Click OK. Figure 10. This produces a plot with both variables plotted on the y-axis.174 10 Importing.

. JMP looks at the type of file given by the 3-character extension appended to its file name and opens it accordingly. text files with any column delimiter. Select Method from the list of columns and click the By button. If *. The graphs shown here have been reduced in size by holding down the control or key and dragging on the corner of one graph. SAS transport files. and Excel files.* is chosen from the Files of Type menu. and flat-file database files. The dialog shows the table notes. JMP Student Edition directly reads JMP data tables. one for each level of the Method variable. The example in Figure 10. Windows The Files of Type selection filters the list of files displayed in the dialog.4 shows an Open Data File dialog when the Files of Type drop-down list is changed from the default to JMP data table. Again select Graph > Overlay Plot from the menu bar. Select Size of Load from the list of columns and click the X button. They all resize. This produces three separate plots. JMP Student Edition gives additional information about the file in the File Open dialog. JMP Script files. Exporting.10 Importing. and Charting Data Importing Data 175 10 Graphs To produce the plot of starch content versus load size for each level of wash method. Importing Data The File > Open command displays a specialized open file dialog used to locate a file to open and tell JMP Student Edition the file format of the incoming file. Select Starch Content (%) from the list of columns and click the Y button. if they exist. The Open command then reads the file into a JMP Student Edition data table. This works as long as the file has the structure indicated by its name. JMP journal files. If applicable.

176 10 Importing. . the Open As menu appears in the window (see the next section for details on text importing). If you select a text document to open. Exporting. and Charting Data Importing Data Figure 10.4 The Open Data File Dialog to Read a JMP table Macintosh The Open window allows you to open any type of file into JMP.

• Data (Using Preferences) opens the file and uses default rules (set in the preferences panel) to interpret end of field and end of line delimiters to create a JMP data table All Platforms JMP Student Edition attempts to discern the arrangement of text data. • Data (Using Preview) presents a dialog similar to the one shown in Figure 10.10 Importing. and shows a preview of the resulting data table. Then choose one of the three text commands: • Text opens the file in a simple text editing window • Data (Best Guess) opens the data in a format that JMP thinks is appropriate based on the contents of the file. that lets you designate delimiters and other information. This displays the Open As menu in the lower part of the dialog. This is adequate for a rectangular text file with no missing fields. depending on the operating system you are working with. select a text file to open. Macintosh On the Macintosh. Choose Text from the Files of Type drop-down list to import the data based on your current preference settings. and Charting Data Importing Data 177 10 Graphs Importing Text Files You import text be doing one of three things. You then see the dialog shown in Figure 10. Exporting. a consistent field delimiter.5. Windows Under Microsoft Windows. and an end-of-line delimiter . Select the Data With Preview checkbox to see a preview of an incoming text file.5 for specification of delimiters and other import details.

178 10 Importing. including spaces embedded within the quotes. The initial settings in the delimited import dialog are taken from the current Preferences file. and the first two rows of data. Column 2. are ignored and treated as part of a text string. and so on. data types. and Charting Data Importing Data Note: If double-quotes are encountered when importing text data. and height. Figure 10. Other text delimiters. JMP changes the delimiter rules to look for an end double-quote. the option to Strip enclosing quotes. the Name fields are called Column 1. preferences are set that indicate the incoming table contains column headers to be used as the column names. In Figure 10.5. One or more end-of-field delimiters. the column names are name. The dialog also shows the column names. sex. and the ability to set how many rows and columns will be read are additional options presented in the dialog. Exporting. If no column names are indicated. age. end-of-line delimiters.5 Import Text File .

Excel worksheet and workbooks are imported simply by choosing Excel Files from the Files of Type (Windows) or Enable (Macintosh) lists as shown in Figure 10. JMP Student Edition opens all the worksheets in the workbook. click the Try Fixed Width button to specify the separations of each column to be imported. each variable uses a set number of columns in the text file).6 Excel Open Choice JMP Student Edition can also import Excel workbooks that contain several data tables inside them.10 Importing. Importing Microsoft Excel Files JMP Student Edition has the ability to directly import Microsoft Excel worksheets and workbooks under Macintosh and Microsoft Windows. and Charting Data Importing Data 179 10 Graphs Fixed-Width Text If your data is fixed width (that is.6. .xls) from the Open dialog and double-clicking on the desired workbook. Exporting. After selecting Excel Files(*. Figure 10.

the plots in the Chart platform do not “bristle with interactivity” to the degree that other platforms do. Don’t worry too much about getting the orientation and chart type correct initially — they can be changed after the chart has been generated. In the word processor.7 Chart Launch Dialog Select the data column in the column list. To copy results into another program. pie. through standard cut and paste facilities.8) and select the statistic to be charted. In essence. Select the Selection tool ( ). To plot descriptions of data. complete the following steps after bringing up the Chart launch dialog (Figure 10.180 10 Importing. The Chart Platform The Chart platform computes and plots data and statistics about the data.7) Figure 10. select Edit > Paste. needle. It is used to report the results from other explorations. and Charting Data Results from Platforms Results from Platforms The results from JMP Student Edition’s platforms can be cut and pasted into other programs using the system’s clipboard. From the section labeled Options. For that reason. line. the Chart platform is not intended as an exploratory device. Unlike the statistical platforms in JMP. what is going to be reported should be known before the Chart platform is used. Click the Statistics button (revealing the menu shown in Figure 10. Hold down the Shift key and click on each part of the report that needs to be copied. use the drop-down list to select the orientation of the chart (vertical or horizontal) and the type of chart (bar. Exporting. . Note that axes frequently need to be selected in addition to the graphs they accompany. Select Edit > Copy. or point chart) to be generated.

Exporting. Platform options affect all charts in the report window. . Weight. Click OK. or a Grouping variable to generate separate graphs for each level of the column. Level column to be plotted on the horizontal axis. either in separate windows or overlaid in the same window. and By options work as in other platforms. However.9 Chart Example Many of the options on the launch dialog can be changed using the platform popup menu.8 Plottable Statistics Figure 10. some options can be applied to individual charts. X Levels is shown in Figure 10. include an X. Figure 10. plotting the mean of Starch Content (%) using Method as Categories. Freq.9. An example bar chart. and Charting Data The Chart Platform 181 10 Graphs Optionally.10 Importing.

• Pen Style allows the choice of a line style from the palette shown in Figure 10. the commands apply to all charts. . When this menu is accessed through a chart legend. The default chart is a bar chart. leaving a point chart. Exporting.10 Chart Options • Bar Chart displays a bar for each level of the chart variables.182 10 Importing. without connecting them. • Overlay Pattern assigns bars a fill pattern to identify them in overlaid charts. and Charting Data The Chart Platform Single-Chart Options To apply the following options to individual charts. without any legends highlighted. • Connect Points toggles the line connecting points on and off. • Needle Chart replaces each bar with a line drawn from the axis to the plotted value. • Std Error Bars overlays needles at plus or minus one standard error from the mean. • Overlay Color assigns color to a variable to identify it when overlaid with other charts. • Overlay Marker assigns plot points a marker to identify them in overlaid charts. right-click on a chart legend to see the popup menu shown in Figure 10. Choose the Show Points option to show or hide the points. • Line Chart replaces a bar chart with a line chart and connects each point with a straight line. Figure 10.10. • Show Points toggles the point markers on a line or needle chart off and on. the commands apply only to the individual chart.10. When accessed through the platform popup menu. • Point Chart shows only the plot points.

Exporting. Y Scale shows the standard axis scale dialog. • Marker Drawing Mode lets you adjust the way markers are drawn. • Size/Scale allows changes in axes and plot frames. • Marker Size allows selection of the marker from a palette of six point sizes that range from dot to very large. • Border lets you turn border lines on or off. and Charting Data The Chart Platform 183 10 Graphs Frame Options Frame options allow control of the plot frame’s elements as a whole. Fast mode is useful for graphs with large number of points. • DisplayBox lists the commands to conveniently select and deselect the plot area. Outlined mode is often used in presentations. and redraw the plot. The axis scale dialog allows the specification of the maximum. and draws tailored reference lines or a grid on the plot.11 Frame Options • Background Color colors the background of the plot with the color chosen from the JMP Student Edition color palette. minimum. so that they re-draw quickly after adjustments. and tick marks for the axis.10 Importing. increment. X Scale is only active on platforms where the X axis is numeric (the X axis for the Charts platform is categorical). The marker size applies to the plot point and its associated rows. • Add Graphics Script displays a text entry box to enter JSL commands. . which can also be shown by double-clicking in the Y axis. See the JMP Scripting Language Guide for documentation of JSL commands. usually to tailor the graphics output in ways not provided with commands and options. The Preferred Size is set in the Preferences. The Frame Size command displays a dialog to enter the exact pixel size for the plot frame. Figure 10.

the chart shown in Figure 10. • Overlay displays a single overlaid chart when the chart has more than one Y (statistics) variable. Separate Axes is only enabled on the menu when there are multiple Y variables that are not overlaid. with one as a bar chart and one as a needle chart. and pattern palettes options described in “Level Options. the platform shows duplicate axis notation for each chart. • Horizontal changes a vertical bar chart or a pie chart to a horizontal bar chart • Pie changes a horizontal or vertical chart into a pie chart. Exporting. Right-clicking on a highlighted legend shows the commands for the Colors and Markers palettes. (See “Script Submenu. which can be overlaid. • Script has a submenu of commands available to all platforms that redo the analysis. 66. Figure 10. the axis notation only shows for the last chart displayed if the charts are not overlaid.12 Different Chart Types • Vertical changes a horizontal bar chart or a pie chart to a vertical bar chart.” p. 182. and Charting Data The Chart Platform Level Options Click on a value in the legend to highlight it. These options affect highlighted bars. These options affect the variable whose legend is highlighted. marker.10. 184. The commands affect only the highlighted level and its associated rows in the data table.12 has two overlaid variables.184 10 Importing. When Overlay is not checked. Each chart can have its own chart type. or save the JSL commands for the analysis to a window or a file. For example. • Y Options accesses the options described in “Single-Chart Options. By default. their drop-down menu has the following options. • Level Options accesses the color. Platform Options After the charts appear. • Separate Axes duplicate the axis notation for each chart when there are multiple charts.” p. which are identical to their Overlay equivalents in Figure 10.) .” p.

the plots show separately. • Connect Through Missing connects adjacent points in the plot. When Overlay option is turned off. and Charting Data The Overlay Plot Platform 185 10 Graphs The Overlay Plot Platform The Overlay Plot platform overlays numeric Y variables with a single numeric or character X variable. regardless if there are missing values between them. If one or more plot legends are highlighted. Each Y variable’s plot can be modified by right-clicking on the variable name to bring up a menu. The section “Single-Plot Options. • Ungroup Plots creates a separate chart for each level of a grouping variable • Arrange Plots allows you to specify the number of plots in each row • Script has a submenu of commands available to all platforms that redo the analysis or save the JSL commands for the analysis to a window or a file. When Separate Axes is selected. • Uniform Y Scale makes the Y scales the same on grouped plots. There is also a single-plot options menu for each Y variable. . on the last plot in the window. • Y Options has a submenu of options that apply to all variables and plots in the report window when selected from the main platform window. describes Y options for each individual variable. When one of these options is selected at the platform level. • Separate Axes lets the X axis scale values be printed only once. (See “Script Submenu. Exporting. Connect Points can be activated when Show Points is not.” p. described below. it affects all plots in the report if no legends are highlighted. 186. the values of the X variable appear in ascending order.” p. the X axes for other Y variables show tick marks but show no scale values. Optionally. • Connect Points is a toggle that alternately connects the points with lines. showing its name and symbol.10 Importing.) Single-Plot Options Each Y variable is labeled beneath the plot. The individual plot options are the same as those in the Y Options submenu at the platform level. and Range option cannot be selected at the same time. allowing for greater flexibility in plot displays. Plots initially appear overlaid with the Connect Points option in effect. • Overlay overlays plots for all columns assigned the Y role. Platform Options Platform options affect every plot in the report window. • Needle draws a vertical line from each point to the X axis. which show when the Y variable legend beneath the plot is right-clicked. the options affects all those plots. • Show Points alternately shows or hides points. 66. Note: The Needle option. The Overlay Plot platform has platform plotting options accessed by the popup menu icon on the Overlay Plot title bar. • Range Plot connects the lowest and highest points at each X value with a line with bars at each end. with points plotted and connected in that order.

186

10 Importing, Exporting, and Charting Data The Overlay Plot Platform

• Step joins the position of the points with a discrete step by drawing a straight horizontal line from each point to the X value of the following point, and then a straight vertical line to that point. Step, without showing points, is illustrated in Figure 10.13. Note: Only one of Connect Points, Needle, and Step can be chosen at a time. Figure 10.13 Overlay Step Plot

• Function Plot plots a formula (stored in the Y column) as a smooth curve. To use this function, store a formula in the Y column that is a function of a single X column. For example, the following column contains a formula involving the sine function. When used in an overlay plot, the function is plotted as a curve rather than individual points.

Note: Overlay Plot normally assumes you want a function plot when the Y column contains a formula. However, formulas that contain random number functions are more frequently used with simulations, where function plotting is not often wanted. Therefore, the Function Plot option is off (by default) when a random number function is present, but on for all other functions. • Connect Color displays the standard JMP color palette (see Figure 10.10) for assigning colors to lines that connect points. • Overlay Marker assigns markers to plotted points using the standard JMP marker palette (see Figure 10.10). • Overlay Marker Color lets you select the color of the overlay marker. • Line Style and Line Width let you adjust the appearance of the lines drawn on the plot.

10 Importing, Exporting, and Charting Data The Overlay Plot Platform

187 10 Graphs

11
Full Factorial Designs
Designing Experiments
A full factorial design contains all possible combinations of a set of factors. This is the most conservative design approach, but it is also the most costly in experimental resources. The full factorial designer supports both continuous factors and categorical factors with up to nine levels. In full factorial designs, you perform an experimental run at every combination of the factor levels. The sample size is the product of the numbers of levels of the factors. For example, a factorial experiment with a two-level factor, a three-level factor, and a four-level factor has 2 x 3 x 4 = 24 runs. Factorial designs with only two-level factors have a sample size that is a power of two (specifically 2f where f is the number of factors). When there are three factors, the factorial design points are at the vertices of a cube as shown in the diagram below. For more factors, the design points are the vertices of a hypercube. Full factorial designs are the most conservative of all design types. There is little scope for ambiguity when you are willing to try all combinations of the factor settings. Unfortunately, the sample size grows exponentially in the number of factors, so full factorial designs are too expensive to run for most practical purposes.

Introduction
The following example, adapted from Meyer et al. (1996) and Box, Hunter, and Hunter (1978), shows a five-factor reactor example. To follow along with this example, open the folder Sample Data that was installed when you installed JMP. Within this folder, open Design Experiment > Reactor 32 Runs.jmp. Suppose you have used the screening designer to investigate the effects of five factors on the percent reaction of a chemical process. The factors (Feed Rate, Catalyst, Stir Rate, Temperature, and Concentration) are all two-level continuous factors. 1 Select DOE > Full Factorial Design. 2 Click the red triangle icon on the Full Factorial Design title bar and select Load Responses. 3 Open Reactor Response.jmp to load the responses by opening the Sample Data folder that was installed with JMP. In the Sample Data folder, open Design Experiment > Reactor
Response.jmp.

4 Click the red triangle icon on the Full Factorial Design title bar and select Load Factors. 5 Open Reactor Factors.jmp to load the responses by opening the Sample Data folder that was installed with JMP. In the Sample Data folder, open Design Experiment > Reactor Factors.jmp.

190

11 Full Factorial Designs Introduction

The completed dialog is shown in Figure 11.1. Figure 11.1 Full-Factorial Example Response and Factors Panels

6 Click Continue to see Output Options panel. A full factorial design includes runs for all combinations of high and low factors for the five variables, giving 32 runs. 7 Click Make Table. The design data table (Figure 11.3) contains a run for every combination of high and low values for the five variables. Since there are five variables, there are 25=32 runs. This covers all combinations of a five factors with two levels each. Initially, the table has an empty Y column named Percent Reacted for entering response values when the experiment is complete. The values in your table may be different from those shown below. Figure 11.2 25 Factorial Reactor Data

To see the completed experiment and continue following this example, open the folder Sample Data that was installed when you installed JMP. Within this folder, open Design Experiment > Reactor 32
Runs.jmp.

2 Highlight Percent Reacted and click Y. Columns. Then click OK.11 Full Factorial Designs Introduction 191 11 DOE: Full Factorial Designs Figure 11. Figure 11.1. and brings up the Stepwise control panel.4. 1 Click the red triangle icon next to the Fit Model script and select Run Script. The results are shown in Figure 11.05. The data table has a script stored with it that automatically defines an analysis of the model that includes main effects and all two-factor interactions. 3 The probability to remove a factor (Prob to Leave) should be 0.4 Distribution of Response Variable for Reactor Data Start the formal analysis with a stepwise regression. .3 Reactor 32 Runs. 3 Click the red triangle icon on the Percent Reacted title bar and select Normal Quantile Plot. 2 The probability to enter a factor (Prob to Enter) in the model should be 0.jmp Begin the analysis with a quick look at the data before fitting the factorial model. 1 Select Analyze > Distribution.

6.7 Model After Mixed Stepwise Regression 7 Click the Make Model button.5 Run JSL Script for Stepwise Regression 4 A useful way to use the Stepwise platform is to check all the main effects in the Current Estimates table.6 Starting Model For Stepwise Process The mixed stepwise procedure removes insignificant main effects and adds important interactions.192 11 Full Factorial Designs Introduction Figure 11. The Model Specification window that appears is automatically set . To do this. 5 Check the boxes for the main effects of the factors as shown in Figure 11.7. The end result is shown in Figure 11. Note that the Feed Rate and Stir Rate factors are no longer in the model. 6 Click Go. Figure 11. make sure the menu beside Direction specifies Mixed. Figure 11.

The size of the random noise as measured by the RMSE is only 3.10. All effects selected by the stepwise process are highly significant. The figure on the right in Figure 11.9 Actual by Predicted Plot and Prediction Model Estimates The factor Prediction Profiler also gives you a way to compare the factors and find optimal settings.8). Figure 11.9 shows a table of model coefficients and their standard errors (labeled Parameter Estimates).8 Fitting a Prediction Model 8 Click Run Model to see the analysis for a candidate prediction model (Figure 11. The predicted model covers a range of predictions from 40% to 95% reacted.3311%. 1 Open the Prediction Profiler by clicking the red triangle on the Response Percent Reacted title bar and selecting Factor Profiling > Profiler. which is more than an order of magnitude smaller than the range of predictions. This is strong evidence that the model has good predictive capability. The figure on the left in Figure 11.9). Figure 11. as shown in Figure 11. .11 Full Factorial Designs Introduction 193 11 DOE: Full Factorial Designs up with the appropriate effects (Figure 11.9 shows the actual by predicted plot for the model.

Figure 11.11 shows the profiler’s initial display.10 Selecting the Profiler Figure 11.12 Viewing the Maximum Desirability The plot of Desirability versus Percent Reacted shows that the goal is to maximize Percent Reacted. The reaction is unfeasible economically unless the Percent Reacted is above 90%. Figure 11.11 Viewing the Profiler 2 Click the red triangle on the Prediction Profiler title bar and select Maximize Desirability to see the profiler in Figure 11.194 11 Full Factorial Designs Introduction Figure 11. . therefore the Desirability for values less than 90% decreases and finally becomes zero.12. Desirability increases linearly as the Percent Reacted increases.

11 Full Factorial Designs Creating a Factorial Design

195 11 DOE: Full Factorial Designs

The maximum Desirability is 0.945 when Catalyst and Temperature are at their highest settings and Concentration is at its lowest setting. Percent Reacted increases from 65.5 at the center of the factor ranges to 95.875 at the most desirable setting.

Creating a Factorial Design
To start a full factorial design, select DOE > Full Factorial Design, or click the Full Factorial Design button on the JMP Starter DOE page. Then, follow the steps below.

Entering Responses and Factors
To enter responses, follow the steps in Figure 11.13. Then, enter factors as shown in Figure 11.14 Figure 11.13 Entering Responses Click to enter lower and upper limits and importance weights. 4 To enter one 1 response at a time, click then select a goal type: Maximize, Match Target, Minimize, or None.

2 Double-click to edit the response name, if desired.

3 Click to change the response goal, if desired.

Tip: To quickly enter multiple responses, click the N Responses button and enter the

number of responses you want.

196

11 Full Factorial Designs Creating a Factorial Design

Figure 11.14 Entering Factors in a Full Factorial Design To enter factors, click either the Continuous button or the Categorical button and select a factor type, level 2 - 9.

Double-click to edit the factor name.

Click to enter values or change the level names. To remove a level, click it, press the delete key on the keyboard, then press the Return or Enter key on the keyboard.

When you finish adding factors, click Continue.

Selecting Output Options
Use the Output Options panel to specify how you want the output data table to appear:

• Run Order—Lets you designate the order you want the runs to appear in the data table when it is created. Choices are:
Keep the Same—the rows (runs) in the output table appear as they do in the Design panel. Sort Left to Right—the rows (runs) in the output table appear sorted from left to right. Randomize—the rows (runs) in the output table appear in a random order. Sort Right to Left—the rows (runs) in the output table appear sorted from right to left.

• Number of Center Points—Specifies additional runs placed at the center of each continuous factor’s range. • Number of Replicates—Specify the number of times to replicate the entire design, including centerpoints. Type the number of times you want to replicate the design in the associated text box. One replicate doubles the number of runs.

11 Full Factorial Designs Creating a Factorial Design

197 11 DOE: Full Factorial Designs

Making the Table
When you click Make Table, the table shown in Figure 11.15 appears. Figure 11.15 Factorial Design Table The name of the table is the design type that generated it. values in the Pattern column describe the run each row represents for continuous factors, a plus sign represents high levels for continuous factors, a minus sign represents low levels level numbers represent values of categorical factors

This script allows you to easily fit a model using the values in the design table.

12
Screening Designs
Designing Experiments
Screening designs are arguably the most popular designs for industrial experimentation. They examine many factors to see which have the greatest effect on the results of a process. Compared to other design methods, screening designs require fewer experimental runs, which is why they are cheap. Thus, they are attractive because they are a cheap and efficient way to begin improving a process. Often screening designs are a prelude to further experiments. It is wise to spend only about a quarter of your resource budget on an initial screening experiment. You can then use the results to guide further study. The efficiency of screening designs depends on the critical assumption of effect sparsity. Effect sparsity results because real-world processes usually have only a few driving factors; other factors are relatively unimportant. To understand the importance of effect sparsity, you can contrast screening designs to full factorial designs: • Full factorial designs consist of all combinations of the levels of the factors. The number of runs is the product of the factor levels. For example, a factorial experiment with a two-level factor, a three-level factor, and a four-level factor has 2 x 3 x 4 = 24 runs. • By contrast, screening designs reduce the number of runs by restricting the factors to two (or three) levels and by performing only a fraction of the full factorial design. Each factor in a screening design is usually set at two levels to economize on the number of runs needed, and response measurements are taken for only a fraction of the possible combinations of levels. In the case described above, you can restrict the factors to two levels, which yield 2 x 2 x 2 = 8 runs. Further, by doing half of these eight combinations you can still assess the separate effects of the three factors. So the screening approach reduces the 24-run experiment to four runs. Of course, there is a price for this reduction. This chapter discusses the screening approach in detail, showing both pros and cons. It also describes how to use JMP’s screening designer, which supplies a list of popular screening designs for two or more factors. These factors can be continuous or categorical, with two or three levels. The list of screening designs you can use includes designs that group the experimental runs into blocks of equal sizes where the size is a power of two.

Introduction
Suppose an engineer wants to investigate a process that uses an electron beam welding machine to join two parts. The engineer fits the two parts into a welding fixture that holds them snugly together. A voltage applied to a beam generator creates a stream of electrons that heats the two parts, causing them

Figure 12. . who is the technician operating the welding machine • Rotation Speed.22 as the upper limit (maximum acceptable value). the engineer cuts the part in half. and Current.17 inches To begin this example. 2 The default goal for the single default response is Maximize.17 inches with a lower bound of 0. Speed. select DOE > Screening Design from the main menu. but the goal of this process is to get a target value of 0.1. as shown in Figure 12. Click the Goal text edit area and choose Match Target. The Length of this fused area is the depth of penetration of the weld. 6 Double-click the factor names and rename them Operator. as shown in Figure 12.).17 inches.12 and an upper bound of 0. For this study. The ideal depth of the fused region is 0. Then click the Upper Limit text edit area and enter 0. This depth of penetration is the response for the study. which is the speed at which the part rotates under the beam • Beam Current.1 Screening Design Response With Match Target Goal 3 Click the Lower Limit text edit area and enter 0. the engineer wants to explore the following three inputs. The goals of the study are to: • find which factors affect the depth of the weld • quantify those effects • find specific factor settings that predict a weld depth of 0. 7 Set high and low values for Speed to 3 and 5 rpm. This example has one categorical factor (Operator) and two continuous factors (Speed and Current). Set high and low values for Current to 150 and 165 amps. The engineer wants to study the welding process to determine the best settings for the beam generator to produce the desired depth in the fused region. which is a current that affects the intensity of the beam After each processing run. and assign Mary and John as values for the categorical factor called Operator. Change the default response as follows: 1 Double-click the response name and change it to Depth (In.2. Note that in the Responses panel.22. This reveals an area where the two parts have fused. 4 Add the categorical factor by clicking the Add button beside 2-Level Categorical.12 as the lower limit (minimum acceptable value). which are the factors for the study: • Operator. 5 Add two continuous factors by typing 2 in the Continuous box and clicking the associated Add button.200 12 Screening Designs Introduction to fuse. there is a single default response called Y.

Then. The table uses the names for responses. and levels you specified. .4 The Design Data Table Creating a Screening Design To start a screening design.3 List of Screening Designs for Two Continuous and One Categorical Factors When the design details are complete.3.4 appears. The Pattern variable shows the coded design runs. factors. follow the steps below. 10 View the table produced in this example by selecting Help (View on the Macintosh) > Sample Data Directory > Design Experiment > DOE Example 1.jmp. Figure 12. or click the Screening Design button on the JMP Starter DOE page. Figure 12.12 Screening Designs Creating a Screening Design 201 12 DOE: Screening Designs Figure 12. and then click Continue.2 Screening Design with Two Continuous and One Categorical Factor 8 Click Continue. 9 Select Full Factorial in the list of designs. click Make Table to create a JMP table that contains the specified design. as shown in Figure 12. The table in Figure 12. select DOE > Screening Design.

The default target value is assumed to be midway between the lower and upper limits. click the N Responses button and enter the number of responses you want. follow the steps in Figure 12. or that there is no goal. Match Target. 4 To enter one 1 response at a time. or None. click then select a goal type: Maximize. to match a specific value. Note: If your target range is not symmetric around the target value.5) and the lower and upper limits (step 4 in Figure 12. • The Match Target goal supports the objective when the best value for a responses is a specific target value. A goal of Maximize supports this objective. such as with part dimensions. 2 Double-click to edit the response name. Tip: To quickly enter multiple responses. In the data table. open the response’s Column Info dialog by double-clicking the column name.5 Entering Responses Click to enter lower and upper limits and importance weights. if desired. Figure 12. if desired.202 12 Screening Designs Creating a Screening Design Entering Responses To enter responses. and enter an asymmetric target value. such as when the response is impurity or defects. you can alter the default target after you make a table from the design. 3 Click to change the response goal. the best value is usually the largest possible. you can tell JMP that your goal is to obtain the maximum or minimum value possible. .5): • For responses such as strength or yield. Minimize. Specifying Goal Types and Lower and Upper Limits When entering responses. The following description explains the relationship between the goal type (step 3 in Figure 12.5. • The Minimize goal supports an objective of having the best objective be the smallest value.

With two responses you can give greater weight to one response by assigning it a higher importance value. then press the Return or Enter key on your keyboard.12 Screening Designs Creating a Screening Design 203 12 DOE: Screening Designs Understanding Importance Weights When computing overall desirability. Click to enter factor values. If there is only one response. After your responses and factors are entered. Double-click to edit the factor name. click Continue. click it. JMP uses the value you enter as the importance weight (step 4 in Figure 12. This process is described below. Highlight the type of screening design you would like to use and click Continue. the order is the sorting sequence. Entering Factors After entering responses. you can set a continuous factor to any value between the lower and upper limits you supply. If the values are character. press the delete key on your keyboard. the order is the numeric magnitude. you can enter different types of factors in the model. and Response Surface Design. operator. Mixture Design. . To remove a level. type the number of factors and click Add. Below is a description of factor types from which you can choose when creating screening designs: • Continuous Continuous factors have numeric data types only. when designing experiments. In theory. and gender. Highlight the factor and click the Remove Selected button to remove a factor in the list. Entering factors is the same in Screening Design. in Figure 12. • Categorical Categorical factors (either numerical or categorical data types) have no implied order. Examples of categorical factors are machine. The Factors panel’s appearance depends on the design you select.6. enter factors. Figure 12.6 Entering Factors To enter factors. The settings of a categorical factor are discrete and have no intrinsic order. If the values are numbers. Space Filling Design. Types of Factors In general. then specifying importance is unnecessary.5) as the weight of each response. Choosing a Design The list of screening designs you can use includes designs that group the experimental runs into blocks of equal sizes where the size is a power of two.

The trade-off in screening designs is between the number of runs and the resolution of the design. If you remove an effect in the analysis.204 12 Screening Designs Creating a Screening Design Figure 12. If price is no object. few experimenters worry about interactions higher than two-way interactions. This can be expensive if the number of factors is greater than 3 or 4. Most empirical modeling involves first. Their p-values change slightly. you typically only run a fraction of all possible levels. Experiments can therefore be classified by resolution number into three groups: • Resolution = 3 means that main effects are confounded with one or more two-way interactions. An effect is nonestimable when it is confounded with another effect. . The fraction of the full factorial is 2-p. This causes some of the higher-order effects in a model to become nonestimable. These higher-order interactions are assumed to be zero. including interaction effects to the mth degree. because the estimate of the error variance and the degrees of freedom are different. If k is the number of factors. However. Two-Level Fractional Factorial A fractional factorial design also has a sample size that is a power of two. fractional factorial designs are orthogonal. two-factor interactions are confounded with other two-factor interactions. These designs are orthogonal. In fact. • Resolution = 4 means that main effects are not confounded with other main effects or two-factor interactions. The samples size is the product of the levels of the factors. This means that the estimates of the effects are uncorrelated. Resolution Number: The Degree of Confounding In practice. you can run several replicates of all possible combinations of m factor levels. this is 2k where k is the number of factors. fractional factorials are designed by deciding in advance which interaction effects are confounded with the other interaction effects. For two-level designs. But because running experiments costs time and money. This provides a good estimate of everything. Full factorial designs allow the estimation of interactions of all orders up to the number of factors. which must be assumed to be zero for the main effects to be meaningful. Like the full factorial. The figure to the left in Figure 12.or second-order approximations to the true functional relationship between the factors and the responses.8 is a geometric representation of a two-level factorial. the number of runs is 2k – p where p < k. the values of the other estimates remain the same.7 Choosing a Type of Screening Design The screening designer provides the following types of designs: Two-Level Full Factorial A full factorial design contains all combinations of the levels of the factors.

There are no two-level fractional factorial designs with sample sizes between 16 and 32 runs. The main effects are orthogonal and two-factor interactions are only partially confounded with main effects.12 Screening Designs Creating a Screening Design 205 12 DOE: Screening Designs • Resolution ≥ 5 means there is no confounding between main effects. 1. 1 Plackett-Burman Designs Plackett-Burman designs are an alternative to fractional factorials for screening. Table 12. then none of the classical designs discussed previously are appropriate. -1. -1 1. In cases of effect sparsity. JMP offers fractional factorials. -1. For mixed two-level and three-level designs. –1.8 Representation of Full Factorial (Left) and Two-Level Fractional Factorial (Right) Designs –1.8 is geometric representation of a two-level fractional factorial design. For pure three-level factorials. –1 -1. between two-factor interactions. 1 L18 John L18 Chakravarty L18 Hunter L36 1 3 8 11 7 6 4 12 . One useful characteristic is that the sample size is a multiple of four rather than a power of two. JMP offers complete factorials and specialized orthogonal-array designs. -1 1. The figure on the right in Figure 12. However. Mixed-Level Designs If you have qualitative factors with three values. Figure 12. 1.1 Design Two–Level Factors Three–Level Factors -1. the minimum aberration design of a given resolution minimizes the number of words in the defining relation that are of minimum length. a stepwise regression approach can allow for removing some insignificant main effects while adding highly significant and only somewhat correlated two-factor interactions. All the fractional factorial designs are minimum aberration designs. 24-run. This is different from resolution-three fractional factorial where two-factor interactions are indistinguishable from main effects. and 28-run Plackett-Burman designs. there are 20-run. listed below. or between main effects and two-factor interactions. 1. For DOE experts. -1 1.

However. JMP also includes a set of extra columns to use as regressors. the sum will show an effect if it contains an active effect. How to Run a Cotter Design By default. and the others high. A. if you have three factors. The danger is that several active effects with mixed signs will cancel and still sum to near zero and give a false negative. Click the Platform icon. JMP does not include a Cotter design in the list of available screening designs (Figure 12. To alter the setting for all screening designs: 1 2 3 4 Select File > Preferences. Cotter Designs Cotter designs are used when you have very few resources and many factors. . These are of the form factorOdd and factorEven where factor is a factor name. even though they are all orthogonal. A Cotter design begins with a run having all factors at their high level. When you use JMP to generate a Cotter design. and you believe there may be interactions. Suppose you believe in effect sparsity— that very few effects are truly nonzero. Some of these designs are not balanced. there are 2k + 2 runs. subject to randomizing the runs. They are constructed by adding up all the odd and even interaction terms for each factor. and so forth. B. The next run sets all factors at their low level and sequences through k more runs with one factor high and the rest low. Because Cotter designs have a false-negative risk. Cotter designs are easy to set up. Click DOE to highlight it. Changing the setting via the red triangle menu applies only to the current design. In the example of factors listed above. Then follow k runs each with one factor in turn at its low level.7). AEven estimates the sum of the even terms involving A. You believe in this so strongly that you are willing to bet that if you add up a number of effects. and C: Figure 12.206 12 Screening Designs Creating a Screening Design If you have fewer than or equal to the number of factors for a design listed in the table. many statisticians discourage their use. For k factors.9 AOdd = A + ABC BOdd = B + ABC COdd = C + ABC AEven = AB + AC BEven = AB + BC CEven = BC + AC Because these columns in a Cotter design make an orthogonal transformation. This completes the Cotter design. 2 Select Supress Cotter Designs. click the red triangle icon in the Screening Design title bar. For example. testing the parameters on these combinations is equivalent to testing the combinations on the original effects. you can use that design by selecting an appropriate subset of columns from the original design. Uncheck the box beside Supress Cotter Designs. The design is similar to the “vary one factor at a time” approach many books call inefficient and naive. if you would like to make a Cotter design: 1 Immediately after entering responses and factors (and before clicking Continue). AOdd estimates the sum of odd terms involving A.

Figure 12.10 Display and Modification Options • Change Generating Rules—Controls the choice of different fractional factorial designs for a given number of factors. click the disclosure button ( on Windows and on the Macintosh) to reveal the Aliasing of Effects panel. Confounding is the direct result of the assignment of new factor values to products of the coded design columns. Eight runs can only accommodate a full factorial with three two-level factors.11). Figure 12.11 Generating Rules and Aliasing of Effects Panel For example. • Coded Design—Shows the pattern of high and low values for the factors in each run. . a full factorial with five factors requires 25 = 32 runs. It is necessary to construct the two additional factors in terms of the first three factors.12 Screening Designs Creating a Screening Design 207 12 DOE: Screening Designs Displaying and Modifying the Design After you select a design type. Aliasing of Effects To see which effects are confounded with which other effects. • Aliasing of Effects—Shows the confounding pattern for fractional factorial designs.10). The price of reducing the number of runs from 32 to eight is effect aliasing (confounding). click the disclosure buttons ( on Windows and on the Macintosh) to display the design and show modification options using the Display and Modify Design panel to tailor the design (Figure 12. It shows effects and confounding up to two-factor interactions (Figure 12.

Figure 12. This means that you can’t tell the difference of the effect of Temperature and the synergistic (interactive) effect of Feed Rate and Concentration. This is characteristic of resolution-three designs. with second order aliasing as “1 5” (Feed Rate and Concentration).11.12). 2 Select Show Confounding Pattern (Figure 12. and third order confounding as “1 2 3” (Feed Rate. the values for Temperature are the product of the values for Feed Rate and Concentration. The effect names begin with C (Constant) and are shown by their order number in the design. Thus. In the example shown in Figure 12. . Temperature appears as “4”.14 shows the third level alias for the five-factor reactor example.12 Show Confounding Patterns 3 Enter the order of confounding you want to see (Figure 12.13 Enter Order 4 Click OK. Viewing the Confounding Pattern JMP can create a data table that shows the aliasing pattern for a specified level. and Stir Rate). all the main effects are confounded with two-factor interactions. Figure 12. To create this table: 1 Click the red triangle at the bottom of the Aliasing of Effects area.208 12 Screening Designs Creating a Screening Design For example. Figure 12.13). Catalyst.

the last column (Concentration) is the product of the second and third columns. so the fourth column of the coded design is the element by element product of the first two columns. each row represents a run.14 The Third Level Alias for the Five-Factor Reactor Example Understanding the Coded Design In the coded design panel.16 shows how the last two columns are constructed in terms of the first three columns. The checkmarks for Concentration show it is a function of Catalyst and Stir Rate. it changes the choice of different fractional factorial designs for a given number of factors. changing the checkmarks and clicking Apply changes the coded design. Similarly. The check marks for Temperature show it is a function of Feed Rate. The first three columns of the coded design remain a full factorial for the first three factors (Feed Rate. As shown in Figure 12.12 Screening Designs Creating a Screening Design 209 12 DOE: Screening Designs Figure 12. which represent Feed Rate. Figure 12. Temperature is now the product of Feed Rate and Catalyst. Catalyst. and Stir Rate). rows for the first three columns of the coded design.15 Default Coded Designs Changing the Coded Design In the Change Generating Rules panel.16 and click Apply. The fourth column (Temperature) of the coded design is the element-by-element product of the first three columns. and Stir Rate. If you check the options as shown in Figure 12. and Stir Rate are all combinations of high and low values (a full factorial design). Plus signs designate high levels and minus signs represent low levels. Concentration is a function of Feed Rate and Stir Rate. .15. the Coded Design panel changes. Catalyst. The Change Generating Rules table in Figure 12. Catalyst.

Figure 12. .16 Modified Coded Designs and Generating Rules Specifying Output Options Use the Output Options panel to specify how you want the output data table to appear. Sort Right to Left—the rows (runs) in the output table appear sorted from right to left.17 Select the Output Options • Run Order—Lets you designate the order you want the runs to appear in the data table when it is created.210 12 Screening Designs Creating a Screening Design Figure 12. click Make Table. including centerpoints. One replicate doubles the number of runs. Randomize—the rows (runs) in the output table appear in a random order. When the options are correctly set up. Sort Left to Right—the rows (runs) in the output table appear sorted from left to right. • Number of Replicates—Specify the number of times to replicate the entire design. Randomize within Blocks—the rows (runs) in the output table will appear in random order within the blocks you set up. Type the number of times you want to replicate the design in the associated text box. Choices are: Keep the Same—the rows (runs) in the output table appear as they do in the Design panel. • Number of Center Points—Specifies additional runs placed at the center points.

Continuing the Analysis After creating and viewing the data table. In the table.19 Running the Model Script .12 Screening Designs Creating a Screening Design 211 12 DOE: Screening Designs Viewing the Table After clicking Make Table. The column called Pattern shows the pattern of low values denoted “–” and high values denoted “+”. the high and low values you specified are displayed for each run. Figure 12. This script allows you to easily fit a model using the values in the design table.18 The Design Data Table The name of the table is the design type that generated it. Figure 12. Right-click it and select Run Script to run a fit model analysis (Figure 12.19). you can now run analyses on the data. Pattern is especially useful as a label variable in plots. you have a data table that outlines your experiment. The data table contains a script labeled Model.

Viewing an Actual-by-Predicted Plot When the model contains no interactions. The last column of the table has the p-values for each effect. In this case. but the Catalyst effect is large enough to be interesting if it is real. Figure 12. shown on the left in Figure 12. but this does not mean that the experiment has failed. which tells you that the model is not significant. The Scaled Estimates report displays a bar chart of the individual effects embedded in a table of parameter estimates. At this stage the results are not clear. None of the factor effects are significant. This suggests that effects other than the main effects of each factor are important.199. and RMSE appear below the plot. In Figure 12. appears at the top of the Fit Model report. an actual-by-predicted plot. the mean line falls inside the bounds of the 95% confidence curves. it is impossible to determine which two-factor interactions are important without performing more experimental runs. which is much larger than expected.21) as a part of the Fit Model report. Because of the confounding between two-factor interactions and main effects in this design. the RMSE is 14.20 An Actual-by-Predicted Plot To show labels in the graph (on the right in Figure 12. and select Row Label.212 12 Screening Designs Creating a Screening Design The next sections describe some of the parts of the analysis report that appears when you click Run Model. The model p-value. select all points.20. The pattern variable displayed in the data table serves as the label for each point.20. Viewing a Scaled Estimates Report When you fit the model. R2. . It means that some follow-up runs are necessary. JMP displays a Scaled Estimates report (Figure 12. The RMSE is an estimate of the standard deviation of the process noise assuming that the unestimated effects are negligible. right-click the graph.20).

.Figure 12. For comparison. you would then want to augment the design. you might also want to have complete 32-run factorial experimental data and analysis.21 Example of a Scaled Estimates Report If this scaled estimates report were not merely an example.

.

illustrated in the figure on the right below. so the standard two-level designs cannot fit curved surfaces. for which all but one factor are set at zero (midrange) and that one factor is set at outer (axial) values. Three distinct values for each factor are necessary to fit a quadratic function. Another important difference between the two design types is that the Box-Behnken design has no points at the vertices of the cube defined by the ranges of the factors. This is sometimes useful when it is desirable to avoid these points due to engineering considerations. • Axial (or star) points. Central Composite Design fractional factorial points Box-Behnken Design axial points center points . a response surface model can pinpoint it. If a minimum or maximum response exists inside the factor region. illustrated in the figure to the left below. One distinguishing feature of the Box-Behnken design is that there are only three levels per factor.13 Response Surface Designs Response surface designs are useful for modeling a curved quadratic surface to continuous factors. The Box-Behnken design. It combines a two-level fractional factorial and two other kinds of points: • Center points. is an alternative to central composite designs. The price of this characteristic is the higher uncertainty of prediction near the vertices compared to the central composite design. for which all the factor values are at the zero (or midrange) value. The most popular response surface design is the central composite design.

Figure 13. Silane. and Sulfur used to manufacture the tennis balls. open the Sample Data folder that was installed when you installed JMP.216 13 Response Surface Designs Introduction Introduction The Bounce Data. the Response Surface Design Choice dialog lists the designs in Figure 13.2 Response Surface Design Selection The Box-Behnken design selected for three effects generates the design table of 15 runs shown in Figure 13. and select Design Experiment > Bounce Response. and select Design Experiment > Bounce Data. The objective of this experiment is to match a standardized target value (450) of tennis ball bounciness. 3 Load the responses by clicking the red triangle icon on the Response Surface Design title bar and selecting Load Responses. In real life. Navigate to the Sample Data folder that was installed when you installed JMP. Let’s pretend this happened and use a finalized data table called Bounce Data. and select Design Experiment > Bounce Factors.jmp. Figure 13.2. The experimenter wants to collect data over a wide range of values for these variables to see if a response surface can find a combination of factors that matches a specified bounce target. .jmp. Figure 13. The bounciness varies with amounts of Silica.jmp.1 shows the completed Response panel and Factors panel.1 Response and Factors For Bounce Data After the response data and factors data are loaded. 2 Load factors by clicking the red triangle icon on the Response Surface Design title bar and selecting Load Factors. To follow this example: 1 Select DOE > Response Surface Design. Navigate to the Sample Data folder that was installed when you installed JMP.jmp.jmp sample data file has response surface data inspired by the tire tread data described in Derringer and Suich (1980).3. you would conduct the experiment and then enter the responses into the data table. To see this example data table.

3 JMP Table for a Three-Factor Box-Behnken Design After obtaining the Bounce Data. 2 Right-click Model and select Run Script to start a fit model analysis.13 Response Surface Designs Introduction 217 13 DOE: Response Surface Designs 1 Open to the Sample Data folder that was installed when you installed JMP.4 JMP Statistical Reports for a Response Surface Analysis of Bounce Data The Response Surface report also has the tables shown in Figure 13. and select Design Experiment > Bounce Data.3).jmp (Figure 13.5. run a fit model analysis on the data.jmp data table.4. Figure 13. The standard Fit Model analysis results appear in tables shown in Figure 13. . The prediction model is highly significant with no evidence of lack of fit. The data table contains a script labeled Model. with parameter estimates for all response surface and crossed effects in the model. Figure 13. All main effect terms are significant as well as the two interaction effects involving Sulfur. 3 Click Run Model.

218

13 Response Surface Designs Creating a Response Surface Design

Figure 13.5 Statistical Reports for a Response Surface Analysis

Provides a summary of the parameter estimates

Lists the critical values of the surface factors and tells the kind of solution (maximum, minimum, or saddlepoint). The solution for this example is a saddlepoint. The table also warns that the critical values given by the solution are outside the range of data values. Shows eigenvalues and eigenvectors of the effects. The eigenvector values show that the dominant negative curvature (yielding a maximum) is mostly in the Sulfur direction. The dominant positive curvature (yielding a minimum) is mostly in the Silica direction.

Creating a Response Surface Design
Response Surface Methodology (RSM) is an experimental technique invented to find the optimal response within specified ranges of the factors. These designs are capable of fitting a second-order prediction equation for the response. The quadratic terms in these equations model the curvature in the true response function. If a maximum or minimum exists inside the factor region, RSM can find it. In industrial applications, RSM designs involve a small number of factors. This is because the required number of runs increases dramatically with the number of factors. Using the response surface designer, you choose to use well-known RSM designs for two to eight continuous factors. Some of these designs also allow blocking. Response surface designs are useful for modeling and analyzing curved surfaces. To start a response surface design, select DOE > Response Surface Design, or click the Response Surface Design button on the JMP Starter DOE page. Then, follow the steps below: • “Entering Responses and Factors,” p. 219 • “Choosing a Design,” p. 219 • “Specifying Axial Value (Central Composite Designs Only),” p. 220 • “Specifying Output Options,” p. 221 • “Viewing the Design Table,” p. 221 • “Continuing the Analysis, If Needed,” p. 222

13 Response Surface Designs Creating a Response Surface Design

219 13 DOE: Response Surface Designs

Entering Responses and Factors
The steps for entering factors in a response surface design are unique to this design. To add factors, follow the step in Figure 13.6. Figure 13.6 Entering Factors into a Response Surface Design

Click Continue to proceed to the next step.

Choosing a Design
Highlight the type of response surface design you would like to use and click Continue. Figure 13.7 Choose a Design Type

The Response Surface designer provides the following types of designs: Box-Behnken Designs The Box-Behnken design has only three levels per factor and has no points at the vertices of the cube defined by the ranges of the factors. This is sometimes useful when it is desirable to avoid extreme

220

13 Response Surface Designs Creating a Response Surface Design

points due to engineering considerations. The price of this characteristic is the higher uncertainty of prediction near the vertices compared to the central composite design. Central Composite Designs The response surface design list contains two types of central composite designs: uniform precision and orthogonal. These properties of central composite designs relate to the number of center points in the design and to the axial values: • Uniform precision means that the number of center points is chosen so that the prediction variance at the center is approximately the same as at the design vertices. • For orthogonal designs, the number of center points is chosen so that the second order parameter estimates are minimally correlated with the other parameter estimates.

Specifying Axial Value (Central Composite Designs Only)
When you select a central composite (CCD-Uniform Precision) design and then click Continue, you see the panel in Figure 13.8. It supplies default axial scaling information. Entering 1.0 in the text box instructs JMP to place the axial value on the face of the cube defined by the factors, which controls how far out the axial points are. You have the flexibility to enter the values you want to use. Figure 13.8 Display and Modify the Central Composite Design

• Rotatable makes the variance of prediction depend only on the scaled distance from the center of the design. This causes the axial points to be more extreme than the range of the factor. If this factor range cannot be practically achieved, it is recommended that you choose On Face or specify your own value. • Orthogonal makes the effects orthogonal in the analysis. This causes the axial points to be more extreme than the –1 or 1 representing the range of the factor. If this factor range cannot be practically achieved, it is recommended that you choose On Face or specify your own value. • On Face leaves the axial points at the end of the -1 and 1 ranges. • User Specified uses the value entered by the user, which can be any value greater than zero. Enter that value into the Axial Value text box. If you would like to inscribe the design, click the box beside Inscribe. When checked, JMP re-scales the whole design so that the axial points are at the low and high ends of the range (the axials are –1 and 1 and the factorials are shrunken based on that scaling).

13 Response Surface Designs Creating a Response Surface Design

221 13 DOE: Response Surface Designs

Specifying Output Options
Use the Output Options panel to specify how you want the output data table to appear. When the options are correctly set up, click Make Table. Figure 13.9 Select the Output Options

• Run Order—Lets you designate the order you want the runs to appear in the data table when it is created. Choices are: • Keep the Same—the rows (runs) in the output table will appear as they do in the Design panel. • Sort Left to Right—the rows (runs) in the output table will appear sorted from left to right.
• Randomize—the rows (runs) in the output table will appear in a random order. • Sort Right to Left—the rows (runs) in the output table will appear sorted from right to left. • Randomize within Blocks—the rows (runs) in the output table will appear in random order

within the blocks you set up. • Number of Center Points—Specifies additional runs placed at the center points. • Number of Replicates—Specify the number of times to replicate the entire design, including centerpoints. Type the number of times you want to replicate the design in the associated text box. One replicate doubles the number of runs.

Viewing the Design Table
Now you have a data table that outlines your experiment, as described in Figure 13.10.

222

13 Response Surface Designs Creating a Response Surface Design

Figure 13.10 The Design Data Table The column called Pattern identifies the coding of the factors. It shows all the codings with “+” for high, “–” for low factor, “a” and “A” for low and high axial values, and “0” for midrange. Pattern is suitable to use as a label variable in plots because when you hover over a point in a plot of the factors, the pattern value shows the factor coding of the point. Runs are in a random order. This script allows you to easily fit a model using the values in the design table. There are two center points per replicate.

The name of the table is the design type that generated it.

The Y column is for recording experimental results.

Continuing the Analysis, If Needed
After creating and viewing the design table, running the experiment, and recording your results in the design table’s Y column, run a fit model analysis on the data. The data table contains a script labeled Model. Right-click it and select Run Script (Figure 13.11) to fit the model. Figure 13.11 Running the Script

After clicking Run Model in the dialog, (Figure 13.12), review the analysis.

12 Fitting the Model .13 Response Surface Designs Creating a Response Surface Design 223 13 DOE: Response Surface Designs Figure 13.

.

Prospective Power Analysis The following five values have an important relationship in a statistical test on means: • Alpha Alpha is the significance level that prevents declaring a zero effect significant more than alpha portion of the time. estimate of within-group variance. This capability is available for the single sample. and power. and computes the third for the following cases: • difference between one sample's mean and a hypothesized value • difference between two samples means • differences in the means among k samples • difference between a variance and a hypothesized value • difference between one sample proportion and a hypothesized value • difference between two sample proportions • difference between counts per unit in a Poisson-distributed sample and a hypothesized value. . two-sample. If you supply two values. the result is a plot of the other two. You can supply either one or two of the three values. The calculations assume that there are equal numbers of units in each group. you must give JMP an estimate of the group means and sample sizes in a data table as well as an estimate of the within-group standard deviation (σ). It requires that you enter any two of three quantities. These depend on the significance level—alpha—of the hypothesis test for the effect and the standard deviation of the noise in the response. The Sample Size and Power command determines how large of a sample is needed to be reasonably likely that an experiment or sample will yield a significant result. and k-sample situations. power. “Will I detect the group differences I am looking for. if they are balanced. difference to detect. JMP computes the third. and alpha level?” In this type of analysis. given that the true effect size is at least a certain size. sample size. given my proposed sample size. You can apply this platform to more general experimental designs. and a number-of-parameters adjustment is specified. If you supply only one of these values.14 Prospective Power and Sample Size Use the DOE > Sample Size and Power command to answer the question “How many runs do I need?” The important quantities are sample size. and the magnitude of the effect. Using the Sample Size and Power command when doing a prospective analysis helps answer the question.

1). The values you enter depend on your initial choice. or Two Sample Means in the initial Sample Size selection list (Figure 14. or when comparing k sample means.2 appears and asks for the anticipated experimental values. As an example. the panel shown in Figure 14.1 appears with button selections for experimental situations. The following sections describe each of these selections and explains how to enter estimated parameter values and the desired computation. • Effect Size Effect size is how different the means are from each other or from the hypothesized value. Figure 14.1 Sample Size and Power Choices One-Sample and Two-Sample Means After you click either One Sample Mean. or samples) are involved in the experiment. the Power and Sample Size dialog in Figure 14. consider the two-sample situation. or the effect size expected in the experimental situation where there is a single mean comparison. When you select DOE > Sample Size and Power. a two sample comparison. • Sample Size Sample size is how many experimental units (runs. • Power Power is the probability of declaring a significant result. The Sample Size and Power command in JMP helps estimate in advance either the sample size needed. . power expected.226 14 Prospective Power and Sample Size One-Sample and Two-Sample Means • Error Standard Deviation Error Standard Deviation is the unexplained random variation around the means.

and one or two of the other three values: Difference to detect. If there are two unspecified fields. Even though the true error is not known. there are other factors with the extra parameters that can be specified here. it isn’t very important what values you enter here unless the experiment is in a range where there is very few degrees of freedom for error. as shown here. Computed sample size numbers can have fractional values. in addition to fitting the means described in the situation. experimental units.05. This is usually done by . the power and sample size platform constructs a plot that shows the relationship between those two values: • power as a function of sample size. the number of extra parameters is five—two parameters for the extra main effects. For example. • Sample Size Sample size is the total number of observations (runs. In a multi-factor balanced design. the power calculations are an exercise in probability that calculates what might happen if the true values were as specified. In practice. Leave this field zero in simple cases. The Power and Sample Size dialog asks for the values depending the first choice of design: • Alpha Alpha is the significance level.14 Prospective Power and Sample Size One-Sample and Two-Sample Means 227 14 DOE: Power And Sample Size Figure 14. but the total over all groups. This implies willingness to accept (if the true difference between groups is zero) that 5% (alpha) of the time a significant difference will be incorrectly declared. for a given power. Sample Size. given a sample size • effect size as a function of sample size. • Error Std Deviation Error Std (Standard) Deviation is the true residual error. and three parameters for the interactions. For single sample problems this is the difference between the hypothesized value and the true value. • Extra Params Extra Params (Parameters) is only for multi-factor designs.2 Initial Power and Sample Size Dialogs for Single Mean (left) and Two Means (right) The Two Sample Means choice in the initial Power and Sample Size dialog always requires values for Alpha and the error standard deviation (Error Std Dev). and Power. given specific effect size • power as a function of effect size. which you need to adjust to real units. in a three-factor two-level design with all three two-factor interactions. The power and sample size platform then calculates the missing item. • Difference to Detect Difference to detect is the smallest detectable difference (how small a difference you want to be able to declare statistically significant). or samples). usually 0. Sample size is not the number per group.

• Continue Evaluates at the entered values. To calculate the power when the sample size is 10. If an experiment requires considerable effort. which gives a more typical looking power curve. For example. 229.99998. Double click on the horizontal axis to get any desired scale.9.3.2 to 0. “Power and Sample Size Animation for a Single Sample. • Back Goes back to the previous dialog. rounding to 1.228 14 Prospective Power and Sample Size One-Sample and Two-Sample Means increasing the estimated sample size to the smallest number evenly divisible by the number of groups. Change the range of the curve by changing the range of the horizontal axis. Figure 14.3. • Power Power is the probability of getting a statistic that will be declared statistically significant. for an illustration of this animation script. shows a range of sample sizes for which the power varies from about 0.95 if you can afford it. the plot on the right in Figure 14.95. See the section. leave both Sample Size and Power empty and click Continue. • Animation Script The Animation Script button runs a JSL script that displays an interactive plot showing power or sample size.” p. Power is equal to alpha when the specified effect size is zero. Bigger power is better. plan so that the experimental design has the power to detect a sizable effect.4. shows the power is calculated to be 0. The left-hand graph in Figure 14. . leave Power missing in the dialog and click Continue. when there is one. but the cost is higher in sample size.4. You should go for powers of at least 0. Single-Sample Mean Suppose there is a single sample and the goal is to detect a difference of 2 where the error standard deviation is 0.3 A One-Sample Example To see a plot of the relationship of power and sample size. as shown in the left-hand dialog in Figure 14. The dialog on the right in Figure 14. has the horizontal axis scaled from 1 to 8.90 or 0.

5) and Difference to Detect and Power are empty. Sample Size is 10.14 Prospective Power and Sample Size One-Sample and Two-Sample Means 229 14 DOE: Power And Sample Size Figure 14. In the example shown in Figure 14. The animation begins showing a normal curve positioned with mean at zero (representing the estimated mean and the true mean).4 A One-Sample Example Plot When only Sample Size is specified (Figure 14. and another with mean at 0. a plot of power by difference appears.05. Figure 14.4. Alpha is 0.5 Plot of Power by Difference to Detect for a Given Sample Size Power and Sample Size Animation for a Single Sample The Animation Script button on the Power and Sample Size dialog for the single mean displays an interactive plot that illustrates the effect that changing the sample size has on power. The probabil- .4 (the difference to be detected).6. and the Difference to Detect is set to 0.

7. The comparison is between two random samples instead of one. Also.7. shown on the right in Figure 14. Suppose the error standard deviation is 0. Figure 14.9 (as before). 0. the Difference to Detect is the difference between two means.9. Leave Power blank and click Continue to see the power calculation. To increase the power requires a larger sample. To find out how large. as shown in the dialog on the left in Figure 14. and the sample size is 16. The crosshair tool estimates that a sample size of about 35 is needed to obtain a power of 0. leave both Sample Size and Power blank and examine the resulting plot. is shaded in blue on this plot. You can drag the handles over the curves drag to see how their positions affect power.5433. you can click on the values for sample size and alpha showing beneath the plot to change them. often represented as β in literature.230 14 Prospective Power and Sample Size One-Sample and Two-Sample Means ity of committing a Type II error (not detecting a difference when there is a difference). . This is considerably lower than in the single sample because each mean has only half the sample size.6 Example of Animation Script to Illustrate Power Two-Sample Means The dialogs work similarly for two samples. the desired detectable difference is 1.

5)2+(0. which is the square root of 5.9.95.7 Plot of Power by Sample Size to Detect for a Given Difference k-Sample Means The k-Sample Means situation can examine up to 10 kinds of means. as shown in the dialog on the left in Figure 14. This confirms that a sample size of 16 looks acceptable.236. the power and sample size calculations produce the power curve shown on the right in Figure 14. calculated as square root of the sum of squared deviations from the grand mean. The next example considers a situation where 4 levels of means are expected to be about 10 to 13. . In this case it is the square root of (–1.5)2+ (–0. Notice that the difference in means is 2.8.5)2. If both Sample Size and Power are left blank. and the Error Std Dev is 0.14 Prospective Power and Sample Size k-Sample Means 231 14 DOE: Power And Sample Size Figure 14.5)2+(1.8. When a sample size of 16 is entered the power calculation is 0.

An example is when the variance for resistivity measurements on a lot of silicon wafers is claimed to be 100 ohm-cm and a buyer is unwilling to accept a shipment if variance is greater than 55 ohm-cm for a particular lot. with an alpha of 0.05 and power of 0. alpha level. specify a baseline variance. In the dialog. . To indicate direction of change.gov/div898/handbook. You can access the NIST manual examples at http://www.232 14 Prospective Power and Sample Size One-Sample Variance Figure 14.nist. Suppose you want to detect an increase of 55 for a baseline variance of 100. and direction of change you want to detect. The examples throughout the rest of this chapter use engineering examples from the online manual of The National Institute of Standards and Technology (NIST).itl. you enter two of the items and the Power and Sample Size calculations determines the third. As with previous dialogs.The usual purpose of this option is to compute a large enough sample size to guarantee that the risk of accepting a false hypothesis (β) is small.1) determines sample size for detection of a change in variance. The computations then show whether the true variance is larger or smaller than its hypothesized value.99. select either Larger or Smaller from the Guarding a change menu. When you click Continue.8 Prospective Power for k-Means and Plot of Power by Sample Size One-Sample Variance The One-Sample Variance choice on the Power and Sample Size dialog (Figure 14. Enter these items as shown in Figure 14. entered as the Baseline Variance. the computed result shows that you need a sample size of 170.9.

The dialogs are the same except you enter Baseline Proportion and also specify either a one-sided or two-sided test.10). The sampling distribution for proportions is actually binomial. enter a negative amount in the Difference to Detect box. Note: Remember to enter the variance in the Baseline Variance box. When there are two samples. as indicated on the dialogs (Figure 14. the Baseline Proportion you enter is the average of the two sample proportions.9 Sample Size and Power Dialog To Compare Single-Direction One-Sample Variance One-Sample and Two-Sample Proportions The dialogs and computations to test power and sample sizes for proportions are similar to those for testing sample means. not the standard deviation. For a one-sample situation.14 Prospective Power and Sample Size One-Sample and Two-Sample Proportions 233 14 DOE: Power And Sample Size If you want to detect a change to a small variance. the Baseline Proportion is the average of a known baseline proportion and the single sample proportion. Figure 14. but the computations to determine sample size and test proportions use a normal approximation. .

or average of two-sample proportions. The example process is monitored with a one-sided test at 5% alpha and a 10% risk (90% power) of failing to detect a change of that magnitude.234 14 Prospective Power and Sample Size One-Sample and Two-Sample Proportions Figure 14. For example. and the computed sample size of approximately 77.08.The manager does not want to stop the process unless it has degenerated greater than 16% defects (6% above the 10% known baseline defective).11. leave both Difference to Detect and Sample Size blank. To see the plot on the right in Figure 14. Figure 14.11 shows the entries in the Sample Size and Power dialog to detect a given difference between an observed proportion and a baseline proportion.or two-sided) Testing proportions is useful in production lines. The Baseline Proportion in this example is 0. Figure 14. Enter 1 or 2 to indicate the type of test (one. which is the average of the baseline (10%) and the proportion above the baseline (6%).11 Dialog To Compare One Proportion to a Baseline and Sample Size Plot .10 Default Power and Sample Dialog for One-Sample and Two-Sample Proportions Enter average of baseline and one-sample proportions. Use the grabber tool (hand) to move the x-axis and show a specific range of differences and sample sizes. where proportion of defects is part of process control monitoring. suppose a line manager wants to detect a change in defective units that is 6% above a known baseline of approximately 10% defective.

alpha is sometimes called the producer’s risk and beta is called the consumer’s risk. As an example. Although the number of defects observed in an area of a given size is often assumed to have a Poisson distribution. Then enter two of the remaining fields to see the calculation of the third. Enter an alpha of 0. A unit can be an area and the counts can be fractions or large numbers. Enter a power of 0.128 (Figure 14. consider a wafer manufacturing process with a target of 4 defects per wafer and you want to verify that a new process meets that target. Click the Counts per Unit button. The process meets the target if there are less than 48 defects (6 defects per wafer in a sample of 8 wafers). The computed sample size is expressed in those units. the area and count are assumed to be large enough to support a normal approximation. 5 Click Continue to see the computed sample size of 8.12 Dialog For Counts Per Unit Example 1 2 3 4 .9. The test is for one-sided (one-tailed) change. Questions of interest are: • Is the defect density within prescribed limits? • Is the defect density greater than or less than a prescribed limit? Enter alpha and the baseline count per unit. Figure 14.14 Prospective Power and Sample Size Counts per Unit 235 14 DOE: Power And Sample Size Counts per Unit The Counts per Unit selection calculates sample size for the Poisson-distributed counts typical when you can measure more than one defect per unit. Enter the Difference to Detect in terms of the baseline count per unit (defects per unit).12). Select DOE > Sample Size and Power. In this kind of situation. which is the chance of detecting a change larger than 2 (6 defects per wafer).1 to be the chance of failing the test if the new process is as good as the target.

as shown in the window on the right in Figure 14. use the Sample Size and Power feature to compute the Sigma quality level for 50 defects in 1.13. as shown in the window to the left in Figure 14.236 14 Prospective Power and Sample Size Sigma Quality Level Sigma Quality Level Use the Sigma Quality Level feature. Figure 14.13.14).000 as the number of opportunities. The results.13 Sigma Quality Level Example 1 If you want to know how many defects reduce the Sigma Quality Level to “six-sigma” for 1.4.000 opportunities: 1 Select DOE > Sample Size and Power. by entering any two of the following three quantities: • number of defects • number of opportunities • Sigma quality level When you click Continue.14) shows that the Number of Defects cannot be more than approximately 3.000.5.000. 4 Click Continue.3. the sigma quality calculator computes the missing quantity using the formula Sigma Quality Level = NormalQuantile(1 – defects/opportunities) + 1. As an example. The computation (window to the right in Figure 14. 2 Click the Sigma Quality Level button. enter 6 as the Sigma Quality Level and leave the Number of Defects blank (window to the left in Figure 14.000. accessed by selecting DOE > Sample Size and Power. .000 opportunities. 3 Enter 50 for the number of defects and 1. are a Sigma quality level of 5.

14 Sigma Quality Level Example 2 .14 Prospective Power and Sample Size Sigma Quality Level 237 14 DOE: Power And Sample Size Figure 14.

.

202 Autocorrelation 149 autocorrelation 148–149 Autocorrelation Lags 147 Autoregressive Order 155 axial points 215 scaling.jmp 216 Bounce Factors.Index JMP-SE Symbols “F Ratio” 104 “Prob>F” 104 Numerics –2LogLikelihood 151 95% bivariate normal density ellipse 164 A aberration designs 205 acceptable values See lower limits and upper limits activating toolbars 14 Actual-by-Predicted plots 212 Add button 86 Add Column button 95 Add Graphics Script command 184 additional runs 196. 89 Analyze menu 22. 89 ARIMA 143. Tukey Kramer command 62 Alpha 225. 210. 72. 65 Box-Behnken designs 215–216. 227 Alpha Amalyze 21 analysis of variance report 47. 45. 151 aliasing effects 207 All Graphs command 65 All Pairs. 79 Analyze Toolbar 13–14 animation scripts 228 Annotate tool 28 annotating 28 resizing and repositioning 28 ANOVA 60 Display Options command 62 JMP INTRO terms 60 one way 46 popup menu 62 report 47 table 53.jmp 216–217 Bounded 157 Box Plots command 62. 181 producing 171 Bar Chart command 183 bar chart of correlations 167 Bartlett’s test 64 BIC 151 bivariate normal density ellipse 164 blue diamond disclosure icon 53 Border 184 Bounce Data. 60. central composite designs 220 B Background Color command 184 Backward 103 bar chart 172. 219 See also Response Surface designs Box-Jenkins model see ARIMA Braces. 154–156 Arrange Plots 186 assigning importances (of responses) 195. 221 AIC 151 AIC 104 Akaike’s Information Criterion 104. 92 table 53.jmp 131 Brown smoothing 157 Brown-Forsythe test 64 By role 38 .

37 Confidence Intervals 155–156 confidence intervals in ANOVA 50 in linear regression 52 mean 23. 46 graphs and reports 30 popup menu 30 Contrast dialog 84 contrasts 84 Control Charts c 132 Individual Measurement 127 Moving Range 127 np 130 p 130 R 125 S 125 Shewhart 124–133 u 131 XBar 125 Copy command 18. 186 Connect Through Missing 186 Connecting Lines 149 constant estimate 152 Constrain fit 155 contingency table 54. 36 score 37 selecting level 37 confidence limits in linear regression 59 confounding 207. 30. 181 Frame Options 183 Level Options 184 Platform options 185 Single-Chart Options 182 chart platform 172 changing all levels 173 chart types 185 Chi Square statistic 35 clipboard 180 Close command 19 Coating. 183. 66 analysis 45. 66 reports 66 Contingency Table command 67 continuous factors 203 continuous variables 29. 104 Cp 104 .jmp 107. 125–126 Color or Mark by Column command 49 colors and markers 49 Colors command 184 Column Info 96 Columns command in reports 34 Compare Means command 50. 212 resolution numbers 204 confounding pattern 208 Connect Color command 187 Connect Means command 66 Connect Points option 174. 180 corrected total 89 correlation 161–170 correlation coefficient 53 correlation matrix 163 Correlation of Estimates command 95 Correlations Multivariate 163 Cotter designs 206 count 34 Count Axis command 31 counts per unit (power and sample size) 235 covariance 161–170 Covariance Matrix 166 Cp 41. 62 comparison circles 50.240 Index JMP-SE C C Total 89 Capability Analysis command 39 with Control Charts 115 Categorical factors 203 categorical probabilities testing 36 categorical variables 29 graphs and reports 34 Caustic Soda 21 CCD See central composite designs c-Chart 132 CDF plot command 33 center points central composite designs 215 response surface designs 215 central composite designs 215. 63 interpretation 63 Comparison Circles command 66 Confid Curves Fit command 59 Confid Curves Fit option 52 Confidence Interval command 24. 219–220 See also response surface designs Chakravarty 205 changing individual levels 172 Chart launch dialog 181–182 Chart platform 171.

79. 61 density functions 33 descriptive statistics 22 design resolutions 204 designs aberration 205 Box-Behnken 215–216. 28. 155. 171. 71. 180 Index disclosure icon 53 DispayBox command 184 Display Options command 62. 167 Density Ellipse command 53. 66 defects 235 Denim. 171 details 21 Density Axis command 31 Density Ellipse 164–165. Student’s t command 62 Edit Formula 96 effect aliasing 207 eigenvalue 218 eigenvector 218 size 226 sparsity 199. 157 evolution 156 Expanded Estimates command 93 .Index JMP-SE 241 dragging in 46 Difference to Detect option 227. 29. 24 launching 22 report 25 DOE simple examples 199 double exponential smoothing 157 drag 147. 219 central composite 215 fractional factorials 204 full factorial 189. 86 crossed effect 86 crosstabs table 66 cumulative distribution function 33 cumulative logistic probability plot 67 cumulative probabilities 34 Current Estimates table 102 Custom 156 Custom Test command 94 cut and paste 18. 205–206 effect details 93 Effect Leverage Pairs 97 Effect Leverage personality 88 Effect Screening personality 88 Effect Test table 91 effects nonestimable 204 orthogonal 220 eigenvalue of effect 218 eigenvector of effect 218 Ellipse Alpha 166 Ellipse Color 166 Enter All 103 Entered 103 equal variances in t test 47 Error Bars command 62 error SS 90 error standard deviation 226–227 Estimate 103. 152. 195.jmp 21. 45. 166 dummy variables 93 Dunnett’s comparisons 62 Durbin-Watson Test 96 E Each Pair. 65 D damped-trend linear exponential smoothing 158 data table opening 15 Data Table Window 39. 22. 21 graphs 29 launch dialog 17. 199 full factorials 204 minimum aberration 205 mixed-level 205 orthogonal screening designs 204 surface designs 220 orthogonal arrays 205 Plackett-Burman 205 response surface 215 screening 199 uniform precision 220 desirability values 202 DF 151 DFE 104 dialog boxes Distribution platform 16–17. 230 Differencing Order 155 Direction 102 Cpk 41 Cpm 41 Cross button 82.

199. 18. 202 independent variables 55 Individual Confidence Interval 96 . 153 Forecast plot 153 Formula command 56 Formula Editor 56 Forward 102 fractional factorial designs 204 Freq button 86 frequencies table 34 frequency 34 H hand tool in Distribution platform 22 with Distribution platform 31 help system 11 Help tool 12 histogram 16. 24 bar position 23 bar widths 22 red bracket (box plot) 32 using 22 Histogram command 30 Hoeffding’s D 168. 58 Fit Model dialog 79–80. 199. 58 Fit Mean command 51. 175 Graph toolbar 13–14 Graphs tab 13 Group By command 54. 54. 173. 30 hypothesized means specifying 25 I identifying key factors 199 importance of responses 195.242 Index JMP-SE exponential smoothing see Smoothing Models extra parameters 227 full factorial designs 189. 44 Fit Line command 52. 61 group variances homogeneity 64 grouping variable 54. 87 Fixed 157 Forecast Periods 147. 204 examples 189 Full Factorial macro 83. 22.jmp 99 fitting lines 51 fitting personality 85. 181 F Ratio 47 F test 90 Factor 152 Factor Profiling option 193 factorial designs fractionals 204 full 189. 195. 180 File tab 13 File/Edit toolbar 13–14 Fit Distribution 26. 85 platform 79 Fit Model platform 100 examining results 81 launching 79 Save 96–97 Fit Polynomial command 59 Fit Special command 61 Fit Y By X platform 45 launching 45 Fitness. 170 Holt smoothing 158 homogeneity of variances 64 honestly significant difference 62 Horizontal command 185 Horizontal Layout command 17. 202 goals matching targets 202 minimizing and maximizing 202 Goodness of Fit 33 Grand Mean command 65 Graph 149 Graph menu 171. 86 F “F Ratio” 104 G general linear model 79 Go 101. 204 three level 205 Factorial Sorted macro 87 Factorial to Degree macro 86 factors categorical 203 continuous 203 key factors 199 false negatives 206 Fat Plus (selection) tool 18. 103 goal types 195. 195. 61.

202 maximizing goals 202 mean 16. adding 82 interactions 206 high-order 204 Intercept 155 intercept 152 interquartile range 31 Introduction sections. 93 least squares regression 51. 58 confidence limits 59 Lock 103 Log function 57 Log10 function 57 Logistic platform 55 Logistic Plot command 68 logistic regression 67 Lognormal 33 Long-term sigma 39 Lot Number column 21 Lower Spec Limit 39 LSMeans 82. 61. 93 LSMeans Contrast command 84. 221 Kendall’s Tau 168 Kendall’s tau-b 169 k-Sample Means (power and sample size) 231 linear contrasts 84 linear exponential smoothing 158 linear regression 51.Index JMP-SE 243 likelihood ratio tests 36 Line Chart command 183 Line of Fit command 65 Line Style 187 Individual Measurement Chart 127 inertia of Scroller tool 82 Inscribe option 220 interaction effect. 202 maximize responses 195. 58. 71 interpreting the scatterplot 73 launching 72 preparing the data 71 matching target goals 195. 36 specifying hypothesized 25 test 35 testing 24 L L18 Chakravarty 205 L18 Hunter 205 L18 John 205 L36 205 Label column 211. 93 LSMeans Plot 82 LSMeans Plot command 93 LSMeans Student’s t command 93 LSMeans Table command 93 M macros 83 Macros drop-down list 86 Make Model 103–104 Mallow’s Cp criterion 104 Marker Drawing Mode 184 Marker Size command 184 Markers command 184 marking points 48 matched pairs 47 plot interpretation 75 scatterplot 73 Matched Pairs platform 47. 79 legends with colors and markers 49 Level Midpoints command 38 Level Numbers command 38 Level Options command 185 level smoothing weight 156 Levene’s test 64 leverage plot 91–92 whole model 92 . 30 confidence interval 30. 222 lack of fit 59. 170 Invertible 151 Index J JMP Starter 12–13 JSL (JMP Scripting Language) animation scripts 228 K Keep the Same command 196. about 12 Inverse Corr table 163 inverse correlation 163. 210. 90 error 90 table 60 Lag 152 Lasso tool 48 leaf values 32 least squares means 82. 18.

50 Means Diamonds command 62. 210.244 Index JMP-SE Mean CI Lines command 65 Mean Confidence Interval 96 Mean Error Bars command 65 Mean Error Bars option 48 Mean Line 149 Mean Lines command 65 Needle Chart command 183 Needle option 186 Needle Plot command 172 nested effect 86 New Column means one and two sample 226 Means and Std Dev command 48 means diamonds 31. 65 Means Dots command 62 Means/Anova/t test command 50. 62 Means/Std Dev/Std Err command 62 median 16. 222 model sum of squares 90 Model Summary table 151.jmp 138 On Face option 220 N N Responses button 195. 202 minimizing goals 202 minimum aberration designs 205 missing value 163 missing values 167 Mixed 103 mixed-level designs 205 Mixture Response Surface macro 87 Model Comparison table 151 model effects 86 Model script Model Specification dialog 211. 163 Multivariate platform 161–170 command 57 nominal logistic regression see Logistic platform nominal variables 29 nominal/ordinal by continuous fit see Logistic platform nonconforming unit 43 nonestimable effects 204 Nonparametric Correlations 168 Nonparametric Measures of Association table 168 nonparametric tests 25. 155 Modeling tab 13 modeling type 29. 18. 93 multiple regression example 99–105 Multivariate 161. 217. 221 ordinal variables 29 orthogonal array designs 205 orthogonal designs screening designs 204 surface designs 220 Orthogonal option 220 Other 166 outlier box plot 30–31 . 45 Moments command 30 Moments table 36 More Moments command 30 mosaic plot 54. 64 Normal Quantiles command 38 normality 25 np-Chart 130 Number of Forecast Periods 150 number of runs screening designs 204 O O’Brien’s test 64 OC Curves 118 Oil1 Cusum. 31–32 Median rank scores 64 Method column 21 Minimal Report personality 88 minimize responses 195. 66 Mosaic Plot command 67 Moving Average Order 155 Moving Range Chart 127 MSE 104 multiple comparison tests 62–63. 32 Normal Quantile Plot command 25. 31. 202 nDF 104 one way analyses 47 one way ANOVA 46 one-sample and two-sample means 226 one-sample proportion (power and sample size) 233 one-sample variance (power and sample size) 232 opening data tables 15 order of runs 196. 35 Normal 33 Normal 26 normal density ellipse 164 normal quantile plot 25.

201. q parameters 155 Pairwise Correlations 163 Pairwise Correlations table 167 Parameter 103 parameter estimates 60 with fitted lines 52 Parameter Estimates Table 60. 47. 169 proportions (power and sample size) 233 Pumice Stone 21 pure error 90 Index outliers 31 outside effect 86 overlap marks 50 Overlay Color command 183 Overlay Marker Color 187 Overlay Marker command 183.Index JMP-SE 245 plots Actual-by-Predicted 212 Point Chart command 183 points axial 215 colors and markers 49 Points command 65 Points Jittered command 66 Points Spread command 66 Poisson-distributed counts 235 polynomial effect 87 Polynomial to Degree macro 87 power analyses 226 in statistical tests on means 226 one-sample and two-sample means 227–228 Power Analysis command 93 power and sample size calculations 225–237 animation 229 counts per unit 235 k-sample means 231 one-sample and two sample proportions 233 one-sample mean 228 one-sample variance 232 sigma quality level 236 two-sample means 230 Ppk Capability Labeling 40 Predicted Values 96 predicted values saving 58–59 prediction variances 220 Prediction Formula 96 prerequisites to using JMP INTRO 11 Press 96 Print command 18 printing reports 18 Prob Axis command 31 Prob Scores command 38 Prob to Enter 102 Prob to Leave 102 Prob>|t| 152 “Prob>F” 104 probabilities testing 26. 211. 35 Probability Labels command 65 process capability ratio 41 product-moment correlation 167. extra 227 Partial Autocorrelation 149 partial autocorrelation 148 Partial Corr table 164 partial correlation 164 Paste command 19. 85 p. 187 Overlay option 185–186 Overlay Pattern command 183 Overlay Plot platform 173. 81. 180 Pattern column 197.jmp 127 Pie command 185 Plackett-Burman designs 205 Plot Actual by Predicted 96 Plot Actual by Quantile command 65 Plot Effect Leverage 96 Plot Quantile by Actual command 65 Plot Residual By Predicted 96 Plot Residual By Row 96 Plot Residuals command 53 . 91 Parameter Estimates table 152 parameters. 222 patterns confounding 208 p-Chart 130 Pearson Chi Square test 36 Pearson correlation 167. d. 185 Connect Points option 174 platform options 186 single-plot options 186 Y Options 174 Y Options command 186 P p value 35. 169 Pen Style command 183 Periods Per Season 156 personality 99 Pickles.

88 runs additional 196. 210. 202 lower limits 202 upper limits 202 results annotating 28 revealing columns in reports 34 RMSE 193. 221 randomizing runs 196. 89 Rotatable option 220 Row Colors command 49 Row Markers command 49 row states 23 RSM (Response Surface Methodology) 218 RSquare 104. 210. 151 Rsquare 89 Adj 89 RSquare Adj 104 Run Charts 114 Run Model 100 Run Model button 84.246 Index JMP-SE p-value 36 Q q-q plot 31 Quantile Box Plot command 32 examples 216–223 introduction 218 purpose 215 reports 217 Response Surface Effect macro 87 Response Surface Methodology (RSM) 218 response surface models 87 responses custom designs 202. 212 root mean square error 59. 29.jmp 189–190 Reactor Response. 221 order they appear in table 196. 62 quartiles 31 R r 53 R2 59.jmp 189 red bracket (box plot) 32 red triangle popup menus 17 Redo Analysis command 66 regressor columns 206 Remove 147 Remove 85 Remove All 103 Remove button 82 Remove Fit command 53 reports setting titles 80 requesting additional runs 196. 210. 219 desirability values 202 goals 195.jmp 21. 221 Range option 186 Range Plot command 186 Ranks Averaged command 38 Ranks command 38 R-Chart 125 Reactor 32 Runs. 71. 79. 171 details 21 sample means 226 Sample Size and Power command 225 sample sizes example comparing one proportion to baseline and sample size plot 234 example comparing single-direction one-sample variances 232 example with counts per unit 235 one and two sample means 227 . 210. 221 screening designs 204 quantile-quantile plot 31 quantiles 32 Quantiles command 30. 89 R2 adjusted 59. 153 plotting 53 saving 58 resolution numbers 204 resolutions of designs 204 response surface designs S sample autocorrelation function 149 sample data Denim. 221 requesting additional 196. 89 Random Effect 99 Randomize within Blocks 210. 45. 221 re-running an analysis 82 rescaling designs 220 Residual Statistics 154 Residuals 96 residuals 53. 210.jmp 189 Reactor Factors. 48.

25. 155–159 smoothing weight 156 Sort Left to Right 196. 183.jmp 143 Set Alpha Level command 51. 64 setting titles windows and reports 80 Shewhart Control Charts 124–133 Shirts. error 226 Standardize command 38 standardized values 38 star points 215 starting JMP INTRO 12 statistical tests 34 Statistics 171 Statistics button 181 Std Dev Lines command 62. 65 Std Dev Lines option 48 Std Err Bars command 31 Std Error 152 . 221 sparsity. effect 199. Grouped by fixed subgroup size 40 Index sigma 39–40 Sigma Quality 41 sigma quality level (power and sample size) 236 signed-rank test 35 significance probability 167 stepwise regression 99 simple exponential smoothing 157 single-sample means (power and sample sizes) 228 Size/Scale commands 184 Smoothing Model dialog 156 smoothing models 143.Index JMP-SE 247 shortest half 32 Show Center Line 127 Show Confidence Interval 153–154 Show Correlations 165 Show Histogram 165 Show Points 149. 221 Sort Right to Left 196. 210. 210. 180 Separate Axes command 185–186 Seriesg. 18. 30 testing 35 standard deviation. 186 prospective power analysis 226 screening designs 189 Sand Blasted? column 21 Save 40 Save Centered command 65 Save Columns 154 Save commands 37 Save Normal Quantiles command 65 Save Predicted Values 96 Save Script 38 Save Script for All Objects command 66 Save Script to Data Table command 66 Save Script to Report command 66 Save Script to Script Window command 66 Save Standardized command 65 SBC 151 scaling axial 220 designs 220 scatterplot 45–46. 153–154 Show Points command 58. 185–186 scripts animation 228 generating the analysis model Model script See Model table property scripting See JSL Scroller tool 81 seasonal exponential smoothing 158 seasonal smoothing weight 156 Select Columns list 86 select rows in data table 161 selecting and marking points 48 selecting report items 18. 58 Scatterplot Matrix 164 scatterplot matrix 163 S-Chart 125 Schwartz’s Bayesian Criterion 151 score confidence intervals 37 screening designs 199 design types 204 Script 38 Script submenu 66. 180 selection tool 18.jmp 132 Short Term. 205–206 Spearman’s Rho 169 Spearman’s Rho 168 Spec Limits 40 Specified Sigma 40 specifying hypothesized means 25 Split command 71 Split command selecting rows 71 SS 104 SSE 104 Stable 151 Stable Invertible 156 Stack 38 Standard Deviation 151 standard deviation 16. Grouped by Column 40 Short Term.

248 Index JMP-SE Std Error of Individual 97 Std Error of Predicted 96 Std Error of Residual 97 StdErr Prob 34 Stem and Leaf command 32 Step 101. 89 sums of squares 89 surface designs See response surface designs T t Ratio 152 t statistic 35 t test 25. and nominal 29 . 103 Step command 186 Step History table 102 stepwise regression 100 Control panel 102–103 Stop 103 Studentized Residuals 96 Sum of Squared Errors 151 Summary of Fit table 59. 35–36. 46–47 report 46 two sample 46–47 Tables menu Split command 71 Tables tab 13 Tables toolbar 14 Tag Line option 28 Target 39 target values 202 Term 152 Test Mean commands 148–150 example 143–148 smoothing models 155–159 Time Series Plot 148 Time Series role 147 titles setting in windows and reports 80 toolbars 13 showing and hiding 14 Tools toolbar 13–14 trade-off in screening designs 204 trend 156 Try Fixed Width 179 Tukey-Kramer HSD 62 tutorial examples DOE 199–201 full factorial designs 189 multiple regression 99–105 response surface designs 216–223 time series 143–148 two-level categorical 200 two-level fractional factorials 204 two-level full factorials 204 two-sample and one-sample means 226. ordinal. 36 table 36 Test Std Dev command 35 testing a mean 24 testing for independence 55 testing probabilities 26 scaling estimated values 27 Tests command 67 Thread Wear column 21 Thread Wear Measured column 21 Time ID role 147 Time Series 143 Time Series Graph 149 Time Series platform 143–159 ARIMA 154–155 uniform precision designs 220 Uniform Scaling 38 Uniform Y Scale 186 Univariate 163 Upper Spec Limit 39 Use Median 127 User Defined option 220 using histograms 22 V values target 202 Van der Waerden 64 variables categorical 29 continuous. 230 two-sample proportion (power and sample size) 233 two-way contingency table 54 U u-Chart 131 Unconstrained 156–157 UnEqual Variances command 64 Ungroup Plots 186 command 24. 35 dialog 25 Test Probabilities command 27.

84 windows setting titles 80 Winter’s method 159 With Best. 26. 35 Window menu 23. Columns button 22 Z statistics 43 z test 25. 35 Zero To One 156 . 82. 79–80. Dunnett’s command 62 word processing program with cut and paste 28.jmp 130 Weibull 33 Weight button in Fit Model 85 weight. importance 195.Index JMP-SE 249 Index modeling type 29 standardized values 65 Variance Estimate 151 variance of prediction 220 variances equality in t test 47 Vertical command 185 W-Z Washers. Hsu’s MCB command 62 With Control. 202 Welch ANOVA 64 Westgard Rules 122 Where 38 whiskers 31 whole model 92 Whole Model Test table 68 Wilcoxon rank scores 64 Wilcoxon signed-rank test 25. 180 X role 147 X-Axis Proportional command 66 XBar Chart 125 Y button in Fit Model 85 Y Options command 185 Y role 147 Y.

.

PRODUCT LIABILITY OR OTHERWISE WILL BE LIMITED TO $50. IN NO EVENT WILL SAS INSTITUTE INC. WHETHER IN AN ACTION OF CONTRACT. THE EXCLUSION OF IMPLIED WARRANTIES IS NOT PERMITTED BY SOME STATES. BUSINESS INTERRUPTION. . ARISING OUT OF OR IN CONNECTION WITH THE USE OR PERFORMANCE OF THIS SOFTWARE. NEGLIGENCE OR OTHER TORTIOUS ACTION. INDIRECT OR CONSEQUENTIAL DAMAGES OR ANY DAMAGES WHATSOEVER RESULTING FROM LOSS OF USE. DATA OR PROFITS. GUARANTEE OR MAKE ANY REPRESENTATIONS REGARDING THE USE OR THE RESULTS OF THE USE OF THE SOFTWARE IN TERMS OF ITS CORRECTNESS. ARISING OUT OF OR IN CONNECTION WITH THE USE OR PERFORMANCE OF THIS SOFTWARE. AND REGARDLESS OF THE FORM OF THE ACTION (WHETHER IN CONTRACT. EMPLOYEES OR AGENTS (COLLECTIVELY SAS INSTITUTE INC. INCIDENTAL OR INDIRECT DAMAGES (INCLUDING DAMAGES FOR LOSS OF BUSINESS PROFITS. IN NO EVENT SHALL KEITH PACKARD BE LIABLE FOR ANY SPECIAL. TORT (INCLUDING NEGLIGENCE). INDIRECT OR CONSEQUENTIAL DAMAGES OR ANY DAMAGES WHATSOEVER RESULTING FROM LOSS OF USE. Scintilla is Copyright 1998-2003 by Neil Hodgson <neilh@scintilla. THE ABOVE EXCLUSION MAY NOT APPLY TO YOU.’S LICENSOR) BE LIABLE TO YOU FOR ANY CONSEQUENTIAL. AND THE LIKE) ARISING OUT OF THE USE OR INABILITY TO USE THE SOFTWARE EVEN IF SAS INSTITUTE INC. INCLUDING ALL IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS. XRender is Copyright © 2002 Keith Packard. NEIL HODGSON DISCLAIMS ALL WARRANTIES WITH REGARD TO THIS SOFTWARE. Inc. SAS INSTITUTE INC. SAS INSTITUTE INC.’S LICENSORS AND THEIR DIRECTORS. SAS INSTITUTE INC. WHETHER IN AN ACTION OF CONTRACT. THE ABOVE LIMITATIONS MAY NOT APPLY TO YOU.org>. CURRENTNESS OR OTHERWISE. THE ENTIRE RISK AS TO THE RESULTS AND PERFORMANCE OF THE SOFTWARE IS ASSUMED BY YOU.’S LICENSOR’S HAS BEEN ADVISED OF THE POSSIBILITY OF SUCH DAMAGES.00.’S LICENSORS DO NOT WARRANT. INCLUDING ALL IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS.Notices Technology License Notices The ImageMan DLL is used with permission of Data Techniques. BECAUSE SOME STATES DO NOT ALLOW THE EXCLUSION OR LIMITATION OF LIABILITY FOR CONSEQUENTIAL OR INCIDENTAL DAMAGES. REGARDING THE SOFTWARE. IN NO EVENT SHALL NEIL HODGSON BE LIABLE FOR ANY SPECIAL. NEGLIGENCE OR OTHER TORTIOUS ACTION. EXPRESS OR IMPLIED.’S LICENSORS MAKE NO WARRANTIES. RELIABILITY. INCLUDING WITHOUT LIMITATION THE IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE. KEITH PACKARD DISCLAIMS ALL WARRANTIES WITH REGARD TO THIS SOFTWARE. ACCURACY. OFFICERS. DATA OR PROFITS. LOSS OF BUSINESS INFORMATION.’S LICENSOR’S LIABILITY TO YOU FOR ACTUAL DAMAGES FOR ANY CAUSE WHATSOEVER.

Sign up to vote on this title
UsefulNot useful