## Are you sure?

This action might not be possible to undo. Are you sure you want to continue?

Using JMP® Student Edition

For Windows and Macintosh

The User’s Guide to Statistics with JMP® Student Edition

The correct bibliographic citation for this manual is as follows: SAS Institute Inc. 2009. Using JMP® Student Edition. Cary, NC: SAS Institute Inc.

Using JMP® Student Edition

Copyright © 2009, SAS Institute Inc., Cary, NC, USA ISBN 978-1-60764-190-2 All rights reserved. Produced in the United States of America. For a hard-copy book: No part of this publication may be reproduced, stored in a retrieval system, or transmitted, in any form or by any means, electronic, mechanical, photocopying, or otherwise, without the prior written permission of the publisher, SAS Institute Inc. For a Web download or e-book: Your use of this publication shall be governed by the terms established by the vendor at the time you acquire this publication. U.S. Government Restricted Rights Notice: Use, duplication, or disclosure of this software and related documentation by the U.S. government is subject to the Agreement with SAS Institute and the restrictions set forth in FAR 52.227-19, Commercial Computer Software-Restricted Rights (June 1987). SAS Institute Inc., SAS Campus Drive, Cary, North Carolina 27513. 1st printing, April 2009 SAS® Publishing provides a complete selection of books and electronic products to help customers use SAS software to its fullest potential. For more information about our e-books, e-learning products, CDs, and hard-copy books, visit the SAS Publishing Web site at support.sas.com/publishing or call 1-800-727-3228. SAS® and all other SAS Institute Inc. product or service names are registered trademarks or trademarks of SAS Institute Inc. in the USA and other countries. ® indicates USA registration. Other brand and product names are registered trademarks or trademarks of their respective companies. For more information about this or any other JMP product, contact:

SAS INSTITUTE INC. SAS Campus Drive Cary, NC 27513 USA

**For permission to use this work, contact us:
**

www.jmp.com/se fax: 919.677.4444 phone: 919.677.8000

Technical Support is provided by the publisher of the work that JMP-SE is bundled with, and to registered instructors.

Table of Contents

1 Getting Started with JMP Student Edition

Prerequisites For This Book . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Computer and Operating System . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Statistics . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Learning JMP Student Edition with its Documentation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Conventions Used in This Book . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Starting JMP Student Edition . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . JMP Student Edition Toolbars . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Windows . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Macintosh . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . First Session . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Finding Means, Medians, and Standard Deviations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Where to Go from Here . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11 11 11 11 12 12 13 13 15 15 16 19

2

**The Distribution Platform
**

Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . About the Data . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Launching the Platform . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Using Histograms . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Testing a Mean . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Normality . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Testing Probabilities . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Annotating Results . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . The Modeling Type of Variables . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Continuous Variable Graphs and Reports . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Display Options . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Histogram Options . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Normal Quantile Plots . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Outlier Box Plot . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Quantile Box Plots . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Stem and Leaf Plots . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . CDF Plot . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Fit Distribution . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Categorical Variable Graphs and Reports . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Statistical Tests . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Testing a Mean . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 21 21 22 22 24 25 26 28 29 30 30 30 31 31 32 32 33 33 34 34 35

Testing a Standard Deviation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Testing Categorical Probabilities . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Confidence Intervals . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Saving Information . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Whole-Platform Options . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Capability Analysis . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

35 36 36 37 38 39

3

**The Fit Y by X Platform
**

Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Launching the Platform . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Computing a t-test . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Pooled t test . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Selecting and Marking Points . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Analysis of Variance (ANOVA) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Comparison Circles . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Fitting Lines . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Correlation Coefficient . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Disclosure Icon . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Residuals . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Two-Way Contingency Tables . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Logistic Regression . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . The Formula Editor . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Scatterplots—The Continuous by Continuous Case . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . The Summary of Fit Table . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . The Lack of Fit Table . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Analysis of Variance Table . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Parameter Estimates Table . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Other Fitting Commands . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . One Way ANOVA—The Continuous by Categorical Case . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Script Submenu . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Contingency Analysis—The Categorical by Categorical Case . . . . . . . . . . . . . . . . . . . . . . . . . . . . Logistic Regression—The Categorical by Continuous Case . . . . . . . . . . . . . . . . . . . . . . . . . . . . . The Whole Model Test Table . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Platform Options . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 45 45 46 47 48 50 50 51 53 53 53 54 55 56 58 59 60 60 60 61 61 66 66 67 68 69

4

**The Matched Pairs Platform
**

Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 71 Preparing the Data . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 71 Launching the Platform . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 72

. . . . . . . . 111 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 110 Process Information . . . . . . Fitting Personalities . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .The Matched Pairs Launch Dialog . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Regression Reports . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Model Effects . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 104 All Possible Regressions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . The Fit Model Dialog Box . . . . . . . . . . . . . . . . . . Re-running an Analysis . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Roles . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Leverage Plots . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Emphasis Choices . . . . . . . . . . . . . . . . . . . 79 79 80 81 82 82 84 85 85 86 86 87 88 88 88 89 91 93 93 95 96 6 Stepwise Regression Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Save Commands . . . . . . . . . . . . . . . . . . . . . . . . . Launching the Platform . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Row Diagnostics . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Linear Contrasts . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 102 Stepwise Regression Control Panel . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 104 Make Model . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 105 7 Control Charts Introduction . . . . . . . . . . . . . . 107 The Control Chart Launch Dialog . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 73 The Matched Pairs Scatterplot . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Run Model . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 102 Current Estimates Table . . . . . . . . . . . . . . . . . . 99 The Stepwise Window . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Fit Model Report Items . . Macros . . . . . . . . . . . . . . . . . . . . . . . 75 5 The Fit Model Platform Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Setting Titles . . . . . . . . . . Examining Results . . . . . . . . . . . . . . . . . . . Exploring the Estimates . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Least Squares Means . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 73 Interpreting the Matched Pairs Plot . . Effect Details . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 103 Step History Table . . . . . . . . .

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 113 114 114 115 115 116 118 119 120 122 124 124 125 125 126 127 129 130 131 132 133 133 135 136 137 8 Time Series Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Cusum Chart Options . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Graph . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Parameter Estimates Table . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Westgard Rules . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Run Charts . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Autocorrelation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . XBar-. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Model Comparison Table . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Shewhart Control Charts for Variables . . . . . . . . . . . . . c-Charts . Time Series Commands . . . . . . . . . . . . . . . . . . . .and np-Charts . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Tailoring the Horizontal Axis . . . . . . . . . . . . . . . . . . . . . . Model Summary Table . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . and Deleted Samples . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Western Electric Rules . . . . . . . . . . . . . Parameters . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . The Time Series Platform . . . . . . . . . . . . . . . . . . . . Individual Measurement Charts . . . . . . . . . . . . . . . . . . . Number of Forecast Periods . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . R-. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Shewhart Control Charts . . . . . . . . . . . . . . . . . . . . . . . . . . . . Modeling Reports . . . . . . . . . . . . . . . . . . . . . Using Specified Statistics . . . . . . .Charts . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Tests for Special Causes . . . . . . . . . . . . . . . . . Example . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . u-Charts . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Shewhart Control Charts for Attributes . . . . . . . . . . . . . . . . . . . . . . Cumulative Sum (Cusum) Charts . . . . . . . . The Time Series Graph . . . . . . . . . . . . Window Options . . . . . . . . . . . . . . . . . . . . . . . . Partial Autocorrelation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Hidden. . . . . . . . . . . . . . . . Single Chart Options . . . p. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Display Options . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . and S. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 143 147 147 148 149 149 149 150 150 151 151 152 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Launch Options for Cusum Charts . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .Chart Type Information . . . . . . . . . . . . . Excluded. . . . . . . . . . . . . . Phases . . . . . . . . . . . . . . . . . . . . . .

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Importing Text Files . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Correlations Multivariate . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Covariance Matrix . . . . . . . Smoothing Models . ARIMA Model . . . . . . . . . . Macintosh . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Launch the Platform and Select Options . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Importing Data . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Seasonal Exponential Smoothing . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Simple Exponential Smoothing . . . . . . . . . . . . . . . . . . . . . . . . 161 163 163 163 164 166 167 167 168 169 169 169 170 10 Importing. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Windows . . . . . . . . . . . . . . . . . . . . . . . . . . Using the Overlay Plot Platform . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Results from Platforms . . . . . . . . . . . . . . . . . . . . . . Using the Chart Platform . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Model Report Options . . . . . . . . . Pairwise Correlations . . . . . . . . . . . . . . . . . . . . . . Double (Brown) Exponential Smoothing . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Smoothing Model Dialog . . . . . . . . . . . . . . . . . . . . . . . . Linear (Holt) Exponential Smoothing . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Residuals . . . 153 153 153 154 154 155 156 157 157 158 158 158 159 9 Correlations and Multivariate Techniques Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Simple Statistics . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Nonparametric Measures of Association . . . . . . . . . . . . . . . . . . . . . Nonparametric Correlations . . . . . . . . . . . . . . . . . . . Computations and Statistical Details . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Importing Microsoft Excel Files . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .Forecast Plot . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Damped-Trend Linear Exponential Smoothing . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Inverse Correlation Matrix . . . . . . . Iteration History . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Pearson Product-Moment Correlation . . . . . . . . . . . . . . . 171 171 173 175 175 176 177 179 180 . . . . . . . . . . . . . . . . . . Scatterplot Matrix . . . . . . . . . . . . . . . . . . . Winters Method (Additive) . . . . . . . . . . . . Inverse Correlations and Partial Correlations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . and Charting Data Introduction . . . . . . . . . . . . . . . . . . . . Exporting. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Entering Factors . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Entering Responses . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Continuing the Analysis . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . The Overlay Plot Platform . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Viewing the Design Table . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 181 182 183 184 185 185 186 186 11 Full Factorial Designs Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 216 218 219 219 220 221 221 222 14 Prospective Power and Sample Size Prospective Power Analysis . . . . . . . . . . . . . . . . . . . . . . . . Single-Chart Options . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Entering Responses and Factors . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Platform Options . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Level Options . . . . . . . . . . . . . . . . . . . . 199 201 202 203 203 207 210 211 211 13 Response Surface Designs Introduction . . . . . . . . . . . . 189 195 195 196 197 12 Screening Designs Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Displaying and Modifying the Design . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 225 One-Sample and Two-Sample Means . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Platform Options . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Creating a Response Surface Design . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Selecting Output Options . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Specifying Output Options . . . Creating a Screening Design . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Choosing a Design . . . . . . . . . . Choosing a Design . . . . . . . Specifying Axial Value (Central Composite Designs Only) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .The Chart Platform . . . . . . Creating a Factorial Design . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Single-Plot Options . . . . . . . . . . . . . . . Continuing the Analysis. Specifying Output Options . . . . . If Needed . . . . . . . . . . . . . . . . . . . . . . . . Making the Table . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Entering Responses and Factors . . . 226 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Frame Options . . . . . . . . . . . . . . . . . . Viewing the Table . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

. . . . . . . . . k-Sample Means . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 228 229 230 231 232 233 235 236 Index JMP-SE . . . . . . . . . . . . . . . . . . . . . . . . . . 239 Notices Technology License Notices . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 251 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Power and Sample Size Animation for a Single Sample . . . . . . . . . . . . . . . . . One-Sample Variance . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Sigma Quality Level . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .Single-Sample Mean . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Counts per Unit . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Two-Sample Means . . . . . . . . . . . . . . . . . . . . . . . . . . . . . One-Sample and Two-Sample Proportions . . . . . . . . . .

.

familiarity with standard computer operations and operating system terminology is assumed. graphs. and commands. select the help tool (see Figure 1. This book shows how to accomplish simple statistical tasks. It can be read like a book. Learning JMP Student Edition with its Documentation JMP Student Edition includes an extensive online help system. since it contains a complete table of contents. JMP Student Edition is easy to learn and easy to use. like those in all introductory statistics texts. JMP Student Edition opens help specific to the item you clicked. minimal knowledge about computers and statistics is necessary. closing. or it can be used to search for a specific topic. point-and-click format. allowing for interactive exploration of patterns and outliers. We hope that this visualization makes learning statistics more fun and easier than it has ever been before. All of the statistics are accessible in a familiar. standard menus. whenever they present themselves.1) and click anywhere inside a data table or report. JMP Student Edition is equipped with context-sensitive help. all the data tables. and charts are dynamically linked together. See the reference books for the operating system and computer for more information on these topics. and the statistical concepts are supported with both graphs and appropriate numerical results. especially use of the mouse. Prerequisites For This Book To use JMP Student Edition.1 Getting Started with JMP Student Edition Welcome to JMP Student Edition—the version of SAS Institute’s award-winning JMP Statistical Discovery software tailor-made for the introductory statistics student. . In addition. Knowledge of opening. To use it. it requires no formal statistics knowledge. Statistics Since JMP Student Edition is specially-made for the beginning statistics student. The specific prerequisites are as follows: Computer and Operating System In this manual. and saving files should also exist before reading this guide. In addition.

and report titles are usually set by JMP Student Edition and are not alterable by the user.2). • The notation File > Open means to select the Open command from the File menu. called the JMP Starter (see Figure 1. or other unique features. • New or important words are emphasized. • Variables under study are arranged in columns in the data spreadsheet.1 The JMP Student Edition Help Tool Help Tool Conventions Used in This Book Throughout this manual. so the words variable and column are often used interchangeably. Each “Introduction” section explores the Denim. special typefaces are used to designate commands. • File names are opened and saved to disk or network folders. Select View > JMP Starter. buttons. They are designated by a mouse on the left. • Sections titled “Introduction” provide a hands-on approach to learning the basics of JMP Student Edition. which is intended primarily as a reference. menu items. They are separate from the rest of the material in the chapter. all the “Introduction” sections could be read from each chapter before reading the rest of the material in the book.12 1 Getting Started with JMP Student Edition Conventions Used in This Book Figure 1. Certain paragraphs are meant to be carried out while reading the text.jmp sample data set using a specified platform or function. By default. In fact. JMP Student Edition begins by opening a special navigation window. Starting JMP Student Edition JMP Student Edition can be started in two ways: • Double-click the JMP Student Edition icon • Double-click a JMP Student Edition data set or script. . • Menu items. If the JMP Starter is not automatically opened.

3). organized way.1 Getting Started with JMP Student Edition JMP Student Edition Toolbars 13 1 Introduction to JMP Student Edition Figure 1. • The Tables tab contains commands used to manipulate data tables. There are nine tabs groups that partition the commands based on their function: • The File group contains commands related to opening and closing several types of files. the Tools toolbar (Figure 1.5). Windows Toolbars that duplicate the JMP Starter’s commands include the File/Edit toolbar (Figure 1. the Analyze toolbar (Figure 1. • The Graph group contains commands for charts and 3D graphics.4). they are also presented in the JMP Starter in a logical. • The Survival group contains reliability and survival commands. the Graph toolbar (Figure 1. • The Basic group contains commands that perform analyses for one-variable and two-variable situations. JMP Student Edition Toolbars An alternative way of accessing JMP Student Edition commands is by using toolbars. • The DOE group contains commands for designing an experiment. and the . • The Measure group contains tools for capability analysis.6). • The Control group contains commands for control charts. • The Model group contains commands for matched pairs (a special two-variable situation) and multivariate models. Although these commands are accessible through menus and toolbars.2 The JMP Starter Window This window provides quick and easy access to all the menu commands of JMP Student Edition.

7 The Tables Toolbar Subset Summary Stack Sort Split Some of these toolbars are not displayed by default.6 The Graph Toolbar Overlay Pareto Plot Bar and Plot Pie Spinning Charts Plot Figure 1.14 1 Getting Started with JMP Student Edition JMP Student Edition Toolbars Tables toolbar (Figure 1.4 The Tools Toolbar CrossArrow Hand Simple hair Lines Shapes Help Brush Selection Zoom Polygon Scroller Lasso Annotate Figure 1. as well as user-customizable toolbars.5 The Analyze Toolbar Distribution Survival/ Fit Reliability Fit Y By X Model Multivariate Matched Time Series Pairs Figure 1. There is also a Data Files toolbar. used to switch between open data tables. To activate toolbars that are not showing. Figure 1. • Select View > Toolbars > Show Toolbars to open the Show Toolbars window (Figure 1.7).8) .3 The File/Edit Toolbar Run New Script Cut Data Open Copy Save Table New Paste Print Script Figure 1. Each of these commands is explained fully in later chapters.

or click the Open button on the File/Edit toolbar. select Open Data Table from the JMP Starter.8 Show Toolbars Window Macintosh On the Macintosh. To see the definitions of each button on the toolbar. To open a data table. From the window that appears. The data should appear like the listing in Figure 1. or to add and subtract buttons from the toolbar. and closing a data table. requesting an analysis.jmp and click Open. Those that are unchecked are hidden. First Session This section is a guide through a few simple steps that demonstrate opening a data table. Figure 1. drag buttons onto the toolbar to add them.1 Getting Started with JMP Student Edition First Session 15 1 Introduction to JMP Student Edition Toolbars that are checked become visible. select File > Open. Control-click on the toolbar area of a window. toolbars are not set in groups.9. Select the file Denim. . but are all available to be added to a single toolbar.

16

1 Getting Started with JMP Student Edition First Session

Figure 1.9 Partial Listing of the Denim Data File

This data set contains data on the starch content of processed denim. In this example, we examine the data for the Starch Content (%) variable and answer the following questions: • What is the mean of the data? • What is its median? • What is its standard deviation? • Also, produce a histogram of the data.

**Finding Means, Medians, and Standard Deviations
**

To answer these questions, use the Distribution platform. Select Analyze > Distribution. This brings up the launch dialog as seen in Figure 1.10.

1 Getting Started with JMP Student Edition First Session

17 1 Introduction to JMP Student Edition

Figure 1.10 The Distribution Dialog

Select the variable Starch Content (%), then click the Y, Columns button. This step tells JMP Student Edition the variable to analyze. Since Starch Content (%) is the only variable of interest, we are finished with this dialog. Click OK. The report is presented in its default vertical format. However, some people prefer a horizontal layout for the Distribution report. To change the layout to horizontal, Click on the red triangle next to the word Starch Content (%) in the report (see Figure 1.11). Figure 1.11 Red Triangles Reveal Popup Menus

Popup Menus

All of these red triangles reveal popup menus when they are clicked. Watch closely for them—they reveal further options and explorations available during the data exploration process. The menu next to Starch Content (%) shows the options for this single variable, although there are cases (seen later) where the Distribution platform operates on several variables. Options available to all the variables in the report are in the menu next to the word Distributions. Select Display Options > Horizontal Layout to see the report in Figure 1.12.

18

1 Getting Started with JMP Student Edition First Session

Figure 1.12 The Starch Content Distribution Report

The answers to the four questions are all in this report. Read off the mean (25.516634), the median (24.349), and the standard deviation (9.6568876). The histogram is shown on the left. If a printed copy of this report is needed, Select File > Print. Alternatively, this output may be included in a lab report written using a word processor. To move the report into another program, use the cut and paste features of JMP Student Edition: Select the Selection tool, which looks like a fat plus.

Selection tool

Hold down the Shift key and click on each part of the report that needs to be copied. In Figure 1.13, all the text columns and the histogram have been selected. None of the headings have been, nor has the box plot. Note that the histogram’s axis is selected separately from the histogram itself. Figure 1.13 Selection of Report Parts

Select Edit > Copy.

1 Getting Started with JMP Student Edition First Session

19 1 Introduction to JMP Student Edition

In the word processor, select Edit > Paste. Now that the analysis is completed, close JMP Student Edition. Select File > Exit.

**Where to Go from Here
**

This simple example has shown all the steps needed to complete a JMP Student Edition analysis. From here, feel free to explore any of the sample data files that came with JMP Student Edition, explore the online help, or continue reading this book.

15.” p.jmp. some jeans are sand blasted. the time it takes for some of the starch present in the jeans to wear away and wash out. two measurements were taken: one to quantify the starch content of the fabric (measuring stiffness. see “First Session.2 The Distribution Platform Single-variable statistics are the domain of JMP Student Edition’s Distribution platform. after the initial washing process. Introduction Open the data file Denim. Whether or not the fabric was sand blasted is recorded in the Sand Blasted? column. The samples came from several different rolls of fabric. denim manufacturers subject the fabric to a variety of treatments to remove some of the starch. The measured thread wear has been converted into an ordinal variable in the Thread Wear column by using the Formula Editor. and is referred to in each introductory section of this book. However. For information on opening a file. . In addition. they usually contain a fair amount of starch. • Pumice Stone are a physical abrasive that is added to the wash water that literally pounds the starch out. creating stiffness and stability in the fabric. and computes hypothesis tests for these variables. recorded as a percentage of weight) and one as a count of destroyed threads (measuring wear-and-tear. most people find this stiffness undesirable—in fact. About the Data This file contains information from an experiment with blue jeans. some customers say that jeans have a “breaking in” period before they become truly comfortable. In an effort to minimize the amount of time needed to break in a new pair of jeans. • Caustic Soda is a chemical dissolved in the wash water that chemically destroys the starch. This breaking in period is. This experiment used three such treatments in differently-sized wash loads. are as follows: • Alpha Amalyze is an enzyme added to the wash water that eats the starch. It calculates summary statistics. These abrasive pebbles are the source of the so-called stone-washed jeans. When blue jeans are manufactured. After treating the jeans. with each roll identified in the Lot Number column. displays graphs. The three different treatments. recorded in the Thread Wear Measured column). in actuality. recorded in the Method column.

9 on page 29. Select the hand tool (Figure 2. and clicking on the second variable name.” p. Click the Y.2 Hand Tool Hand Tool Position the hand tool over the Starch Content (%) histogram and press the mouse button. and are seen in Figure 2. 30 and “Categorical Variable Graphs and Reports. Move the mouse horizontally (assuming the histogram is in its default vertical layout) to change the bar widths of the histogram. Two of the histograms from this report are used later in this chapter. suppose the bar widths and positions of the Starch Content (%) histogram need modifying. . This brings up the Distribution platform launch dialog. Click OK. For details on these reports. To change them. it is desirable to change these settings. Many descriptive statistics can be read directly off the text reports accompanying these histograms.” p. Histograms and textual information on all the variables now appear. the variables are examined one at a time.2) from the Tools toolbar.22 2 The Distribution Platform Introduction Launching the Platform In this example. Figure 2. Sometimes. Figure 2. For example. Using Histograms Histograms appear with bar widths and positions calculated internally by JMP Student Edition.1 Distribution Launch Dialog Select the variables Method and Starch Content (%) from the list on the left by clicking on the first variable name. Columns button. 34. see “Continuous Variable Graphs and Reports. holding down the Control (Windows) or Option (Macintosh) key. Select Analyze > Distribution from the menu bar.

To bring the Distribution report to the front. Notice that the corresponding rows in the data table are also highlighted. or labels—that persist in all of JMP Student Edition’s active plots. select Window > Denim-Distribution. Whenever a row is selected in any plot. For example. Try to determine if one of the methods results in lower starch content than the others. It is often useful to have confidence intervals on the means or levels in these histograms. These histograms are also useful in looking at some relationships among the variables. as are the bars in the other histograms for all the data points that have Alpha Amalyze as their method. a 95% confidence interval on the levels of Method and Starch Content (%).2 The Distribution Platform Introduction 23 2 Distribution: Single-Variable Statistics Move the mouse vertically to change the position of the bars. . To get. Click on the histogram corresponding to Alpha Amalyze in the Method histogram. paying attention to the starch content that gets highlighted with each one. select Window > Denim. its selection status ripples through all of JMP Student Edition’s open windows. Highlight and explore the other wash methods. Click in the histogram bars for Caustic Soda and Pumice Stone. Look at the corresponding points that are highlighted in the Starch Content (%) histogram. for example. The bar for Alpha Amalyze is highlighted. Data rows are highlighted in the data table so that they can be assigned row states—specific markers. To bring the data table to the front. colors.

Bring up the Distribution launch dialog by again selecting Analyze > Distribution from the menu bar. select Test Mean from the drop-down list next to Starch Content (%). • Make separate histograms for each of the three levels of the Method variable. To accomplish these two steps. Figure 2. Click OK.24 2 The Distribution Platform Introduction Select Confidence Interval > . Select Method in the list of variables and click the By button.95 from the drop-down menu next to the variable names in the histograms’ title bar. with the corresponding level indicated in the title bar of the histogram.3 Test Mean . two steps are required. To test that the mean of the Alpha Amalyze denim has a mean of 20%. Columns button. suppose that prior research claims that the mean starch content of Alpha Amalyze-washed denim is 20%. • Test the mean using the Alpha Amalyze histogram. Testing a Mean Continuing the analysis. Select Starch Content (%) in the list of variables and click the Y. In the Method=Alpha Amalyze section. Three histograms should appear.

the t-test is two tailed. leave the box for the Wilcoxon Signed-Rank test unchecked. This is a nonparametric test that is not usually covered in an introductory course. examine the p-value listed beside Prob > |t|.” p. Click OK. 31.2 The Distribution Platform Introduction 25 2 Distribution: Single-Variable Statistics Since the hypothesized mean is 20%. Figure 2. Select Normal Quantile Plot from the drop-down list next to one of the variable’s name. To produce a Normal Quantile Plot. which in this case is a non-significant 0. a z-test would be performed. Although there are usually more important things to worry about than the exact normality of the data.5 Normal Quantile Plot .4 Test Mean Dialog The true standard deviation is not known. This tells JMP Student Edition to compute a t-test of the mean. Figure 2. The online help contains further information on these topics. since the percentage could be higher or lower than 20%. Normality Many statistical tests make an assumption that the data is approximately normally distributed. Also. Therefore. JMP Student Edition provides a quick way of assessing normality through the Normal Quantile Plot. In this case. The results of the test are appended to the Distribution report. Complete details of the Normal Quantile Plot are in the section “Normal Quantile Plots. Type 20 in the entry field for Specify Hypothesized Mean. so leave the other entry field blank. If the standard deviation had been known and entered.5740.

select Goodness of Fit from the fitted distribution report. . then performing a goodness-of-fit test. in fact. Normal. When the report appears. Many times. a command needs to be sent to all the variables in the report. along with a goodness-of-fit statistic testing the null hypothesis that the distribution is. yet it is tedious to select the same command many times. Make sure the original Denim data table is the front window. To test this assumption. a Normal Quantile plot is appended to every histogram. Select Fit Distribution > Normal from the platform drop-down list. using the Control (Windows) or (Macintosh) key. This shortcut works for most commands in drop-down menus. select Window > Denim. Small p-values indicate a non-normal distribution. JMP Student Edition therefore provides a way to “broadcast” a command throughout a report. Hold down the Control (Windows) or (Macintosh) key and again select Normal Quantile Plot from the drop-down list next to the variable’s name.26 2 The Distribution Platform Introduction Scroll down the report to see that this command only added a Normal Quantile Plot for one variable in the report.jmp. Testing Probabilities Another question that could be asked about this data is whether the three levels of Thread Wear occur with equal frequency. This produces a report showing the parameters of the distribution. You can also test for Normality by fitting a Normal distribution. If not. a distribution of the Thread Wear variable is necessary. This time.

Make sure Fix omitted at estimated values. select Test Probabilities from the drop-down list next to Thread Wear in the title bar of the histogram. After the histogram appears. Figure 2. A screen shot of the addition appears later in this chapter. Enter 1 into each Hypoth Prob entry field in the Test Probabilities section of the report.2 The Distribution Platform Introduction 27 2 Distribution: Single-Variable Statistics Request a Distribution of Thread Wear (not Thread Wear Measured). An addition to the report appears. Click Done. in Figure 2. rescale hypothesis is selected. JMP Student Edition automatically scales the numbers entered into the entry fields so that they sum to one.6 Test Probabilities Results .16 on page 36. This allows an easy way to test for equal probabilities—simply enter 1 in each entry field.

shown to the right. Click and drag the corner to resize the annotation. The annotation turns yellow.7. or to turn in as a homework assignment). Right-click in the yellow annotation to bring up a formatting menu. The initial click point is used in the Tag Line option below. . so it is safe to say that the three levels occur with different probabilities. Type “These results are highly significant” in the box that appears.001). Figure 2.28 2 The Distribution Platform Introduction The results of the test are listed in the column labeled Prob>Chisq. Click and drag inside the annotation box to reposition it with the tag line anchored. To add comments to any JMP Student Edition output. use the Annotate tool from the Tools toolbar. there may be some annotation necessary. This test shows some highly significant results (p<0. as shown in Figure 2. Annotating Results If these results are to be cut and pasted into a word processing program (for a lab report. Select Tag Line. Click next to the top “0. For example. Control-click (Windows) or together. Figure 2. Click somewhere outside the annotation box.8.7 The Annotate Tool Annotate Tool Select the Annotate tool.8 Final Annotation -click (Macintosh) and drag to move the annotation and tag line The final annotation is shown in Figure 2. Move the mouse over the lower right corner until the cursor turns into a double arrow.3333” in the blank space to the right of the Test Probabilities table. you may wish to annotate the tested probabilities from above to state that the results of the test were significant.

temperature measurements are often on a continuous scale.jmp data set. choose Edit > Paste Special and choose one of the graphic formats. The modeling type of a variable determines what analyses JMP Student Edition performs.9 Distribution Graphs Histograms Outlier Box Plot • Categorical variables show a histogram. are represented as a nominal variable in JMP Student Edition.461 years. In the Denim. There is no order in the scale. For example. and “severe”. Thread Wear. For example. Variables can have one of three modeling types: • Continuous variables are numeric and measured on a continuous scale. The Modeling Type of Variables JMP Student Edition bases its reports on the modeling type of the variables it analyzes. The Distribution platform shows exactly this behavior. • Ordinal variables are measured on a discrete scale. for example. In the example data set Denim. but appends other graphs based on variable types.2 The Distribution Platform The Modeling Type of Variables 29 2 Distribution: Single-Variable Statistics Annotations are not preserved in standard (RTF) cut and paste. There is an implicit order in the measuring scale. To paste a report with annotations into another document. “moderate”. Figure 2. Identical platforms often result in different reports and graphs because the variables analyzed were of differing types. • Nominal variables simply name data. with values “low”. Method is a nominal variable in the Denim. is an ordinal variable. the variable Size of Load is a continuous variable. . yet it is obvious that some ages are older than others. although the data are not necessarily numerical.jmp data set. Ordinal and Nominal variables are often referred to collectively as categorical variables. limited only by the exactness of the measuring instrument. the age of people is often recorded on an ordinal scale — seldom do people report their age as 24. It produces histograms in any case.jmp. People’s names. but are instead combined within their respective reports as a single graphic.

Outlier plots are discussed in the section “Outlier Box Plot. • Moments shows or hides the Moments table. such as the mean. This table displays the mean. . JMP Student Edition produces graphs and text reports to give information from the analysis. The popup menu for continuous variables (Figure 2.30 2 The Distribution Platform Continuous Variable Graphs and Reports • Continuous variables have an outlier box plot. 31. Continuous Variable Graphs and Reports Initially. • Horizontal Layout arranges text reports to the right of their corresponding graphs and shows the histogram as a horizontal bar chart. number of data points. The text reports for continuous variables summarize typical univariate statistics. Histogram Options The Histogram Options menu contains the following items: • Histogram shows or hides the histogram. and data set size.” p. Figure 2. confidence interval on the mean.10 Continuous Variable Popup Menu Display Options The Display Options menu contains the following items: • Quantiles shows or hides the Quantiles table. kurtosis. upper and lower 95% confidence limits for the mean. and the coefficient of variation. Selecting this option again returns the report to a vertical layout. and quantiles. standard deviation. variance. standard error of the mean. skewness. constructed to show possible outliers in continuous variables. standard deviation.10) shows the options available for continuous variables. • More Moments adds to the Moments table the variable’s sum.

The standard error bar automatically adjusts to reflect the histogram’s bar widths and positions when you change them using the hand tool. sometimes called outliers. extending from both ends of the box. The ends of the box are the 25th and 75th quantiles. or q-q plot. If a variable is normal. The line across the middle of the box identifies the median sample value. This kind of plot is sometimes also called a quantile-quantile plot. and the means diamond indicates the sample mean and 95% confidence interval. Outlier Box Plot The Outlier Box Plot is a schematic that shows the dispersion of a variable. The difference between the quartiles is the interquartile range. • Prob Axis adds an axis that shows the proportion of each value represented by histogram bars. but the Density axis remains constant. Outliers are often identified as points that fall above the upper quartile + 1. • The Density Axis is the length of the bars in the histogram. • Count Axis adds an axis that shows the frequency of each value represented by the histogram bars. the Count and Prob axes change. The dashed lines in the outlier box plot are sometimes called whiskers. the data are approximately normal. relatively easy. Normal Quantile Plots The Normal Quantile Plot option adds a graph to the report that is used to visualize the extent to which the variable is normally distributed. also called the quartiles. Any combination of these axes can be added to categorical or continuous histograms. the normal quantile plot is approximately a diagonal straight line. .5×(interquartile range) or below the lower quartile – 1. As the length of the bars is changed with the hand tool.2 The Distribution Platform Continuous Variable Graphs and Reports 31 2 Distribution: Single-Variable Statistics • Std Err Bars draws the standard error bar on each level of the histogram. The Normal Quantile plot also shows confidence bounds. The whiskers extend to the outermost data point that falls within the distances computed for judging outliers. This makes the identification of points with extreme values.5×(interquartile range). If the data fall within these confidence bounds.

but have greater spacing at the other end (as in this picture). For example. Like the Normal Quantile Plot. in that they highlight corresponding data points in the data table when they are selected in the plot.11. Note that the quantile box plot is not the same as the outlier box plot from page 31. . the quantiles are approximately equidistant from each other. the distribution is skewed toward the end with more spacing. Figure 2. This is useful when determining the shape of underlying distributions. In the example pictured in Figure 2. It was developed for tallying data in the days when computer printouts were neither graphical nor easy to produce. half the data are below the 50th percentile (median). the Quantile Box Plot is useful for seeing normality in a graphical way. the third line of the table reveals that there are data points with values 40 and 41. if one exists). Quantiles are values that divide a distribution into two groups — where the pth quantile is larger than p% of the values. If a distribution is normal. Each line of the plot has a stem value that is the leading digit of a range of column values. Values are reconstructed by using the legend at the bottom of the plot. if the quantile marks are grouped closely at one end. Stem and Leaf Plots The Stem and Leaf command constructs a plot that is essentially a variation on the histogram.32 2 The Distribution Platform Continuous Variable Graphs and Reports The red bracket along the edge of the box identifies the shortest half. The data values can be reconstructed by joining the stem and leaf (and multiplying by the scale factor.11 Stem and Leaf Plot Stem and leaf plots have similar interactive capabilities to JMP Student Edition’s graphics plots. the smallest length that contains 50% of the data. They remain useful because they show the actual data at the same time as the shape of the data. Quantile Box Plots The Quantile Box Plot command shows additional quantiles (sometimes called percentiles) on the axis of the histogram. For example. The leaf values are made from the remaining digits of the values.

12) estimates the area under the density curve up to each data point. including a Goodness of Fit test. Lognormal. you can select among several options. Weibull) to the data. After fitting. A CDF plot (Figure 2. Figure 2.12 CDF Plot for Size of Load Fit Distribution The Fit Distribution menu allows you to fit certain distributions (Normal. Consult a statistics text for a definition of a density function.2 The Distribution Platform Continuous Variable Graphs and Reports 33 2 Distribution: Single-Variable Statistics CDF Plot The CDF Plot command plots a cumulative distribution function step plot using the observed data (with weights or frequencies if specified). .

30. Figure 2.15) work the same as those for continuous variables (see “Continuous Variable Graphs and Reports.34 2 The Distribution Platform Categorical Variable Graphs and Reports Categorical Variable Graphs and Reports The only text report that appears by default in categorical distribution reports is a frequencies table (Figure 2. The standard error of these probabilities (StdErr Prob) and the cumulative probabilities (Cum Prob) for the data are also computed.14) Select Columns to reveal a popup menu of all possible columns for the table. To see them. Right-click on the table to bring up a popup menu (Figure 2.” p. but are not initially shown in the results table. and the probability associated with each variable.13 Frequencies Table This table lists the levels of a categorical variable. Tables that are currently shown have a check mark beside them. Select the column to be shown or hidden.15 Drop-down Menu for Categorical Variables Statistical Tests JMP Student Edition contains numerous statistical tests for single variables. including: . Figure 2. This probability is simply the ratio of each level’s count to the total count.13).) Figure 2.14 Table Popup Menu The options listed in the drop-down menu for categorical variables (Figure 2. the count (sometimes called the frequency) of each level.

This is the p-value for observed significance of the two-tailed t-test.2 The Distribution Platform Statistical Tests 35 2 Distribution: Single-Variable Statistics • a test of the mean of a continuous variable • a test of the standard deviation of a continuous variable • a test of the probabilities of a categorical variable These tests are all accessed through the popup menu next to the variable’s name at the top of the report. the Test Mean table is appended to the bottom of the reports for that variable. Testing a Standard Deviation The Test Std Dev command requests a test value for statistical comparison to the sample standard deviation. the sample standard deviation is used to compute a t-statistic. and the probabilities associated with that Chi Square value: . Testing a Mean The Test Mean command prompts for a test value to compare to the sample mean. a new table is appended to the text report. Each time the mean is tested. The test assumes the distribution is normal. the Test Standard Deviation table is appended to the bottom of the reports for that variable. The value of this probability is half of Prob > |t|. • Prob < t is the probability of obtaining a t value less than the computed sample t ratio by chance alone when the sample mean is not the hypothesized value. The Test Standard Deviation table shows the computed Chi Square statistic that tests whether the hypothesized standard deviation is the same as the computed sample standard deviation. After clicking OK. • Prob > t is the probability of obtaining a t value greater than the computed sample t ratio by chance alone when the sample mean is not the hypothesized value. Optionally. The Test Std Dev command can be used repeatedly to test different values. The Test Mean command calculates and displays the following statistics: • t Test (or z test) lists the value of the test statistic and the p-values for the two-sided and one-sided alternatives. This is the p-value for observed significance of a one-tailed t-test. This is the p-value for observed significance of a one-tailed t-test. The value of this probability is 1 – Prob>t. Use the Test Mean command repeatedly to test different values. • Signed-Rank lists the value of the Wilcoxon signed-rank statistic followed by the p-values for the two-sided and one-sided alternatives. the nonparametric Wilcoxon signed-rank test can be requested. Otherwise. Each time a standard deviation is tested. a new Test Mean table is appended to the text report. After clicking OK. The probability values given in the Test Mean table are defined: • Prob > |t| is the probability of obtaining a greater absolute t value by chance alone when the sample mean is not different from the hypothesized value. a z-test is computed. If a value is entered for the standard deviation. The test assumes nothing about the normality of the distribution. only that it is symmetric.

This is the p-value for observed significance of a one-tailed t-test. This is the p-value for observed significance of the two-tailed t-test.16. the easiest way to test that all the probabilities are equal is to enter a one in each field. The Likelihood Ratio and Pearson Chi Square tests are calculated for those probabilities. • Prob>ChiSq is the probability of obtaining a Chi Square value greater than the computed sample Chi Square by chance alone when the sample standard deviation is not the hypothesized value. confidence intervals for a mean are automatically displayed in the Moments table (Figure 2.36 2 The Distribution Platform Confidence Intervals • Prob>|ChiSq| is the probability of obtaining a greater absolute Chi Square value by chance alone when the sample standard deviation is not different from the hypothesized value. To test a subset of the probabilities.17). Testing Categorical Probabilities The Test Probabilities option displays the dialog shown in Figure 2. leave the levels that are not involved blank.16 Test Probabilities Test Probabilities can scale the hypothesized values so that the probabilities sum to one. Therefore. Confidence Intervals For continuous variables. . JMP Student Edition substitutes estimated probabilities for those left blank. • Prob<ChiSq is the probability of obtaining a Chi Square value less than the computed sample Chi Square by chance alone when the sample standard deviation is not the hypothesized value. Figure 2. The radio buttons on the dialog allow a choice between rescaling hypothesized values to sum to one or using the entered value without rescaling. This is the p-value for observed significance of a one-tailed t-test. where hypothesized probabilities are entered.

This enables the same statistic to be saved multiple times under different circumstances. Figure 2.2 The Distribution Platform Saving Information 37 2 Distribution: Single-Variable Statistics Figure 2.95.95. To obtain a confidence interval alpha that is not listed on the Confidence Interval menu. for example.17 The Moments Table for Continuous Variables To calculate confidence intervals for a specific value of α. If the Save com- . select Confidence Interval > Other and enter the desired level. 0. An example is shown in Figure 2. use the Save menu commands. such as before and after combining histogram bars. Select Confidence Interval from the popup menu next to the variable name in the title bar. Figure 2.18 Confidence Intervals for Continuous Variables To obtain the equivalent table for categorical variables. Each command generates a new column in the current data table named by appending the response column name (denoted colname in the following definitions) to the saved statistic’s name. A dialog box appears that lets you specify which confidence interval to display.19. The Save commands can be used repeatedly. select Confidence Interval > 0.19 Confidence Intervals for Categorical Variables Saving Information To save information computed from continuous response variables. Select the desired level to see.

to create unique column names. If there are duplicates in the column. When there are By-Groups the script includes JSL Where clauses that identify the By-Group levels. its average rank is computed as the sum of its value’s ranks divided by k. starting at 1. and intervals so that the distributions are easily compared. called Midpoint colname. which is then divided by the column standard deviation). if it is saved with the current data table. If the script is saved to a file. Whole-Platform Options Each statistical platform has a popup menu in the outermost outline level next to the platform name. • Standardized creates a new column. it is available to run the next time you open the table. you can edit it. For N non-missing scores. • Normal Quantiles creates a new column. • Stack lets you orient all the output in the report window as either portrait or landscape. This contains the original column’s standardized values (each value in the column has had the column mean subtracted. called N-Quantile colname. The level number of each observation corresponds to the histogram bar that contains the observation. • Script lets you rerun or save the JSL script that produced the platform results. colname2. called Prob colname. If a value occurs k times. • Ranks creates a new column called Ranked colname that contains a ranking for each of the corresponding column’s values.38 2 The Distribution Platform Whole-Platform Options mand is used multiple times. they are assigned consecutive ranks in order of their occurrence in the spreadsheet. This column is similar to the empirical cumulative distribution function. and so forth. the column name for the statistic is named colname1. called Level colname. • Ranks averaged creates a new column. The whole-platform options for the Distribution platform include the following: • Uniform Scaling scales all axes with the same minimum. Options and commands in this menu affect all text reports and graphs on the platform. maximum. beginning with 1. called RankAvgd colname. the probability score of a value is computed as the averaged rank of that value divided by N+1. This option applies to reports for all response variables when selected. The midpoint value for each observation is computed by adding half its level width to its lower level bound. The JSL generated by Save Script for All Objects is the same as Save Script to Script Window if there are no By-Groups. • Prob Scores creates a new column. The Save menu contains the following commands: • Level Numbers creates a new column. its averaged rank is the same as the rank. These normal scores are Van Der Waerden approximations to the expected order statistics for the normal distribution. The histogram bars are numbered from low to high. . called Std colname. If a value is unique. • Level Midpoints creates a new column.

You only have to enter one of the three values. Capability Analyses can calculate capability indices using several different short-term estimates for σ. Upper Spec Limit. you can enter a known value for sigma. and Target.21 Capability Analysis Dialog Box All capability analyses use the same formulas. the process standard deviation. This is the option used for Ppk statistics. which is especially useful when there are By-Groups. Optionally. select Capability Analysis from the popup menu on the outline bar for the variable of interest.21 appears. grouped by a column or a fixed sample size. Figure 2. The difference between the options lies in how sigma is computed. The Dialog box shown in Figure 2. Capability Analysis The Capability Analysis option gives a capability analysis for quality control applications. allowing specification of long-term or one or more short-term sigmas.20 The Script Menu • Data Table Window gives a view of the underlying data table.2 The Distribution Platform Capability Analysis 39 2 Distribution: Single-Variable Statistics Figure 2. Only those fields you enter are part of the resulting Capability Analysis table. A dialog prompts you for Lower Spec Limit. These options for sigma can be explained as: • Long-term uses the overall sigma. and has sigma computed as . The capability study measures the conformance of a process to given specification limits. After requesting a Distribution.

When you click OK. 2 i=1 ----------------------------------n–r–1 (Note that this is the same formula for Short Term. if r is the number of subgroups and each ith subgroup is defined by the order of the data. known sigma used for computing capability ( xi – x ) ------------------n–1 2 analyses. where the sigma used in the chart is entered (in the dialog) as the specified sigma. Grouped by fixed subgroup size computes σ using the following formula. This is the option used for control chart-generated capability analyses. they appear on the histogram when opened at a later time. . the platform appends a Capability Analysis table. When you save the specification limits.40 2 The Distribution Platform Capability Analysis n σ = ∑ i=1 • Specified Sigma allows the user to enter a specific. user-specified and is therefore not computed. This option is found using File > Preferences.) Note: There is a preference for Distribution called Ppk Capability Labeling that will label the long-term capability output with Ppk labels. • Short Term. Sigma is. Grouped by Column brings up a column list dialog from which you choose the grouping column. at the bottom of the text reports. You can remove and redo a Capability Analysis as many times as you want. In this case. choose Spec Limits from the Save command menu. The specification limits can be stored and automatically retrieved as a column property.22. obviously. sigma is computed as n ∑ σ = ( x ij – x ) i. To do this. like the one in Figure 2. 2 i=1 ----------------------------------n–r–1 • Short Term. Grouped by fixed subgroup size and is commonly referred to as the Root Mean Square Error or RMSE. In this case. with r equal to the number of subgroups. sigma is computed as n ∑ σ = ( x ij – x ) i.

Upper Spec Limit. Sigma Quality = Normal Quantile ⎛ 1 – Expected ( # defects )⎞ + 1. Cp (USL – LSL)/6s where USL is the upper spec limit LSL is the lower spec limit . and Spec Target. or a 6. The above and below columns do not sum to the total column because Sigma Quality uses values from the Normal distribution.2 The Distribution Platform Capability Analysis 41 2 Distribution: Single-Variable Statistics Figure 2.1 “Capability Index Names and Computations.1 Capability Index Names and Computations Index Index Name Computation CP process capability ratio. The lower portion of the Capability Analysis table lists five basic process capability indexes. and is therefore not additive. and is also referred to as the process sigma.22 The Capability Analysis Table The Capability Analysis table is organized into two parts.5 -----------------------------------------------⎠ ⎝ n For example. • The Value column lists the values you specified for each limit and the target • %Actual is the observed percent of data falling outside the specification limits. This Sigma Quality measurement is frequently used in Six Sigma methods. They are Lower Spec Limit. The upper part of the table shows these quantities: • The Specification column lists the names of items for which values are shown. describes these indices and gives computational formulas. Table 2. Table 2. the formula yields 6.03 Sigma process.” p. their values.03.The PPM column (parts per million) is the Percent column multiplied by 10. 41.000. if there are 3 defects in n=1. and their upper and lower Confidence Intervals.000 observations. It also lists the percent and PPM for areas outside the spec limits.000.

1 Capability Index Names and Computations CIs for Lower CI 2 χ1 – α CP on CP w CP --------------------------n–1 ------------.-------------------.exp ⎛ -------⎞ + c ( 1 – 2Φ ( – c ) ) ⎝ 2 ⎠ π 2 n–2 2 Γ ⎛ -----------⎞ ⎝ 2 ⎠ 2 1 ------------------n–1 n – 1⎞ -------------------⎛ -----------+ (1 + c ) – ⎝ 9 n(n – 3) 72n ⎠ ⎛ n – 1⎞ Γ ----------⎝ 2 ⎠ 2 ⎛ d n – 2 -.--------------. Cpk Expected Let Value ˆ E ⎛ C pk⎞ ⎝ ⎠ min(CPL..exp ⎛ ⎛ – c -⎞ – 2c ( 1 – 2Φ ( – c ) )⎞ ⎞ ------⎝ ⎝⎝ 2 ⎠ ⎠⎠ π 2 2 Lower CI Upper CI ˆ ˆ E ⎛ C pk⎞ – k Var ( C pk ) ⎝ ⎠ ˆ ˆ E ⎛ C pk⎞ + k Var ( C pk ) ⎝ ⎠ .exp ⎛ ⎛ – c -⎞ – 2c ( 1 – 2Φ ( – c ) )⎞ ⎞ ------⎝⎝ 2 ⎠ ⎠⎠ 6 2n ⎛ n – 1⎞ ⎝ π Γ ----------⎝ 2 ⎠ Variance Var ⎛ C pk⎞ ⎝ ⎠ Using c and d from above. n – 1 (continued) Upper CI on CP χ1 – (1 – α) 2 w ---------------------------------------n–1 ------------------------.d n – 2 -. n – 1 - CPK (PPK for AIAG) CIs for CPK process capability index.⎛ n – 1⎞ – --------.----------d .⎛ n – 1⎞ 36 ⎝ n – 3⎠ 9 n ⎝ n – 3⎠ 2 –c 2 -. Then the expected value is n–2 Γ ⎛ -----------⎞ 2 1 n – 1 ⎝ 2 ⎠⎛ 2 -.42 2 The Distribution Platform Capability Analysis Table 2.⎠ ⎝ 2 c = -----------------------------------------------σ denote the noncentrality parameter and d = USL – LSL ------------------------σ represent the specification limit in σ units.----------. the variance is d . CPU) n ⎛ µ – USL + LSL⎞ ------------------------.

Cpm CIs for Lower CI 2 χ1 – α CPM on CPM ------------. (1990). Z represents (according to the AIAG Statistical Process Control manual) the number of standard deviation units from the process average to a value of .000. Exact 100(1 – α)% lower and upper confidence limits for CPL are computed using a generalization of the method of Chou et al.2 The Distribution Platform Capability Analysis 43 2 Distribution: Single-Variable Statistics Table 2. who point out that the 100(1 – α) lower confidence limit for CPL (denoted by CPLLCL) satisfies the equation Pr { T n – 1 ( δ = 3 n )CPLLCL ≤ 3CPL n } = 1 – α where Tn–1(δ) has a non-central t-distribution with n – 1 degrees of freedom and noncentrality parameter δ.33 is considered to be the minimum acceptable. who point out that the 100(1 – α) lower confidence limit for CPU (denoted CPULCL) satisfies the equation Pr { T n – 1 ( δ = 3 n ) ( CPULCL ≥ 3CPL n ) } = 1 – α where Tn–1(δ) has a non-central t-distribution with n – 1 degrees of freedom and noncentrality parameter δ. For a normal distribution. Exact 100(1 – α)% lower and upper confidence limits for CPU are also computed using a generalization of the method of Chou et al..1 Capability Index Names and Computations (continued) CPM process min ( target – LSL. γ 2 CPM -------------------γ 2 Upper CI on CPM χ1 – (1 – α) 2 CPM -------------------------------γ ------------------------. USL – target ) --------------------------------------------------------------------------capability 2 2 3 s + ( mean – target ) index. At the bottom of the report.mean)/3s capability ratio of one-sided upper specification In Japan. γ - CPL CPU process (mean – LSL)/3s where s is the estimated standard deviation capability ratio case of one-sided lower specification process (USL . Z statistics are reported. this gives an expected number of nonconforming units of about 6 per 100. a capability index of 1. (1990).

P(USL)) where P(LSL) = Prob(X < LSL) = 1 .44 2 The Distribution Platform Capability Analysis interest such as an engineering specification. . described in the next section.Cum Prob(Z USL) Note: You can also do a non-normal capability analysis through Fit Distribution options. Z USL = (USL-Xbar)/sigma = 3 * CPU Z LSL = (Xbar-LSL)/sigma = 3 * CPL Z Bench = Inverse Cumulative Prob(1 . Z USL is the distance to the upper specification limit and Z LSL is the distance to the lower specification limit. a capability analysis is automatically generated by using the quantile values and target you specified. If you give a target value. When used in capability assessment.P(LSL) .Cum Prob(Z LSL) P(USL) = Prob(X > USL) = 1 . After you fit a distribution. you have the option to generate quantiles and a target value for the fitted distribution.

1 Fit Y By X Launch Dialog Notice the word “Contextual” in the title bar. and contingency table analysis—most of the two-variable analyses seen in an introductory statistics course. Details about this data are found in Chapter 2. Introduction After starting JMP Student Edition. 21. scatterplots.” p. Select Analyze > Fit Y By X from the menu bar. This brings up the Fit Y By X platform launch dialog as seen in Figure 3. the variables are examined in pairs. “The Distribution Platform” in the section “About the Data.3 The Fit Y by X Platform Any time two variables need to be compared. Open the data file Denim. the Fit Y by X platform is the choice to make.jmp. More . Launching the Platform In this introduction section. It is there because this dialog launches other platforms depending on the modeling types (continuous or categorical) of the variables in the analysis. This single platform produces one way ANOVA.1 Figure 3.

Is the starch content different for the two levels of Sand Blasted? This is a typical situation examined with a two-sample t-test. All three analyses are requested at the same time to illustrate some of JMP Student Edition’s interactive capabilities.46 3 The Fit Y by X Platform Introduction information on modeling types is found in “The Modeling Type of Variables. as in the next step. and Sand Blasted? are the X variables. Drag these highlighted variables to the box to the right of the X. In the middle plot.3 t-test Results Some things should be noticed about this report. a situation leading to one-way ANOVAs.” p. These analyses would be equally valid if performed separately. Initially. Response button. examine the plot on the far right. 29. Size of Load (lbs). JMP Student Edition produces a scatterplot of two continuous variables. dot plots of each level of a nominal variable are plotted side by side.2 Fit Y by X Results On the far left and far right. hold down the Shift key. Note that these dialog boxes respond to dragging as well as button clicks. Click OK. Select Starch Content (%) from the list of columns and click the Y. Computing a t-test As a simple example. .2. Figure 3. To select all three X variables. Factor button. To conduct the t-test. The t-test report appears in the outline beneath the plot labeled t test. then click Sand Blasted?. Figure 3. click on Method. with Starch Content (%) as the Y variable in all of them. Select t test from the drop-down menu in the plot’s title bar. relating starch content to whether the fabric was sand blasted or not. this example consists of three analyses. Three plots appear as in Figure 3. a situation leading to fitting lines and curves. Method.

” p. select Means/Anova/t test to remove the pooled t report. 71 in the “The Matched Pairs Platform” chapter. the before-and-after measurements of a patient taking an experimental medication). Select the Means/Anova/Pooled t command from the report’s drop-down menu. Here. examine the plot on the left side in the report in Figure 3.2. 50. use the Matched Pairs platform. of Starch Content (%) vs. with two different methods. In fact. Details on matched pairs are found in “The Matched Pairs Platform. the square of the t statistic (listed under t-Test) is equal to the value of the F statistic (listed in the ANOVA table as F Ratio). JMP Student Edition has tested the same hypothesis twice.3 The Fit Y by X Platform Introduction 47 3 Fit Y by X: Comparing T wo Variables • There is a statement on the second line of the t-test report that says “Assuming unequal variances”.” p. Notice that this is the same value as listed in the new Analysis of Variance report in the column labeled Prob>F. note that the p-value for the unpooled t-test is listed beside Prob>|t| in the original t test report. • This is a two sample t-test. If you want the pooled version (where the variances are assumed to be equal). Pooled t test Now. This test is also known as the unpooled t-test. From the drop-down menu in the Oneway Analysis title bar. not a matched pairs t-test. Denim washed with Alpha Amalyze appears to have a lower starch content than denim washed with . • The plot gets embellished with means diamonds. Method. All of these are discussed later in this tutorial in “Analysis of Variance (anova). In essence. and both methods agree (as they always should!). If the data from the two groups have a natural pairing (for example. select the Means/Anova/Pooled t command. and other text tables.

as shown in Figure 3. From the same drop-down menu. The “captured” points become selected. and box plots are superimposed on the plot.” p. Select the Lasso tool from the Tools toolbar. giving clues to the underlying distribution of each level. but also for selecting results in other plots. .” p. it is helpful to look at text reports of these results. From the drop-down menu in the Oneway Analysis title bar.4 Lasso Tool Lasso Tool The Lasso tool is used to draw curves around points. or the online help. Details on these additions are found in “One Way anova—The Continuous by Categorical Case. and quantiles for the three levels of the Method variable. These additions can also be removed. Text reports appear below the plot. In the same drop-down menu.48 3 The Fit Y by X Platform Introduction Caustic Soda or Pumice Stone. see the section titled “Quantile Box Plots. select Quantiles and Means and Std Dev again. mean error bars and standard deviation lines appear on the plot. Figure 3. examining the mean. Box plots are superimposed on the plot. select Means and Std Dev. which are produced as follows. 32.4. select Quantiles. For details on box plots. Selecting and Marking Points This plot is not only useful for computing results. For more specificity. standard deviation. In addition to the new text reports. median. 61.

select Color or Mark by Column. Make sure that the Set Color by Value. With the points selected. To mark all the data at once. In fact. The Alpha Amalyze points change to the triangle in all the plots.3 The Fit Y by X Platform Introduction 49 3 Fit Y by X: Comparing T wo Variables While holding down the mouse button. Unique colors and markers are assigned to each level of the Method variable in all the plots. right-click inside the plot. To make these points distinctive. right-click inside the plot and select Row Markers.5 Selecting Points with the Lasso Tool JMP Student Edition briefly flashes how many points are contained in the selection region (32 in this case) and selects the points. select Row Colors from the popup menu. Figure 3. . For example. as shown in Figure 3. and choose a red color from the color palette. there is an easier way to change the colors and markers of points in a plot if there is a certain column that divides up the data. Set Marker by Value. Notice that these points are highlighted in all the plots. drag the Lasso tool completely around the points for Alpha Amalyze. The Alpha Amalyze points turn red in all the plots. select Method from the list of variables. suppose that plots are needed that clearly distinguish the three levels of the Method variable. and Make Window with Legend checkboxes are checked. From the Rows menu. and in the data table. assign them a unique color and marker.5. like scatterplots and leverage plots. The Lasso tool works with all plots that show individual points. Select the small triangle from the markers palette. Release the mouse button. In the resulting dialog box. Again.

6. Select Compare Means > Each Pair. at least one level of the Method variable has a significantly higher or lower starch content than the others. Click on the bottom comparison circle. For groups with equal sample sizes. as shown by the schematic in Figure 3. Overlap marks are drawn above and below the group mean. and are clickable. . In other words. Student’s t Complete details of comparison circles are on page 63. Comparison Circles To show comparison circles. JMP Student Edition uses comparison circles to explore this.6 Means Diamonds Illustrated Group Mean Overlap Marks 95% CI Examining the ANOVA table shows that Method is a highly significant predictor of starch content. When a circle is clicked. so is not available on the menu. The vertical span of each diamond represents the 95% confidence interval for each group. and is produced in the same way as the t-test above. Put simply. it turns red. and means diamonds appear on the plot. levels that are not significantly different from it turn red. Select Means/Anova from the drop-down menu next to Oneway Analysis. A means diamond illustrates a sample mean and its 95% confidence interval. they show differences among levels of a variable. An ANOVA table appears beneath the plots. Note that his command reads Means/Anova/t test when the categorical variable only has two levels. overlapping overlap marks indicate that the two group means are not significantly different at the 95% confidence level. the t test is not appropriate. corresponding to Alpha Amalyze. The horizontal line across each diamond represents the group mean. Figure 3. In all other cases (like this one). The obvious question is which levels are different from each other. and levels that are significantly different from it turn gray. To see this.50 3 The Fit Y by X Platform Introduction Analysis of Variance (ANOVA) Is knowledge of the wash method useful in predicting starch content of the denim? The statistical test to answer this question is called a one-way ANOVA.

For example.3 The Fit Y by X Platform Introduction 51 3 Fit Y by X: Comparing T wo Variables Figure 3. it can be changed.8 Fit Mean Results Platform Menu Fit Menu . and Fit Mean appears in a legend below the plot. Fitting Lines The middle plot of the report in Figure 3. Notice that the comparison circles change diameter when the α-level changes. These comparison circles are based on the confidence interval around the mean. the α-level is 5%. a situation that allows fitting of lines and curves through least-squares regression. By default.8. which is itself based on the α level.2 is of two continuous variables. as shown in Figure 3. Notice that Fit Mean below the plot has its own drop-down menu. A line representing the mean appears on the plot.7. To see this mean. suppose you want to predict starch content based on the size of the wash load. A good guess may simply be the mean starch content from all the data points.10. This shows that Alpha Amalyze is significantly different from the other two wash methods. select Set α Level > . From the popup menu in the Oneway title bar.7 Starch Content Comparison Circles The display changes to the one shown in Figure 3. However. Figure 3. Click on the other two circles to discover their relationships. Select Fit Mean from the drop-down list in the title bar of the plot.

To fit a regression line to this data. In this case.52 3 The Fit Y by X Platform Introduction A more interesting statistical question is whether a line or a curve is a better predictor of starch content than this simple mean. It is statistically sound to use the fitted line in predictions. the dotted confidence interval around the linear fit does not contain the mean. JMP Student Edition can draw confidence intervals for the fit around the fitted line. are found in the Linear Fit report. significant by almost any standard. One is the p-value associated with the slope of the line. As seen in Figure 3. A line is superimposed on the graph. Therefore.9 Fit Line Results There is also an option to produce shaded confidence curves. This line should be compared with the simple mean to see if it is helpful in prediction. using the Confid Shaded Fit command. To do this comparison. as well as several computed statistics.0045.9. there are numerical measures of the significance. Values of the slope and intercept are also printed in the Parameter Estimates section of the report. the p-value is 0. If these confidence intervals do not contain the horizontal mean. reinforcing the graphical results above. then the fitted line is helpful. Select Fit Line from the platform menu in the plot’s title bar. the linear fit is statistically significant. also found in the Parameter Estimates report. Select Confid Curves Fit from the Linear Fit menu in the legend below the plot. Aside from the graphical confidence-curve method detailed above. . The equation of the line. Figure 3.

Figure 3. Its value does not appear on any of the reports so far. Residuals One of the best diagnostic tools for a linear fit is its residuals. A new report named Correlation appears at the bottom of the text reports. frequently denoted by r.to reveal this report The correlation coefficient is listed under the word Correlation. there are different residuals for each fit. select Remove Fit from the fit popup menu. remove the fits that are there now.10).11 Residual Plot The plot is getting cluttered.. In other . although the square of its value (r2) is listed beside RSquare in the Summary of Fit text report. The red horizontal line on the plot shows the mean of the residuals (which should. but can be opened by clicking on the blue disclosure icon (Figure 3.. To see residuals for the linear fit. so residuals commands are found in the fit menus in the legends below the plots. the first step in interpretation of residuals is to see them plotted. To compute the value of r itself. which is not duplicated here. request a density ellipse. so before continuing.3 The Fit Y by X Platform Introduction 53 3 Fit Y by X: Comparing T wo Variables Correlation Coefficient Another measure of fit is the correlation coefficient. Another interesting question is whether a single line (like this model) is enough to describe starch content for all wash methods.0045) is the same as that listed for the slope coefficient in the Parameter Estimates table.. be near zero).. and the same as the Prob>F value in the Analysis of Variance table. . ideally.10 The Disclosure Icon Click here. Of course.95 from the platform menu on the plot’s title bar. All (good) introductory statistics textbooks discuss interpretation of residuals. or if a different line is needed for each level of the Method variable. Select Density Ellipse > . A plot of the residuals appears at the bottom of the report. It is initially closed. It is interesting to note that its significance (p=0. Disclosure Icon Figure 3. Select Plot Residuals from the fit popup menu in the legend below the plot. However. For each of the fits below the plot.

both X and Y are categorical variables. select Method as the grouping variable. 66. is starch content related to load size in the same way when washed in Alpha Amalyze as when washed in Caustic Soda or Pumice Stone? Although this question is more in the realm of the Fit Model platform (detailed in “The Fit Model Platform” chapter). Note that the mosaic plot is clickable. like all plots in JMP Student Edition. Response role. Assign Method to the X. Figure 3. Two-Way Contingency Tables In the next example. For example. Separate lines for each wash method appear on the plot. From the platform menu on the plot’s title bar. Click in the lower. Assign Thread Wear (not Thread Wear Measured) to the Y.54 3 The Fit Y by X Platform Introduction words. First. Now. Factor role. To generate a contingency table for this problem. red section in the mosaic plot in the bar above Alpha Amalyze (see Figure 3. Click OK. Since both variables are categorical. instead of the more compact (but possibly less accurate) reporting of the single line found previously.” p. Details of these displays are in the section “Contingency Analysis—The Categorical by Categorical Case. as shown in Figure 3.12. some initial investigation is easy with this platform. In the resulting dialog box. a mosaic plot appears. The analysis uses contingency tables—orderly ways of arranging count data.12 Group By and Linear Fits The statistical question is whether these lines are different enough to warrant the extra trouble in reporting all three. Select Analyze > Fit Y by X from the menu bar.13) . instruct JMP Student Edition to group its calculations for each level of the Method variable. The question is whether the method of washing denim has an effect on thread count. Select Fit Line from the Platform popup menu on the title bar above the plot. request JMP Student Edition to fit a line as before. to select all rows washed in Alpha Amalyze with a low thread wear. select Group By. followed by a contingency table.

. or severe for each load size. The p-value for this test appears in the column labeled Prob>ChiSq. Click OK. This has the effect of estimating a single logistic curve. The report that appears shows the probability that the thread wear is low. There is a distinction between nominal and ordinal responses on this platform: • Nominal logistic regression estimates a set of curves to partition the attributed probability among the responses. there is not enough evidence to say that the thread count of denim is affected by wash method. As an example. moderate.3 The Fit Y by X Platform Introduction 55 3 Fit Y by X: Comparing T wo Variables Figure 3. The fitted model estimates probabilities attributed to each x value. Select Analyze > Fit Y By X Assign Thread Wear as Y. Factor.13 Mosaic Plot Click in this area to select Alpha Amalyze-washed jeans with low thread wear Just below the contingency plot are tests for the independence of the two variables. This model is less general but more parsimonious. There is not enough evidence to say that these two variables are not independent—in other words.76. Response and Size of Load (%) as X. and is recommended for ordered responses. • Ordinal logistic regression models the probability of being less than or equal to a given response. The logistic platform is the nominal/ordinal by continuous personality of the Fit Y by X command. which is shifted horizontally to produce probabilities for the ordered categories. which in this case is the non-significant 0. Logistic Regression The Logistic platform fits the probabilities for response categories to a continuous x predictor.

In the Denim. They are especially useful when transforming data. The Formula Editor window operates like a pocket calculator with buttons. displays. Select Formula from the menu that appears. It is linked to (and dependent on) all other columns that are part of its formula. Right-click in the heading of the column Thread Wear. There are two methods to create a new column. A new column is needed to store the new values in. . the ordinal variable Thread Wear is computed with a formula that partitions the values of Thread Wear Measured into low. Example The essential features of the formula editor are best seen through an example. Its values are automatically recomputed whenever the values in these columns are edited. Formulas serve a wide variety of purposes. The Formula Editor A powerful (and often under used) feature of JMP Student Edition is its formula editor. and severe categories. Click Cancel to return to the data table. Suppose that it is necessary to calculate the logarithm of the values in the Starch Content (%) column (This calculation is common in real-world statistics). and an extensive list of easy-to-use features. A column whose values are computed by a formula is both linked and locked.0657 hints at a weak association between these two variables. moderate. To see the formula. from assigning simple values to computing complex calculations with parameters and conditional clauses.jmp sample data table. It is locked so that its data values cannot be edited individually.56 3 The Fit Y by X Platform Introduction The p-value of 0.

Select Formula from the menu that appears. . double-click in the area to the right of the last column in the data table. then select Log from the resulting menu. are found in JMP Student Edition’s online help. Alternatively. select Cols > New Column. add a formula to the column. Now. Click OK to close the Formula Editor and apply the formula. Change the title of the new column to Log Starch. When a new column is created. the default title is highlighted and ready to be changed. When the formula editor appears. Common (base 10) logs are computed using the Log10 function. Click on Transcendental from the Functions list. Select Starch Content (%) from the columns list. Further examples. Right-click in the heading of the column Log Starch. Note that Log is the natural logarithm (base e).3 The Fit Y by X Platform Introduction 57 3 Fit Y by X: Comparing T wo Variables From the main menu bar. as well as complete documentation of all the formula editor functions.

As with all the fitting commands in this platform. where additional commands for each fit are accessed.15 Fit Commands The Fit Line command adds a straight-line fit to the plot using least squares regression. JMP Student Edition produces a bivariate analysis that initially shows a scatterplot. all accessed through the popup menu beside the variable name in the title bar (Figure 3.14) Figure 3. and the sum of squared errors around the mean Figure 3.15) has commands to save predicted values and residuals for the linear fit as new columns in the current data table. its standard deviation. It is often useful to first fit the mean as a reference line for other fits.58 3 The Fit Y by X Platform Scatterplots—The Continuous by Continuous Case Scatterplots—The Continuous by Continuous Case If both the X and Y variables are continuous. . The Fit Mean command adds a horizontal line to the plot at the mean of the response variable (Y). There are a number of options once the scatterplot appears. Its drop-down menu (accessed as in Figure 3.14 Bivariate Popup Menu Show Points alternately hides or displays the points in the plot.The Fit Mean Fit table shows the value of the mean. its standard error. a legend appears below the plot with its own drop-down menu.

3 The Fit Y by X Platform Scatterplots—The Continuous by Continuous Case 59 3 Fit Y by X: Comparing T wo Variables If the confidence area around this line (produced through the Confid Curves Fit command) includes the horizontal line at the response mean. The Fit Polynomial command fits a polynomial curve of the degree selected from the Fit Polynomial submenu.05 significance level. three additional tables are appended to the report (see Figure 3. options can save predicted values and residuals as new columns in the current data table for each polynomial fit.16 Linear Fit Text Reports The Summary of Fit Table The summary of fit table shows the following information: • R2 (RSquare). . The Fit Polynomial option can be selected multiple times with different polynomial degrees for comparison. A Lack of Fit table also appears if there are replicates. 95% confidence limits are plotted with the Confid Curves Fit and Confid Curves Indiv display options.” p. After selecting the polynomial degree. 90 or the online help. see “The Lack of Fit Table. used to compare models with different numbers of variables • The Root Mean Square Error. Each time a linear or polynomial fit is chosen. For details of the lack of fit test. an estimate of the standard deviation of the random error • The mean of the response variable • The number of observations (or. As with the linear fit. Figure 3. the curve is fit to the data points using least squares regression. if weighted variables are involved.16). a measure of how well the line fits • The adjusted R2 (RSquare Adj). then the slope of the line of fit is not significantly different from zero at the 0. the sum of the weights).

• Prob > F is the observed significance probability (p-value) of obtaining a greater F value by chance alone if the specified model fits no better than the overall response mean.60 3 The Fit Y by X Platform Scatterplots—The Continuous by Continuous Case The Lack of Fit Table The Lack of Fit table shows a special diagnostic test that appears only when the data and the model provide the opportunity. They are used in constructing tests and confidence intervals. see “The Lack of Fit Table. A significant F statistic in this table indicates that a different model should be examined. However.05 is interpreted as evidence that the parameter is significantly different from zero. These estimates are the coefficients in the linear model. there may be some differences in the terminology for the ANOVA table’s parts. Often. a value below 0. Parameter Estimates Table The terms in the Parameter Estimates table for a linear fit (seen previously in Figure 3. given a true hypothesis. Analysis of Variance Table This table look similar to the ANOVA tables in most textbooks. It is a test to see if a different form of the model would fit the data better. Error. 90 or the online help. The “C” in C Total stands for corrected. as in corrected for the mean. Looking for a t ratio greater than 2 in absolute value is a common rule of thumb for judging significance. • Prob>|t| lists the observed significance probability calculated from each t ratio. • Mean Square is a sum of squares divided by its associated degrees of freedom.16) are the intercept and the single X variable. and C Total. • F Ratio is the model mean square divided by the error mean square. called Model. It is the probability of getting. It is the ratio of the parameter estimate to its standard error. If a parameter is a significant model effect. • t Ratio lists the test statistics for the hypothesis that each parameter is zero. . The underlying hypothesis of the fit is that all the regression parameters (except the intercept) are zero. For details of the lack of fit test. The intercept is a constant term in all models. • DF records the associated degrees of freedom (DF) for each source of variation • Sum of Squares records an associated sum of squares (SS for short) for each source of variation.05 significance level. • Estimate lists the parameter estimates of the linear model. JMP Student Edition uses the following: • Source lists the three sources of variation. • Std Error lists the estimates of the standard errors of the parameter estimates. The Parameter Estimates table displays the following: • Term lists the name of each parameter in the requested model. a t ratio greater (in absolute value) than the computed value.” p. the F Ratio is usually higher than expected by chance alone. because it approximates the 0. by chance alone.

The Group By command is checked in the fitting menu when a grouping variable is in effect. Compare it to other fitted lines to see the concept of lack of fit. reciprocal. When a grouping variable is selected. square root. The density ellipsoid is a good graphical indicator of the correlation between two variables. select the Group By command again and respond to its dialog as before. The Group By command in the fitting menu displays a dialog. allowing selection of a classification (grouping) variable. but in the continuous by continuous bivariate platform. Transformations include log. The Fit Each Value command fits a value to each unique X value. The Density Ellipse command draws an ellipse that contains the specified mass of points. The ellipsoid collapses diagonally as the correlation between the two variables approaches either 1 or –1. The Other selection allows the specification of any probability greater than zero and less than or equal to one. determined by the probability chosen from the Density Ellipse submenu. JMP Student Edition produces a one way ANOVA.17). the Fit Y by X platform computes a separate analysis for each level of the grouping variable. Select the Group By command to remove (uncheck) the existing variable. initially displaying a plot that shows a vertical distribution of Y points for each X value. but an extra report shows measures of fit transformed in the original Y scale (if there was a Y transformation). all accessed through the popup menu beside the variable name in the title bar (Figure 3. so it appears as a curve on the plot. and overlays the regression curves or ellipses on the scatterplot. To change a grouping variable that is already in effect. square. The Density Ellipse table that accompanies each Density Ellipse fit shows the correlation coefficient (r) for the X and Y variables and the probability that the correlation between the variables is significant. with individual popup menus to save or remove fitting information. The fit for each level of the grouping variable is identified beneath the scatterplot. . The ellipsoid is more circular (less diagonally oriented) if the two variables are uncorrelated. Then. The regression report is shown with the transformed variables. and exponential.3 The Fit Y by X Platform One Way anova—The Continuous by Categorical Case 61 3 Fit Y by X: Comparing T wo Variables Other Fitting Commands The Fit Special command displays a dialog with choices for transformations of both the Y and X variables. Fitting each value is like doing a one-way analysis of variance. There are a number of options once this scatterplot appears. The fitted line is plotted on the original scale. One Way ANOVA—The Continuous by Categorical Case If the X variable is categorical and the Y variable is continuous.

and standard errors computed with the pooled estimate of the error variance.” p. Error Bars. The Means and Std Dev command fits means for each group.62 3 The Fit Y by X Platform One Way anova—The Continuous by Categorical Case Figure 3. See “Analysis of Variance (anova). but uses standard deviations computed within each group rather than the pooled estimate of the standard deviation used to calculate the standard errors of the means. Hsu’s MCB displays a table that shows Hsu’s MCB (Multiple Comparison with the Best) comparisons of group means to the best (maximum or minimum) group mean. means. 50 for a detailed description of means diamonds. The Means/Anova/t test command fits means for each group and performs a one-way analysis of variance to test if there are differences among the means. Tukey HSD displays a table that shows the Tukey-Kramer HSD (honestly significant difference) comparisons of group means. 75%. 90%. and 100% (maximum) quantiles for each group. Dunnett’s displays a table showing Dunnett’s comparisons of group means with a control group. and a table that lists group frequencies. • With Best. • With Control. which lists the 0% (minimum). 50% (median).17 One Way ANOVA Popup Menu The Quantiles command displays the Quantiles table. 10%. Student’s t displays a table with Student’s t statistics for all combinations of group means. . Three tables are produced: a summary table. and Std Dev Lines display options. a one-way analysis of variance table. This option automatically activates the Means Diamonds display option. This command also displays Means Dots. 25%. It also activates Box Plots from the Display Options menu. a t-test also shows. • Each Pair. • All Pairs. All activate the Comparison Circles display option. If there are only two groups. Compare Means has a submenu that provides the following four multiple comparison methods for comparing sets of group means.

Circles for means that are significantly different either do not intersect or barely intersect.18 Alignment of Comparison Circles Compare each pair of group means visually by examining how the comparison circles intersect. Circles representing means that are not significantly different from the highlighted circle show with thin lines (see Figure 3. it is easy to verify whether the means are significantly different by clicking on the comparison circle. Figure 3.20). The highlighted circle appears with a thick solid line. The outside angle of intersection tells whether group means are significantly different (see Figure 3. so that the outside angle of intersection is less than 90°. The illustration in Figure 3.3 The Fit Y by X Platform One Way anova—The Continuous by Categorical Case 63 3 Fit Y by X: Comparing T wo Variables Each multiple comparison test begins with a comparison circles plot. click in the graph outside the circles. thus highlighting it.19 Angles in Comparison Circles angle greater than 90 degrees angle equal to 90 degrees angle less than 90 degrees not significantly different borderline significantly different significantly different . If the intersection angle is close to 90°. If the circles intersect by an angle of more than 90°. The plot follows with a table of means comparisons. Circles representing means that are significantly different show with a thick gray pattern.18 shows the alignment of comparison circles with the confidence intervals of their respective group means. Figure 3. the means are not significantly different.19). or if they are nested. a visual representation of group mean comparisons. To deselect circles.

or the specification of any level with the Other selection. the usual analysis of variance assumptions are not satisfied. However. • Wilcoxon rank scores are the simple ranks of the data. Nonparametric tests are useful to test whether group means or medians are located the same across groups. and van der Warden tests. There is a valid variant of the standard ANOVA. and modifies the upper and lower confidence level values in reports. Changing the alpha level recalculates any confidence lim- its. Normal Quantile Plot shows overlaid normal quantile plots for each level of the X variable. When the variances across groups are not equal. • Median rank scores are either 1 or 0 depending on whether a rank is above or below the median rank. Set Alpha Level has a submenu that allows a choice from the most common alpha levels. which is displayed. It uses (and reports) four different tests: O’Brien’s test. The Wilcoxon test is the most powerful rank test for errors with logistic distributions. the usual analysis of variance assumption of normality is not made. Along with the standard normality-assessing capabilities of the single-variable Normal Quantile Plot. adjusts the means diamonds on the plot if they are showing. Nonparametric tests use functions of the response variable ranks. so the standard ANOVA F test is not valid. and Bartlett’s test. . The Median test is the most powerful rank test for errors with doubly exponential distributions. called rank scores.20 Comparison Circles after Clicking The Nonparametric submenu allows computation of three nonparametric tests: the Wilcoxon. • Van der Waerden rank scores are the ranks of the data divided by one plus the number of observations transformed to a normal score by applying the inverse of the normal distribution function. the Brown-Forsythe test.64 3 The Fit Y by X Platform One Way anova—The Continuous by Categorical Case Figure 3. called the Welch ANOVA. The UnEqual Variances command tests for equality of group variances. Median. this plot shows both the differences in the means (vertical position) and the variances (slopes) for each level of the categorical X factor (Figure 3.21). Levene’s test. The Van der Waerden test is the most powerful rank test for errors with normal distributions.

CDF plots the cumulative distribution function for all the groups in the Oneway report.3 The Fit Y by X Platform One Way anova—The Continuous by Categorical Case 65 3 Fit Y by X: Comparing T wo Variables Figure 3. This is the centered response divided by the standard deviation within each level.21 Normal Quantile Plot Slopes show standard deviations Separations show differences in means Normal Quantile Plot has these additional options: • Plot Actual by Quantile generates a quantile plot with the response variable on the y-axis and quantiles on the x-axis. The plot shows quantiles computed within each level of the categorical X factor. • Grand Mean draws the overall mean of the Y variable on the scatterplot. • Mean CI Lines draws lines at the upper and lower 95% confidence levels for each group. Complete details of means diamonds is found in “Analysis of Variance (anova). • All Graphs shows or hides all graphs. • Save Normal Quantiles saves normal quantile values computed within each level of the categorical factor variable. • Std Dev Lines shows dotted lines one standard deviation above and below the mean of each group. Display Options allows addition or removal of plot elements.and y-axes. • Probability Labels shows probabilities on the right axis of the Quantile by Actual plot and on the top axis of the Actual by Quantile plot. • Means Diamonds draws Means Diamonds. • Box Plots shows outlier box plots for each group. 50. Save has a submenu of commands to save the following quantities as new columns in the current data table: • Save Centered saves values computed as the response variable minus the mean of the response variable within each level of the factor variable. as shown in Figure 3. • Mean Lines draws a line at the mean of each group. . • Plot Quantile by Actual reverses the x. • Save Standardized saves standardized values of the response variable computed within each level of the factor variable. • Points shows data points on the scatterplot.” p. • Line of Fit draws the straight diagonal reference lines for each level of the X variable. • Mean Error Bars identifies the mean of each group with a large marker and shows error bars one standard error above and below the mean.21.

• Redo Analysis repeats the analysis represented in the report. • Save Script to Script Window produces a script that can re-create the report in a text window. and a table of chi-square tests. • Points Jittered adds random horizontal jitter so that points that overlay on the same Y value can be seen. The resulting script generates all reports in the window. contingency table (sometimes referred to as a crosstabs table). • Mean of Means • X-Axis Proportional makes spacing on the x-axis proportional to the sample size of each level.22 Contingency Analysis . • Connect Means connects the group means with a straight line. Script Submenu The Script submenu contains commands related to saving a script to redo an analysis. • Points Spread spreads points over the width of the interval. • Save Script for All Objects is useful when several analyses — like those from a By group. • Data Table Window brings the data table to the front of the display. JMP Student Edition produces a contingency analysis that initially shows a mosaic plot.66 3 The Fit Y by X Platform Contingency Analysis—The Categorical by Categorical Case • Comparison Circles show comparison circles computed for the multiple comparison method selected in the platform menu. • Save Script to Data Table generates a script that can redraw the report. Figure 3. or from several variables in a single Distribution report— are in the same window. This script can then be edited or saved to an external file. • Save Script to Report appends a script to the top of the report. and attaches it to the data table. Contingency Analysis—The Categorical by Categorical Case If both the X and Y variables are categorical.

and Tests all operate as toggles. margin total frequencies. the probability scale in the y direction is divided up (partitioned) into probabilities for . Figure 3. The cumulative logistic probability plot gives a complete picture of what the logistic model is fitting. divided by the grand total • Deviation is the observed (actual) cell frequency minus the expected cell frequency • Cell ChiSq is the Chi Square values computed for each cell as ( Observed – Expected ) -------------------------------------------------------Expected 2 Logistic Regression—The Categorical by Continuous Case If the Y variable is categorical and the X variable is continuous. Script. Figure 3.” p. 66.24 Contingency Table Popup Menu • Count is the cell frequency. Display Options > Horizontal Mosaic rotates the mosaic plot 90 degrees. is explained in the section “Script Submenu. and grand total (total sample size). Contingency Table. At each x value. The final item. JMP Student Edition produces a logistic analysis that initially shows a logistic plot and text reports.3 The Fit Y by X Platform Logistic Regression—The Categorical by Continuous Case 67 3 Fit Y by X: Comparing T wo Variables The popup menu for contingency analyses contains items to turn parts of the report on and off. • Row % is the percentage of each cell count to its row total • Col % is the percentage of each cell count to its column total • Expected is the expected frequency of each cell under the assumption of independence. Mosaic Plot. It is computed as the product of the corresponding row total and column total. • Total % is the percentage of cell counts and margin totals to the grand total.23 Contingency Popup Menu The contingency table itself has a popup menu to turn its cell contents on and off.

The fit tries to avoid points attributed to have a small probability. The negative sum of logs of the observed probabilities is called the negative log-likelihood (–LogLikelihood).” p. which are points crowded by the curves of fit. with the total across all Y category probabilities sum to 1. and Script options. There are only a couple of options in the Logistic drop-down menu (see Figure 3. The probabilities are measured as the vertical distance between the curves. It is a specific likelihood-ratio Chi-square test that evaluates how well the categorical model fits the data. and allow the curves to get close together where there is no data. This table is analogous to the Analysis of Variance table for a continuous response model. Figure 3. (See “Script Submenu.26)—turning the plot on and off through the Logistic Plot command. 66 for details on scripting options. The . The data pushes the curves because the criterion that is maximized is the product of the probabilities fitted by the model.) The Whole Model Test Table The Whole Model Test table shows if the model fits better than constant response probabilities.68 3 The Fit Y by X Platform Logistic Regression—The Categorical by Continuous Case each response category. with the y position jittered randomly within the range corresponding to the response category for that row. You can see that the points tend to push the lines apart and make vertical space where they occur in numbers.25 Interpreting the Logistic Plot P(thread wear is severe) at load size 250 P(thread wear is moderate) at load size 250 P(thread wear is low) at load size 250 These three probabilities sum to one Markers for the data are drawn at their x-coordinate.

26 Logistic Plot Popup Menu . The difference in the log-likelihood from the model fitted by the data and the model with equal probabilities is a Chi-square statistic. They turn various plot elements on and off.26. In Figure 3. and allow adjustment of the line color. Platform Options The menu of platform options is shown in Figure 3. Figure 3. the standard Script menu appears. This test statistic examines the hypothesis that the x variable has no effect on the response.3657.3 The Fit Y by X Platform Logistic Regression—The Categorical by Continuous Case 69 3 Fit Y by X: Comparing T wo Variables negative log-likelihood for categorical data plays the same role as sums of squares in continuous data. In addition. which is not statistically significant.25. the p-value of the Chi-square statistic is 0.

.

and call for the Matched Pairs platform. so they form a paired situation. Data in this form are handled by the Matched Pairs platform. 21 in the “The Distribution Platform” chapter. However. the column needs to be split into two starch columns. To use the Matched Pairs platform. select Sand Blasted? from the list of columns and click Split By. Preparing the Data This example examines the starch content of blue jeans. A classic example is a before-and-after study of the effect of a medication. having split Starch Content (%) into two columns based on the value in the Sand Blasted? column. . the paired data must be in two columns. Select Tables > Split. all the starch data is in the single column Starch Content (%). the split command is set to make a new data table. In fact. The examination is of jeans that come from the same lot. Select Lot Number and Method and click Group. no variables other than the ones already in the dialog need to be retained. To drop the unnecessary variables. with one group having been sand blasted. type “Paired Denim” to name the new data table. Open the data file Denim. in the Denim file. Select the Drop All radio button at the bottom of the Split dialog. so they do not need to be included in this new table. Therefore. At this point.jmp. In the Output table name box.” p. Select Starch Content(%) from the list of columns and click Split Columns. In the dialog that results. and the other not. In the forthcoming example analysis. Introduction After starting JMP Student Edition. based on whether the denim was sand blasted or not. Details about this data are found in “About the Data.4 The Matched Pairs Platform Some two-variable data have a natural pairing to them. The dialog should appear like the one in Figure 4. all of the original variables are not used.1. To split the data in this way.

2.72 4 The Matched Pairs Platform Introduction Figure 4.2 Paired Denim Data Launching the Platform Select Analyze > Matched Pairs from the menu bar.1 Split Columns Dialog Click OK to create the data table. . Figure 4. This brings up the Matched Pairs platform launch dialog as shown in Figure 4. The data table appears as in Figure 4. with new columns no and yes containing starch information.4.

4 Matched Pairs Launch Dialog Optionally. the difference is significant The text reports below this plot show the same result—there is a difference in starch content of denim based on sand blasting.3 Paired Denim Matched Pairs Report The red line is the difference given by the data The horizontal gray line represents zero The dotted lines are a 95% confidence interval on the difference. with a p-value of 0. If it doesn’t contain zero. The resulting report is easily interpretable.4 The Matched Pairs Platform Introduction 73 4 Paired Variables To launch the dialog. Figure 4. See the online help for an example using a grouping variable. select the two paired variables to be analyzed. The Matched Pairs Scatterplot After it is launched. The primary graph in the platform is a plot of the difference of the two responses on the y-axis. Grouping role to have JMP Student Edition estimate means for the groups. Paired Response button. Select the no and yes variables from the columns list and click the Y. Figure 4. the Matched Pairs platform displays a scatterplot and numerical results. . These two variables are the values that are paired.002. and the mean of the two responses on the x-axis. The Matched Pairs Launch Dialog The Matched Pairs platform launch dialog (Figure 4. as shown in Figure 4.3. and test both between and among the pairs.4) requires at least two variables to be entered. a grouping variable can be entered in the X. This graph is the same as a scatterplot of the two original variables.

then the means are not significantly different at the 0. If the confidence region includes the horizontal line at zero.6. Figure 4.6. shown in Figure 4. with the 95% confidence interval above and below. • The mean of the mean of pairs is shown by the vertical line. The square reference frame can also be toggled on and off. and standard scripting items are available. or plotting the difference by the row number.7 The Matched Pairs Men .5 Comparison of Scatterplot and Matched Pairs Plot Notice the following in Figure 4. Figure 4.” p. See the online help for more details of this transformation. this plot shows the difference between the two variables.05 level. as in Figure 4. the difference is significant.5). By rescaling. and the mean of the two variables. • The mean difference is shown as the horizontal line.74 4 The Matched Pairs Platform Introduction but turned 45° clockwise (see Figure 4. allows two plot options — plotting the difference by the mean. See “Script Submenu.7.6 The Matched Pairs Scatterplot 95% Confidence Interval Mean Difference Line where the two variables are equal Mean of Means The Matched Pairs menu.6: • The 45° tilted square shows the frame of the scatterplot of the original columns. A 45° rotation turns the original coordinates into a difference and a sum. In the example shown in Figure 4. 66 for details on the Script submenu. Figure 4.

8 No Change The distribution vertically is small and centered at zero. rotated coordinates. This is the high-positive-correlation pattern that is the typical situation. Figure 4.4 The Matched Pairs Platform Interpreting the Matched Pairs Plot 75 4 Paired Variables Interpreting the Matched Pairs Plot There are many possibilities for making statements regarding the patterns to be discovered in the new. The change from Y1 to Y2 is not significant.9 Highly Significant Shift Down The Y2 score is consistently lower than Y1 across all subjects. The examples below show six different situations and their interpretations. . Figure 4.

the mean is the same from Y1 to Y2. the mean is the same from Y1 to Y2. Overall. But Reverse Relationship This example shows a high variance of the difference. but the high Y1s are associated with low Y2s. but individually. Figure 4. This is a high-negative-correlation pattern. the high scores got higher and the low scores got lower. and vice-versa. . and low variance of the mean of the two values within a subject. Overall.11 No Average Shift. But Amplified Relationship This situation shows a low variance of the difference.10 No average Shift.76 4 The Matched Pairs Platform Interpreting the Matched Pairs Plot Figure 4. and high variance of the mean of the two values within a subject. and is unusual.

and low scores increase a little. but Damped Instead of Accelerated Overall. . the mean is the same from Y1 to Y2.12 No Average Shift.4 The Matched Pairs Platform Interpreting the Matched Pairs Plot 77 4 Paired Variables Figure 4. Positive Correlation. but the high scores drop a little.

.

In the list of columns. are fit using this single platform. 21 in the “The Distribution Platform” chapter. Details about this data are found in “About the Data. click on Sand Blasted?.1. fit a simple model with only main effects—no interactions. Launching the Platform Select Analyze > Fit Model from the menu bar. click on Method. select Starch Content (%) and click the Y button. hold down the Shift key. and which are predictors of thread wear in denim. no powers. To select all the model effects together. To begin with. . Unlike the other platform launchers in JMP Student Edition. including stepwise procedures. Open the data file Denim. Click Window in the top menu bar to see a list of open windows. Click Run Model.jmp. Introduction After starting JMP Student Edition. models are developed to determine which variables (if any) are predictors of the starch content of denim.5 The Fit Model Platform General linear models—those that have more complicated forms than can be fit with simple linear regression—are fit with the Fit Model platform. then click the Add button.” p. Standard least squares fitting. This brings up the Fit Model dialog. which is illustrated in Figure 5. In this introduction. the Fit Model dialog does not go away once the model is run. To see that it is still available.

Make sure the report window is the front window.80 5 The Fit Model Platform Introduction Figure 5. Title bars within the reports are also editable. Again type “Main Effects Only” and press Enter. Select Window > Set Title. change its title to something meaningful as follows. . To make identification easy.1 The Fit Model Dialog Setting Titles This report generated here is used later in this presentation. type “Main Effects Only”. so it will need to be identified among the other windows. (Window > Set Report Title on Macintosh) In the resulting dialog box. Double click on the title bar that says Response Starch Content (%).

printed in several places. Move the Scroller tool over the results report. There is a lot of information in this report. To examine the p values for the individual effects. Then. examine these results. Leverage plots are detailed in “Leverage Plots. . scroll the window until each effect is visible. Shift-clicking a tool keeps the tool active for multiple clicks. The first leverage plot is for the entire model. all of it is documented in sections that follow. Without Shift-clicking. Figure 5. JMP Student Edition reverts back to the arrow tool after other tools’ first use. 91. Shift-click the Scroller tool in the Tools toolbar.” p. In this initial look at the data.2 The Scroller Tool Scroller Tool To scroll using the Scroller tool. and its p value indicates that the model is significant. first check if the model as a whole is significant.5 The Fit Model Platform Introduction 81 5 Fit Model Examining Results Now. Scrolling is accomplished using scroll bars along the edges of the window. or by using the Scroller tool. and although only a portion is used in this example. including just below each leverage plot. look at the p values associated with each effect.

Try moving the mouse while releasing the mouse button. Remove the existing effects from the model and re-run the analysis with interactions by doing the following. several short click-and-drags.3 LS Means Table and Plot In this case. Double-click on each of the other variables. To see a plot of the LSMeans. so that the effect of each variable can be examined. this analysis is fairly primitive. called LSMeans by JMP Student Edition. in short movements. which is in turn higher than Alpha Amalyze.82 5 The Fit Model Platform Introduction Hold the mouse button and move the scroller tool to see the window move. Now. Click on the Window menu and select the Fit Model dialog. There are two ways to remove effects from the model. the LSMeans are displayed in this example model. .05 level. Figure 5. it should be clear that they are all significant at the 0. However. since it does not consider any interactions among the variables. By default. Select one of the variables and click the Remove button located above the effects list.) This illustrates the “inertia” that the scroller tool imparts on reports. Least Squares Means Least squares means. Re-running an Analysis After moving around the report a bit and observing the p-values for each effect. repeatedly. the starch content for Caustic Soda is higher than Pumice Stone. show the values of the response (starch content in this case) for levels of a nominal effect. Select LSMeans Plot from the drop-down menu on the title bar of an effect. add in an interaction effect. (That is. the plot suggests that when controlled for the other effects in the model. The response values are adjusted for the other terms in the model. We want to remove all the effects in this case. Select Method and Size of Load (lbs) in the list of columns and click the Cross button.

remembering that the Control (Windows) or (Macintosh) key allows for multiple selections. Select Method. For now. . Make sure Starch Content (%) is still in the Y role at the top of the launch dialog. Click the Macros button and select Full Factorial from the popup menu. request a full factorial model—all main effects with all possible interactions.5 The Fit Model Platform Introduction 83 5 Fit Model Since it is rather tedious to specify all main effects and all crossed effects one at a time. The appropriate effects appear in the effects list. These macros are completely discussed in “Macros. 86. Size of Load (lbs). JMP Student Edition provides some pre-defined macros to add popular effects combinations to models. and Sand Blasted? in the model effects list.” p.

at this point. where details of the linear contrast are specified.05 level. For example. This is exactly the reason that the Fit Model dialog persists. For example. Select LS Means Contrast from the drop-down menu in the title bar of the Method variable. Another report appears. remove them one at a time from the model. JMP Student Edition attaches a Contrast Dialog (Figure 5. Figure 5. Linear Contrasts Another common task is to test that levels within an effect are different from each other. to test that the Alpha Amalyze wash method is significantly different from the Pumice Stone wash method. examine the p values of each effect. If you examine the new model results. Click Run Model.4) to the report.84 5 The Fit Model Platform Introduction Click Run Model. Remove the Method*Size of Load*Sand Blasted effect. . then re-run the model to repeat the process until all effects are significant. This iterative procedure may be repeated several times. A prudent model maker would. Many models have to be tweaked after initial results are examined. Again bring the Fit Model dialog to the front using the Window menu.4 Contrast Dialog Click the + button once next to Alpha Amalyze. so it can be removed from the model as well. noting that neither of the levels of the Method*Size of Load*Sand Blasted effect are significant at the 0. you see that the Method*Size of Load effect levels are not significant. This is accomplished by using linear contrasts. this time much larger. even after clicking the Run Model button.

the Fit Model dialog box is the first step. an optional role that identifies a column whose values signify the importance of each row in the model . In JMP Student Edition. This test shows a highly significant p-value.5 The Fit Model Platform The Fit Model Dialog Box 85 5 Fit Model Click the .1 on page 80. This facilitates experimentation with the model. This is where the roles of each variable get specified and the type of fit is selected. The dialog box is completely illustrated in Figure 5. highlight it and click Remove. The Fit Model Dialog Box Regardless of the model to be fit. or ordinal logistic) is referred to as the fitting personality. nominal logistic. the type of fit (standard least squares. alternatively. giving confirmation that Alpha Amalyze is significantly different from Pumice Stone in affecting the resulting starch content of denim. If one of the variables is not significant. select the variable name and click the appropriate button. To remove a variable from its role. The Fit Model dialog is different from JMP Student Edition’s other launch dialogs in that it does not disappear after the model is launched. The roles a variable can take are: • Y. double-click on the variable’s name. Roles To assign a variable to a role. or. which identifies one or more response variables (the dependent variables) • Weight. Click Done. it can be removed and the model re-run quickly.button once next to Pumice Stone.

B. C.86 5 The Fit Model Platform The Fit Model Dialog Box • Freq. A*B. and A*B*C in the Model Effects list. and C. select the two variables to be crossed (use Control-click or for multiple selections) in the Select Columns list and click the Cross button. the Full Factorial selection places A. For example. • Select the outside effects in the column selection list and click Add or Cross. an optional role that identifies a column whose values designate the frequency of rows in the analysis Model Effects Effects are added to the model by using the buttons in the Construct Model Effects section of the dialog. select the variable name and click the Add button. -click When levels of an effect (call it B) only occur within a single level of an effect (call it A). Macros Common models can be generated using the macros drop-down list. To add a crossed effect to a model. B. and A is called the outside effect. B*C. use Full Factorial. Factorial to Degree To create a limited factorial. such as in a factorial design. To add a simple regressor to the model. Figure 5. . It creates the set of effects corresponding to all crossings of all variables selected in the columns list. with selected variables A. To add a nested effect. A second degree factorial is a very common analysis.5 Macros drop-down list The following models are available: Full Factorial To look at many crossed factors. select Factorial to Degree and enter the degree of interactions in the Degree box. A*C. • When the outside effect appears in the Model Effects list. select it again. then B is said to be nested within A. • Select the nested variable in the column selection list and click Nest.

and so forth. To specify a Response Surface effect. then select Response Surface Effect from the Attributes menu. The critical values for the surface are calculated from the parameter estimates and presented with a report on the shape of the surface. select the variable name. To specify a polynomial effect. Response Surface Response surface models find the values of the terms that produce a maximum or a minimum expected response.6 Fitting Personalities • Standard Least Squares models one or more continuous responses in the usual way through fitting a linear model by least squares. shown in Figure 5. All main effects are listed first. • Stepwise regression is an approach to selecting a subset of effects for a regression model. Figure 5. then all three-way interactions. but lists them in order of degree. Select Mixture Response Surface from the Attributes menu after selecting a variable name. Scheffe Cubic Cubic Models are an advanced topic not usually covered in an advanced course. Fitting Personalities The three available personalities for model fitting are available in the Personality drop-down list. followed by all two-way interactions. Polynomial to Degree Polynomial effects are a series of terms that are powers of a single variable.6. See the JMP help if you are interested in Scheffe cubics. • click one or more variables in the column selection list • enter the degree of the polynomial in the Degree box • select the Polynomial to Degree command in the Macros popup menu. Response surface effects appear with an ampersand (&) appended to their name. Mixture Response Surface Mixture response surface variables are selected in the same way as Response Surface Effect variables. The Stepwise feature computes estimates that are the same as those of the Standard Least Squares . This is accomplished by fitting a collection of terms in a quadratic model.5 The Fit Model Platform The Fit Model Dialog Box 87 5 Fit Model Factorial Sorted The Factorial Sorted selection creates the same set of effects as Full Factorial.

Multiple categorical responses call for MANOVA or other advanced methods. followed by a scaled parameter report and the Prediction Profiler. • Minimal Report suppresses all plots. Fit Model Report Items When a model is fit with the Standard Least Squares or Stepwise personality. Use the dialog to make changes to the model for additional fits. or make changes to the data and then refit the same model. The Stepwise personality allows only one continuous Y.88 5 The Fit Model Platform Fit Model Report Items personality. only available in the professional version of JMP. several reports appear based on the Emphasis selected in the Fit Model dialog. but it facilitates searching and selecting among many models. Select effect details and other statistical reports from the report itself. Run Model The Run Model button submits the model to the fitting platform. Request plots and reports from the report itself. . Any report that does not appear by default can be requested from the platform menu. but does not close the dialog window. Emphasis Choices The Emphasis popup menu controls which plots and tables are initially shown in the analysis report: • Effect Leverage initially displays leverage and residual plots for the whole model. • Effect Screening shows whole-model information.

It is the square root of the mean square for error in the corresponding analysis of variance table. The variance measured around this mean is the Corrected Total (C Total) mean square in the Analysis of Variance table. If there is a column assigned the role of weight. An R2 of 1 occurs when there is a perfect fit (the errors are all zero). It is important as a base model for prediction because all other models are compared to it. The table compares the model to a model containing only the mean: Source lists the three sources of variation. called Model. Weights are used in weighted least squares — an advanced topic. and it is commonly denoted as s. ters. The Summary of Fit Table The Summary of Fit table appears first and shows the following numeric summaries of the response for the multiple regression model: Rsquare (R2) estimates the proportion of the variation in the response around the mean that can be attributed to terms in the model. . and C Total. Since adding terms to an existing model always increases R2. Each is the sum of squares of the differences between the fitted response and the actual response. this adjustment compensates for adding terms to a model that already has terms in it. Observations (or Sum of Weights) records the number of observations used in the fit. this is the same as the number of rows in the data table. There is only one degree of freedom used (the estimate of the mean parameter) in the calculation of variation. this is the sum of that column’s values. DF records an associated degrees of freedom for each source of variation. Error. The Analysis of Variance Table The Analysis of Variance table shows the basic calculations for a linear model.5 The Fit Model Platform Fit Model Report Items 89 5 Fit Model Regression Reports Regression reports give textual information about the fit. An R2 of 0 means that the fit predicts the response no better than the overall response mean. Sum of Squares records an associated sum of squares for each source of variation. • The Error DF is the difference between the C Total DF and the Model DF. rather than to random error. so the C Total DF is always one less than the number of observations. Root Mean Square Error estimates the standard deviation of the random error. Rsquare Adj adjusts R2 to make it more comparable over models with different numbers of parame- The Mean of Response is the overall mean of the response values. The C Total degrees of freedom is for the simple mean model. It is also the square of the correlation between the actual and predicted response. • The Total (C Total) SS is the sum of squared distances of each response from the sample mean. It is a ratio of mean squares instead of sums of squares. If there are no missing values and no excluded rows. The total degrees of freedom are partitioned into the Model and Error terms: • The Model degrees of freedom is the number of parameters (except for the intercept) used to fit the model.

Significance probabilities of 0. or if too few interaction effects exist in an analysis of variance model. Note that the pure error DF is pooled from each group where there are multiple rows with the same values for each effect. This sum of squares corresponds to the unexplained error (residual) after fitting the regression model. . This computation converts the sum of squares to an average. The difference between the residual error from the model and the pure error is called lack of fit error. the F Ratio is higher than expected by chance alone. to try to better capture the functional form of a regressor. This is the portion of the sample error that cannot be explained or predicted no matter which form the model uses for the X variables. A Mean Square is a sum of squares divided by its associated degrees of freedom. The Lack of Fit table shows information about the error terms: Source lists the three sources of variation called Lack of Fit.05 or less are often considered evidence that there is at least one significant regression factor in the model. Note that large values of Model SS and small values of Error SS lead to large F ratios and low p values— desirable if the goal is to declare that terms in the model are significantly different from zero. If there is a significant effect in the model. and Total Error. there are no degrees of freedom for lack of fit. The Lack of Fit Table The Lack of Fit table shows a special diagnostic test and appears only when the data and the model provide the opportunity.” p. The F Ratio is the model mean square divided by the error mean square. Prob>F is the probability of obtaining a greater F value by chance alone if the specified model fits no better than the overall response mean. Sometimes. meaning that the model itself has a degree of freedom for each different X value.90 5 The Fit Model Platform Fit Model Report Items • The Error SS is the sum of squared differences between the fitted values and the actual values. it is possible to estimate the error variance independently of whether the right form of the model is the one under consideration. It tests the hypothesis that all the regression parameters (except the intercept) are zero. The only additional information is the Max RSq. A lack of fit error can be significantly greater than pure error if a regressor is in the model with the wrong functional form. This significance is also shown graphically by the whole-model leverage plot. Most practitioners check this F test first and make sure that it is significant before delving further into the details of the fit. the maximum R2 that can be achieved by using only the variables in the model. if appropriate. The remaining portions of the Lack of Fit table are similar to those of the Analysis of Variance Table. In these cases. The error for these exact replicates is called pure error. There are two common situations where there is no lack of fit test: • There are no exactly replicated points with respect to the X data. consider adding interaction terms. Pure Error. This occurs when observations are exact replicates of each other in terms of the X variables. described in “Leverage Plots. therefore. 91. and therefore there are no degrees of freedom for pure error. • The model is saturated.

The dummy variables are coded as 1. DF is the degrees of freedom for the effect test. It is the ratio of the estimate to its standard error. . An example leverage plot is shown in Figure 5.05 are often considered as significant evidence that the parameter is not zero. Regressors that are dummy indicator variables constructed from nominal or ordinal effects are labeled with the names of the levels in brackets. a larger F statistic would only occur due to random error. the note Lost DFs appears to the right of the line in the report. The Parameter Estimates table shows these quantities: Term names the estimated parameter. This is the two-tailed test against the alternatives in each direction.8). no part of the effect is testable. an estimate of the standard deviation of the distribution of the parameter estimate. Estimate lists the parameter estimates for each term. Probabilities less than 0. Simple continuous regressors have only one parameter. Crossed effects multiply the number of parameters for each term.7. Nparm is the number of parameters associated with the effect. It is the probability that if the null hypothesis is true. The fitting platform produces a leverage plot for each effect in the model. Sum of Squares is the sum of squares for the hypothesis that the listed effect is zero. Leverage Plots Leverage plots reveal the significance of an effect in the model. It is the ratio of the mean square for the effect divided by the mean square for error. The first parameter is always the intercept. Nested effects depend on how levels occur. Prob>F is the significance probability for the F ratio. Note that if DF is zero. which is coded as –1 across all the other dummy variables for that effect. Nominal effects have one less parameter than the number of levels. This is the value used to construct t-tests and confidence intervals for the parameter. Whenever DF is less than Nparm. t Ratio is a statistic that tests whether the true parameter is zero. Models with complex classification effects have a parameter for each anticipated degree of freedom.5 The Fit Model Platform Fit Model Report Items 91 5 Fit Model The Parameter Estimates Table The Parameter Estimates table shows the estimates of the parameters in the linear model and a t-test for the hypothesis that each parameter is zero. F Ratio is the F statistic for testing that the effect is zero. Continuous effects have 1 parameter.0005 appear as 0. except for the last level. The Effect Test Table The Effect Test table shows the following information for each effect: Source lists the names of the effects in the model.0000. These plots show point-by-point what the residual would be both with and without that effect in the model (See Figure 5. given the hypothesis that the parameter is zero. Simple regressors show as the name of the data table column. Values less than 0. Prob>|t| is the probability of getting an greater t statistic (in absolute value). Std Error is the standard error.

This illustrates the same test reported in the Analysis of Variance report. there is a special leverage plot titled Whole Model that shows the actual values of the response plotted against the predicted values. Significance of the effect is seen by comparing the slope of the sloped line with that of the horizontal one.7 Example Leverage Plot from Denim Data In addition. In general. Figure 5. as in Figure 5. This Whole Model leverage plot dramatizes the test that all the parameters (except intercepts) in the model are zero.9.8 General Leverage Plot residual residual constrained by hypothesis points farther out pull on the line of fit with greater leverage than the points near the middle Figure 5. The sloped line represents the values of the model with the effect included.92 5 The Fit Model Platform Fit Model Report Items Figure 5. the horizontal line on the plot represents what the values of the model would be if the effect was removed from the model.9 Significance of Effects Significant Borderline Not Significant confidence curve crosses horizontal line confidence curve asymptotic to horizontal line confidence curve does not cross horizontal line .

the estimates are the coefficients of dummy variables whose value is 1 for all levels of the variable except the last. For nominal effects.5 The Fit Model Platform Fit Model Report Items 93 5 Fit Model Effect Details In a Standard Least Squares analysis.11. the dialog normalizes after each click to make the sum for a column zero and the absolute sum equal to two after each click.” p.10 Effect Details • LS Means Table shows predicted values from the specified model across the levels of a categorical effect.11 Parameter Estimates Table The Expanded Estimates command shows the same information. • The LS Means Student’s t command requests multiple comparison tests. To construct a contrast.buttons beside the levels to be compared. Expanded Estimates The standard Fit Model output includes a Parameter Estimates Table. 84 for an example of using contrasts. • LS Means Contrast displays a dialog for specifying contrasts with respect to an effect. . the estimates are the (estimated) coefficients of each term in the linear model. dealing with least squares means. The other model factors are controlled —that is. For continuous effects. Figure 5. Figure 5. as seen in Figure 5. which gets the value -1. the following effect details are available. designated LS Means by JMP Student Edition. If possible. holding the other variables in the model constant. but with a coefficient for each continuous variable and each level of other variables. It adds to the plus or minus score proportionately.) This command is enabled only for categorical effects. Exploring the Estimates The following commands allow you to further explore the estimated coefficients of the model. • LS Means Plot plots the LSMeans for nominal and ordinal main effects and two-way interactions. Least squares means shows which levels produce higher or lower responses. click the + and . set to neutral values. More detailed descriptions of each command are available in the online help. (See “Linear Contrasts. Least squares means are also called adjusted means or population marginal means. Ordinal effects show coefficients for dummy variables that measure the difference at levels of the variable from the mean of all levels of the effect.

frequently hypothesizing that a parameter is zero. For example. However. or that some parameters are equal to others. which displays the dialog shown in Figure 5. or another value. select Custom Test from the Estimates popup menu. shown here. one.13. Figure 5.94 5 The Fit Model Platform Fit Model Report Items Figure 5. it is reasonable to test that several parameters are zero. These tests are known in statistics as a general linear hypotheses. it is possible to test far more complicated null hypotheses than this. Custom Test In introductory statistics courses. . and are tested using JMP’s Custom Test command. To test a custom hypothesis. null hypotheses are often about one variable at a time.12 Expanded Estimates Table Compare the expanded estimates with the prediction formula for this model.13 Custom Test Launch You can enter a descriptive label for this test—useful if you are doing several tests The space beneath the Custom Test title bar is an editable area for entering a test name.

you may be testing that a certain combination of the factors sums to 1. and other statistics for each test column in the dialog. Sample output for a custom test (that the Size of Load coefficient is equal to 1) is shown in Figure 5. the function value of the parameters tested. You would enter a 1 beside the “=” in this dialog. Figure 5. In the edit box to its right.5 The Fit Model Platform Fit Model Report Items 95 5 Fit Model Parameter lists the names of the model parameters. A joint F test for all columns is at the bottom. Row Diagnostics Leverage Plots (the Plot Actual by Predicted and Plot Effect Leverage commands) are covered previously in this chapter under “Leverage Plots. Add Column adds another column of zeros so that several linear functions of the parameters can be jointly tested. instead of using a Custom test. consider using the contrast dialog. One of the parameters is labeled “=”.14 Custom Test Output Note: For tests within a categorical effect. The results are appended to the bottom of the dialog. . enter the value that you are testing the contrast against. Use the Add Column button to add as many columns to the test as needed. 91.” p. the standard error. Correlation of Estimates The Correlation of Estimates option in the Estimates platform menu produces a correlation matrix for the all effects in a model.14. which tests hypotheses about the least squares means. When the test is specified. When the custom test is done. click Done to see the test performed. the report lists the test name. For example. Click in these cells to enter a new hypothesized parameter value corresponding to the desired test. To the right of the list of parameters are columns of zeros corresponding to these parameters.

you are prompted to enter an α-level for the computations. This Durbin-Watson table is only appropriate for time series data when you suspect that the errors are correlated across time. called StdErr Pred colname.96 5 The Fit Model Platform Fit Model Report Items • Plot Actual by Predicted displays the observed values by the predicted values of Y. which are the observed response values minus predicted values. Note: If you hold down the Shift key and select Save Individual Confidence Interval. This is useful for predicting values in new rows or for obtaining a picture of the fitted model. If you do not need the formula with the column of predicted values. use the Save Predicted Values option. The Durbin-Watson table has a popup command that computes and displays the exact probability associated with the statistic. • Plot Residual By Row displays the residual value by the row number of its observation. • Durbin-Watson Test displays the Durbin-Watson statistic to test whether or not the errors have first-order autocorrelation. The autocorrelation of the residuals is also shown. This is the leverage plot for the whole model. Mean Confidence Interval creates two new columns called Lower 95% Mean colname and Upper 95% Mean colname. Predicted Values creates a new column called Predicted colname that contain the predicted val- ues computed by the specified model. Save Commands The Save submenu offers the following choices. The new columns contain lower and upper 95% confidence limits for individual response values. You typically want to see the residual values scattered randomly about zero. containing the predicted values computed by the specified model. The prediction formula can require considerable space if the model is large. Individual Confidence Interval creates two new columns called Lower95% Indiv colname and Upper95% Indiv colname. The new columns contain the lower and upper 95% confidence limits for the line of fit. Use the Column Info command and click the Edit Formula button to see the prediction formula. you are prompted to enter an α-level for the computations. Note: If you hold down the Shift key and select Save Mean Confidence Interval. . It differs from the Save Predicted Values column in that the prediction formula is saved with the new column. • Plot Residual By Predicted displays the residual values by the predicted values of Y. Std Error of Predicted creates a new column. • Plot Effect Leverage produces a leverage plot for each effect in the model showing the point-by-point composition of the test for that effect. containing the standard errors of the predicted values. called Pred Formula colname. Studentized Residuals creates a new column called Studentized Resid colname. where colname is the name of the response variable: Prediction Formula creates a new column. Residuals creates a new column called Residual colname containing the residuals. Each selection generates one or more new columns in the current data table titled as shown. The new col- umn values are the residuals divided by their standard error.

containing the stan- dard errors of the individual predicted values. If the response column name is R and the effects are X1 and X2. The new columns consist of an X and Y column for each effect in the model.5 The Fit Model Platform Fit Model Report Items 97 5 Fit Model Std Error of Residual creates a new column called. Effect Leverage Pairs creates a set of new columns that contain the values for each leverage plot. StdErrResid colname. called StdErr Indiv colname. The columns are named as follows. Std Error of Individual creates a new column. containing the stan- dard errors of the residual values. then the new column names are X Leverage of X1 for R X Leverage of X2 for R Y Leverage of X1 for R Y Leverage of X2 for R. .

.

This chapter uses the term “significance probability” in a mechanical way to represent that the calculation would be valid in a fixed model. The significance levels on the statistics for selected models violate the standard statistical assumptions because the model has been selected rather than tested within a fixed model. but it facilitates searching and selecting among many models. the approach has been of practical use for 30 years in helping to trim out models to predict many kinds of responses. stepwise regression is a personality of the Model Fitting platform—it is one of the selections in the Fitting Personality popup menu on the Model Specification dialog (see Figure 5. On the positive side. To find a good oxygen uptake prediction equation. It is used when there is little theory to guide the selection of terms for a model and the modeler.jmp data table in the Sample Data folder. Aerobic fitness can be evaluated using a special test that measures the oxygen uptake of a person running on a treadmill for a prescribed distance. . The Stepwise platform lets you search through models with combinations of effects and choose the model you want. it would be more economical to find a formula that uses simpler measurements that evaluate fitness and predict oxygen uptake. Open the Fitness.jmp data table. you need to compare many different regression models. weight. brings statistical sense to model selection statistics. measurements of age. To identify such an equation.5 miles.1 shows a partial listing of the Fitness. As an example. and pulse were taken for 31 participants who ran 1. Miller (1990). This data shows results from an aerobic fitness study. by A. Introduction In JMP.6 on page 87). However. runtime. The book Subset Selection in Regression. wants to use whatever seems to provide a good fit. recognizing that the true significance probability could be nowhere near the reported one. J. The approach is somewhat controversial. in desperation. Figure 6. The Stepwise feature computes estimates that are the same as those of other least squares platforms.6 Stepwise Regression Stepwise regression is an approach to selecting a subset of effects for a regression model.

. Runtime. and MaxPulse as Effects. Choose Oxy as the Y response Choose Weight.2 Model Specification Dialog for a Stepwise Model To do stepwise regression. Click Run Model. RstPulse.1 The Fitness Data Table Note: For purposes of illustration. certain values of MaxPulse and RunPulse have been changed from data reported by Rawlings (1988.105). p. Figure 6.100 6 Stepwise Regression Introduction Figure 6. In the Model Specification dialog. Select Stepwise from the Fitting Personality popup menu. Select Fit Model in the Analyze menu. RunPulse.

a combination of the two detailed below. we to add any effects that are significant at the 0.25 as the Prob to Enter. To add the first detected effect. Leave 0. .6 Stepwise Regression Introduction 101 6 Stepwise Regression When the report appears. This example uses forward selection. This produces a report identical to those seen in “The Fit Model Platform” chapter.3).3 Current Estimates Table After one step After all steps Now that we have selected the effects that contribute to explaining Oxy. We now want to add significant effects. The bottom table in Figure 6.25 level or better.25 specified in the Stepwise Regression Control Panel. you are presented with a control panel. Figure 6. To automatically add all the detected effects automatically (rather than manually with the Step button) Click Go to see the stepwise process run to completion. the most significant term Runtime is entered into the model (top Current Estimates table in Figure 6. Click the Step button. After one step. In this example. Click Run Model. Click Make Model. We now have a choice of three stepwise methods: Forward (where effects are added as they become significant).3 shows that all the terms have been added except RstPulse and Weight which are not significant at the Prob to Enter value of 0. we can make a model and examine its analysis. A Fit Model dialog appears. so Leave the default Forward selection method. used to specify how effects should enter or exit the model. Backward (where effects are removed as they become noon-significant) or Mixed.

shown next. Click the field to enter a value. begin or stop the selection process. determine the method of selecting effects. which is an interactive control panel for operating the platform • The Current Estimates table. and create a model. buttons and a popup menu. has editable areas. The following sections describe the components of these areas and tell how to use them. . • Prob to Leave is the significance probability that must be attributed to a regressor term in order for it to be considered as a backward step and removed from the model. which shows the current status of the specified model. • Direction accesses the popup menu shown here. and provides additional control features • The Step History table. Click the field to enter a value. given that term is significant at the level specified by Prob to Enter. which lets you choose how you want variables to enter the regression equation. the stepwise platform displays a window that shows three areas: • The Stepwise Regression Control panel. • Forward brings in the regressor that most improves the fit. You use the Control Panel as follows: • Prob to Enter is the significance probability that must be attributed to a regressor term for it to be considered as a forward step and entered into the model. Stepwise Regression Control Panel The Stepwise Regression Control Panel (Control Panel for short). which lists the steps in the stepwise selection. You use these dialog features to limit regressor effect probabilities.102 6 Stepwise Regression The Stepwise Window The Stepwise Window When launched.

) if the effect is not currently in the model. It includes the most significant term that satisfies Prob to Enter and removes the least significant term satisfying Prob to Leave.6 Stepwise Regression The Stepwise Window 103 6 Stepwise Regression • Backward removes the regressor that affects the fit the least. • Remove All removes all terms from the model. as is shown here. remove. You use check boxes to define the stepwise regression process: • Lock locks a term in or out of the model. • Parameter lists the names of the regressor terms (effects). The following quantities update continually during the fitting process and are used in determining the final model. The intercept is permanently locked into the model. The platform begins with no terms in the model except for the intercept. • Stop stops the background selection process. • Make Model forms a model for the Model Specification Dialog from the model currently showing in the Current Estimates table. • Step stops after each step of the stepwise process • Enter All enters all unlocked terms into the model. In cases where there are nominal or ordinal terms. and lock in model effects. • Entered shows whether a term is currently in the model. It continues removing terms until the remaining terms are significant and then it changes to the forward direction. Lock does not permit a term that is checked to be entered or removed from the model. Make Model can create new data table columns to contain terms that are needed for the model. It is missing (. The process continues to run in the background until the model is finished. • Estimate is the current parameter estimate. You can click a term’s check box to manually bring an effect into or out of the model. given that term is not significant at the level specified in Prob to Leave. Click an effect’s check box to change its lock status. • Mixed alternates the forward and backward steps. Current Estimates Table The Current Estimates table lets you enter. Buttons on the controls panel let you control the stepwise processing: • Go starts the selection process. .

For example. The adjusted R2 is useful in stepwise procedure because you are looking at many different models and want to adjust for the number of terms in the model. DFE. In this example. • AIC is Akaike’s Information Criterion. MSE are the sum of squares. select the model where Cp approaches p. Statistics for the current model appear above the list of effects: • SSE. the Model Specification dialog appears as shown in Figure 6. and mean square error (residual) of the current model. it could have a reported SS of zero. It is the square of a t-ratio. It is in quotation marks because it does not have an F-distribution for testing the term because the model was selected as it was fit. If you use Mallow’s Cp as a model selection criterion. the Step History table records the effect of adding a term to the model. the number of parameters in the model. degrees of freedom. If a term is restricted in some fashion. if you click Make Model after the backward selection in Figure 6. Like the “F Ratio.4. three or four variables appear to be a good choice for a regression model. • “Prob>F” is the significance level associated with the F-statistic.” it is in quotation marks because it is not to be trusted as a real significance probability.104 6 Stepwise Regression The Stepwise Window • nDF is the number of degrees of freedom for a term. • RSquareAdj adjusts R2 to make it more comparable over models with different numbers of parameters by using the degrees of freedom in its computation. Make Model When you click Make Model. • SS is the reduction in the error (residual) SS if the term is entered into the model or the increase in the error SS if the term is removed from the model. • “F Ratio” is the traditional test statistic to test that the term effect is zero. A term has more than one degree of freedom if its entry into a model also forces other terms into the model. Step History Table As each step is taken. • RSquare is the proportion of the variation in the response that can be attributed to terms in the model rather than to random error. the model seen in the Current Estimates table appears in the Model Specification dialog.4 without a fitting personality selection. The Step History table for the Fitness data example shows the order in which the terms entered the model and shows the effect as reflected by R2 and Cp. . • Cp is Mallow’s Cp criterion.

It is accessible from the red triangle drop-down menu on the stepwise control panel (see Figure 6. all possible models of the regression parameters are run.6 Stepwise Regression All Possible Regressions 105 6 Stepwise Regression Figure 6. Figure 6.4 New Model Specification dialog from Forward Stepwise Procedure All Possible Regressions Stepwise includes an All Possible Models command. resulting in the report seen in Figure 6.5). RunPulse.6. and MaxPulse.5 All Possible Models When selected. Note that this report is for a three-variable model consisting of Runtime. .

6 All Models Report The models are listed in decreasing order of the number of parameters they contain. but can strain computer memory (and human patience). Note: Mallow’s Cp statistic is computed.106 6 Stepwise Regression All Possible Regressions Figure 6. but initially hidden in the table. We suggest that no more than about 15 variables be used with this platform. Right-click (Control-click on the Macintosh) and select Columns > Cp from the menu that appears. . More may be possible. The model with the highest R2 for each number of parameters is highlighted. To make it visible.

Choose the Graph > Control Chart > XBar command.jmp data in the Quality Control sample data folder (taken from the ASTM Manual on Presentation of Data and Control Chart Analysis). To replicate this example. • The vertical axis of a control chart is scaled in the same units as the summary statistic. • The horizontal axis of a control chart identifies the subgroup samples. • The center line on a Shewhart control chart indicates the average (expected) value of the summary statistic when the process is in statistical control. Introduction Control charts have the following characteristics: • Each point represents a summary statistic computed from a subgroup sample of measurements of a quality characteristic. This monitoring process is often called quality control or QC.7 Control Charts Control charts are a graphical and analytic tool for deciding whether a process is in a state of statistical control and for monitoring an in-control process.1. • The upper and lower control limits. give the range of variation to be expected in the summary statistic when the process is in statistical control. labeled UCL and LCL. An X -chart and an R-chart for the process are shown in Figure 7. • Graph > Control Chart subcommands create control charts that can be updated dynamically as samples are received and recorded or added to the data table. The quality characteristic of interest is the Weight column. A subgroup sample of four is chosen. • A point outside the control limits signals the presence of a special cause of variation. . out of control point measurement axis UCL centerline LCL subgroup sample axis The following example uses the Coating.

Sample six indicates that the process is not in statistical control. Change the Sample Size Constant from 5 to 4. Click OK. The corresponding rows highlight in the data table. Specify Weight as the Process variable. Since our example has four samples in each subgroup. To check the sample values.108 7 Control Charts Introduction Note the selected chart types of XBar and R. . click the sample six summary point on either control chart.

1 Variables Charts for Coating Data { You can use Fit Y by X for an alternative visualization of the data. Figure 7.2 show that the sixth sample has a small range of high values.7 Control Charts Introduction 109 7 Quality Control with Control Charts Figure 7. First. change the modeling type of Sample to Nominal. Specify the interval variable Weight as Y and the nominal variable Sample as X. The box plots in Figure 7.2 Quantiles Option in Fit Y By X Platform all values in sample six are high .

. you specify exactly how you want your charts created.) Initially. the dialog shows three kinds of information: • process information.3 Control Chart Menu Specific information shown for each section varies according to the type of chart you request.110 7 Control Charts The Control Chart Launch Dialog The Control Chart Launch Dialog When you select a Control Chart from the Graph > Control Chart menu (Figure 7. you see a Control Chart Launch dialog similar to the one in Figure 7.3).4.4 Control Chart Launch Dialog process information chart type information limits specification enter or remove known statistics Add capability analysis to report Through interaction with the Launch dialog. The following sections describe the panel elements. Figure 7. Figure 7. for measurement variable selection • chart type information • limits specification. (The exact controls vary depending on the type of chart you choose.

c-. it identifies the sample. • If the sample subgroups have an unequal number of rows or have missing values and you have a column identifying each sample. • If the sample subgroups are the same size. specify measurements as the process. . Sample Label enables you to specify a variable whose values label the horizontal axis and can also identify unequal subgroup sizes. The illustration in Figure 7. and u-charts). • For attribute charts. using the Coating. check the Sample Grouped by Sample Label radio button and enter the sample identifying column as the sample label. • For variables charts. a Range Span text box appears.jmp sample data from the Quality Control sample data folder. Here. the samples are identified by their subgroup sample number. The range span specifies the number of consecutive measurements from which the moving ranges are computed. np-. check the Sample Size Constant radio button and enter the size into the text box. For attribute charts (p-. If you entered a Sample Label variable. When the chart type is IR. Process selects variables for charting. its values are used to label the horizontal axis. this variable is the subgroup sample size. specify the defect count or defective proportion as the process.5 shows an X -chart for a process with unequal subgroup sample sizes.7 Control Charts The Control Chart Launch Dialog 111 7 Quality Control with Control Charts Process Information The Launch dialog displays a list of columns in the current data table. If no sample label variable is specified. you specify the variables to be analyzed and the subgroup sample size. In Variables charts.

Phases generate. phases might correspond to time periods during which a new process is brought into production and then put through successive changes. or sections. . a new sigma. set of limits. zones.5 Variables Charts with Unequal Subgroup Sample Sizes Phase The Phase role enables you to specify a column identifying different phases. for each level of the specified Phase variable.112 7 Control Charts The Control Chart Launch Dialog Figure 7. For example. and resulting tests. A phase is a group of consecutive observations in the data table.

7 Control Charts The Control Chart Launch Dialog 113 7 Quality Control with Control Charts Chart Type Information Shewhart control charts are broadly classified as variables charts and attribute charts. . • The IR menu selection has checkbox options for the Individual Measurement. Moving Range. Moving average charts and cusum charts can be thought of as special kinds of variables charts. and Median moving range charts. and S checkboxes. X . r.and s- IR CUSUM • XBar charts menu selection gives XBar. R. • The Cusum chart is a special chart for means or individual measurements.

. and Run Charts. Figure 7. and U charts. have no additional specifications. Reasonable choices for α are 0. Parameters You specify computations for control limits by entering a value for k (K Sigma) or by entering a probability for α(Alpha). C. The dialog default for K Sigma is 3. the number of sigmas. K Sigma specifies control limits at k sample standard errors above and below the expected value. The example here shows 1 as the standard deviation of the process variable and 20 as the mean measurement. click K Sigma and enter a positive k value into the textbox.6 K Sigma =3 (left) and K Sigma=4 (right) Control Limits Alpha specifies control limits (also called probability limits) in terms of the probability α that a single subgroup statistic exceeds its control limits.001 Using Specified Statistics If you click the Specify Stats (when available) button on the Control Chart Launch dialog. which is three standard deviations. There must be a specification of either K Sigma or Alpha. This lets you enter historical statistics (statistics obtained from historical data) for the process variable. a tab with editable fields is appended to the bottom of the launch dialog. To specify alpha.01 or 0.114 7 Control Charts The Control Chart Launch Dialog • P. click the Alpha radio button and enter the probability you want. K Sigma allows specification of control limits in terms of a multiple of the sample standard error.jmp data with control lines drawn with K Sigma = 3 and K Sigma = 4. To specify k. NP. The usual choice for k is three. The examples shown in Figure 7. assuming that the process is in control.6 compare the X -chart for the Coating. The Control Chart platform uses those entries to construct control charts. which shows as the center line.

a dialog appears as the platform is launched asking for specification limits. Tailoring the Horizontal Axis When you double-click the x-axis. Its items vary with the type of chart you select. number of ticks. the x-axis is labeled at every other tick.7. Figure 7. the Pickles. it is labeled in the plot as µ0. If you check the Capability option on the Control Chart launch dialog (see Figure 7.7 Control Charts Tailoring the Horizontal Axis 115 7 Quality Control with Control Charts Note: When the mean is user-specified. as shown in the chart on the right. the x-axis is labeled once for each day.7 Example of Labeled x-Axis Tick Marks Display Options Control Charts have popup menus that affect various parts of the platform: • The menu on the top-most title bar affects the whole platform window. by default. as shown to the left in Figure 7. After entering the specification limits and clicking OK. axis values.JMP data lists eight measures a day for three days. the X Axis Specification dialog appears for you to specify the format. If you specify a label at an increment of eight. which is the default option.4). In this example. Sometimes this gives redundant labels. with seven ticks between them. The standard deviation for the control chart selected is sent to the dialog and appears as a Specified Sigma value. For example. • There is a menu of items on the chart type title bar with options that affect each chart individually. gridline and reference lines to display on the x-axis. capability output appears in the same window next to the control chart. .

Initially. 25th percentile. the points show. The box plot shows the subgroup maximum. You can use this option to suppress the markers denoting subgroup means when the Box Plots option is in effect. The Box Plots option is available only for X -charts. Box Plots superimposes box plots on the subgroup means plotted in a Mean chart. .116 7 Control Charts Display Options Single Chart Options The popup menu of chart options appears when you click the icon next to the chart name. Needle connects plotted points to the center line with a vertical line segment. Connect Points toggles between connecting and not connecting the points. Markers for subgroup means show unless you deselect the Show Points option. The control limits displayed apply only to the subgroup mean. Show Points toggles between showing and not showing the points representing summary statistics. It is most appropriate for larger subgroup sample sizes (more than 10 samples in a subgroup). or right-click the chart space. and median. 75th percentile. minimum.

Tests 5 to 8 apply to Mean charts and Individual Measurement charts only. the Tests option is dimmed. until the sample sizes become equal. Options are Thin. Tests 1 to 4 apply to Mean. Tests shows a submenu that enables you to choose which tests to mark on the chart when the test is positive.jmp Data Connect Color displays the JMP-SE color palette for you to choose the color of the line segments used to connect points. These spe- . Limits Color displays the JMP-SE color palette for you to choose the color of the line segments used in the upper and lower limits lines. Medium. Show Center Line initially displays the center line in green. Deselecting Show Center Line removes the center line and its legend from the chart. Line Width allows you to pick the width of the control lines. but will not appear for charts whose control limits vary due to unequal subgroup sample sizes. Tests apply only for charts whose limits are 3σ limits.7 Control Charts Display Options 117 7 Quality Control with Control Charts Figure 7. If tests do not apply to a chart. Tests apply.8 Box Plot Option and Needle Option for Airport. Center Line Color displays the JMP-SE color palette for you to choose the color of the line segments used to draw the center line. Show Control Limits toggles between showing and not showing the chart control limits and their legends. or Thick. Individual and attribute charts.

. The seven zone lines are set one sigma apart. 122. LCL. c-. You can check each chart to show or hide it. B. UCL. regardless of the sample size being constant. Window Options The popup menu on the window title bar lists options that affect the report window. Select the chart on which you want the curve based. JMP-SE opens a new window containing the curve. and wanted to flag any points beyond the limits of an r-chart. See the text and chart in “Westgard Rules. K. The example menu shown here appears if you request XBar and R at the same time. see “Tests for Special Causes” on page 119 later in this chapter. and u-charts.jmp sample data. This test works on all charts with limits. np-. and regardless of the size of k or the width of the limits. Show Zones toggles between showing and not showing the zone lines with the tests for special causes. The zones are labeled A. When you choose the OC Curve option from the control chart option list. Alternatively. The curve shows how the probability of accepting a lot changes with the quality of the sample. and C as shown here in the Mean plot for weight in the Coating. and sample size. Westgard Rules are detailed in a later section. Test Beyond Limits flags as a “*” any point that is beyond the limits. Sigma. p-. you can run an OC curve directly from the QC tab on the JMP-SE Starter window. using all the calculated values directly from the active control chart. you could use this command.” p. For more information on special causes tests.118 7 Control Charts Display Options cial tests are also referred to as the Western Electric rules. OC Curve gives Operating Characteristic (OC) curves for specific control charts. Control Chart tests use the zone lines as boundaries. then a dialog prompts you for Target. For example. OC curves are defined in JMP-SE only for X -. if you had unequal sample sizes. centered on the center line.

the label of the numerically lowest test specified appears beside the point. The following options show for all control charts except Run charts: Show Limits Legend shows or hides the Avg. The left-hand chart in Figure 7.7 Control Charts Tests for Special Causes 119 7 Quality Control with Control Charts The specific options that are available depend on the type of control chart you request. Capability launches a capability analysis. Figure 7. . Details are found in Figure 2. The middle chart has samples 8. and LCL values to the right of the chart. If a selected test is positive. You can select one or more tests for special causes with the options popup menu.9 Example of Connect thru Missing Option Script has a submenu of commands available to all platforms that let you redo the analysis or save the JSL commands for the analysis to a window or a file. Unavailable options show as grayed menu items. UCL.9 is a control chart with no missing points. where the sequence is the moving set of points evaluated for that particular test. Nelson (1984) developed the numbering notation used to identify special tests on control charts. and 10 missing with the points not connected. Tests for Special Causes The Tests option in the chart type popup menu displays a submenu for test selection. When you select several tests for display and more than one test signals at a particular point. Connect thru Missing connects points when some samples have missing values. 9. the last point in the test sequence is labeled with the test number. which is the default. The right-hand chart appears if you use the Connect thru Missing option.20 on page 39.

Small trends will be signaled by this test before Test 1. np-. Note: All Tests and zones require equal sample sizes in the subgroups of nonmissing data.1 on page 120 lists and interprets the eight tests. Table 7. A with zones C nearest the center line. and Figure 7. and 6 apply to the upper and lower halves of the chart separately. each with a width of one standard deviation. or operators. or a single aberration in the process. gle (upper or lower) side of Zone C or beyond Test 3 Six points in a row steadily detects a trend or drift in the process increasing or decreasing mean. C.120 7 Control Charts Tests for Special Causes Western Electric Rules Western Electric rules are implemented in the Tests submenu. • Any point lying on a line separating two zones lines is considered belonging to the outermost zone. 5. Tests 3. C. and u-charts. B.detects a shift in the process mean. 3σ limits zones centerline Nelson (1984. Tests 1 through 8 apply to Mean ( X ) and individual measurement charts. Test 2 Nine points in a row in a sin.10 illustrates the tests. the R-chart can be used to rule out increases in variation. • The zones are labeled A. 4. Tests 1. • A point lies in Zone B or beyond if it lies beyond the line separating zones C and B. The following rules apply to each test: • The area between the upper and lower limits is divided into six zones. That is. 2. if it is more than one standard deviation from the centerline. 1985) for further recommendations on how to use these tests. For interpreting Test 1. c-. Test 4 Fourteen points in a row detects systematic effects such as two alteralternating up and down nately used machines. Tests 1 through 4 can also apply to p-. 1985) Table 7. B. 7. an increase in the standard deviation. and 8 apply to the whole chart. . See Nelson (1984. vendors.1 Description and Interpretation of Special Causes Tests Test 1 One point beyond Zone A detects a shift in the mean.

Test 7 Fifteen points in a row in detects stratification of subgroups when Zone C. Test 6 Four out of five points in detects a shift in the process mean. but subgroups come from different sources with different means. Any Zone B or beyond four out of five points provide a positive test.7 Control Charts Tests for Special Causes 121 7 Quality Control with Control Charts Table 7. Any two out of three points provide a positive test. . above and below the the observations in a single subgroup center line come from various sources with different means. Test 8 Eight points in a row on both detects stratification of subgroups when sides of the center line with the observations in one subgroup come none in Zones C from a single source.1 Description and Interpretation of Special Causes Tests Test 5 Two out of three points in a detects a shift in the process average or row in Zone A or beyond increase in the standard deviation.

For example. 1985) Westgard Rules Westgard rules are implemented under the Westgard Rules submenu of the Control Chart platform. 1 2s refers to a test that one point is two standard deviations away from the mean.122 7 Control Charts Tests for Special Causes Figure 7. . The different tests are abbreviated with the decision rule for the particular test.10 Illustration of Special Causes Tests Test 1: One point beyond Zone A UCL A B C C B A Test 2: Nine points in a row in a single (upper or lower) side of Zone C or beyond UCL A B C C B A Avg 2 Avg LCL LCL 1 Test 3: Six points in a row steadily increasing or decreasing A 3 UCL B C C B A Test 4: Fourteen points in a row alternating up and down A B C C B A UCL Avg LCL Avg LCL 4 Test 5: Two out of three points in a row in Zone A or beyond UCL A 5 5 B C C B A Test 6: Four out of five points in a row in Zone B or beyond UCL A 6 B C C B A Avg 5 LCL Avg LCL Test 7: Fifteen points in a row in Zone C (above and below the centerline) UCL A B C C B A Test 8: Eight points in a row on both sides of the centerline with none in Zone C UCL A B C C B A 7 Avg LCL Avg 8 LCL Nelson (1984.

+3s +2s tive measurements are more than one +1s standard deviation from the mean. -1s -2s -3s Rule 2 2s is triggered when two consecu+3s +3s +2s +1s -1s -2s -3s UCL Avg LCL UCL Avg LCL UCL Avg LCL UCL Avg LCL UCL Avg LCL UCL Avg LCL tive control measurements are farther than +2s +1s two standard deviations from the mean. Rule 1 3s refers to a rule common to +3s Levey-Jennings plots where the control limits are set 3 standard deviations away +2s +1s from the mean.2 Westgard Rules Rule 1 2s is commonly used with Levey-Jennings plots. -1s -2s -3s Rule 10 X is triggered when ten consecu.7 Control Charts Tests for Special Causes 123 7 Quality Control with Control Charts Because Westgard rules are based on sigma and not the zones. +1s -1s -2s -3s .+3s +2s tive points are on one side of the mean. The rule is triggered when any one point goes beyond these limits. -1s -2s -3s Rule 4s is triggered when one measure- ment in a group is two standard deviations above the mean and the next is two standard deviations below. The rule is triggered when any one point goes beyond these limits. +3s +2s +1s -1s -2s -3s Rule 4 1s is triggered when four consecu. they can be computed without regard to constant sample size. where control limits are set 2 standard deviations away from the mean. Table 7.

and Deleted Samples The following table summarizes the effects of various conditions on samples and subgroups: Table 7. the sample label will still be removed from the axis. and before creating the chart. and the limits will change. and the sample marker does not appear on the graph. Hidden. Sample is both excluded and hidden Sample is included in the calculation of the limits. For example. Shewhart Control Charts Shewhart control charts are broadly classified into control charts for variables and control charts for attributes. Hidden. if the second observation in the sample is hidden. it does not appear on the graph. the sample label will still appear on the axis. axis will not include a value for the sample. Sample is hidden after creating the Sample is included in the calculation of the limits. and it chart. Moving average charts are special kinds of control charts for variables. but limits remain the same. Sample is hidden before creating the Sample is included in the calculation of the limits. 2 An exception to the exclude/hide rule: Tests for Special Causes can flag if a sample is excluded. the graph will shift. The sample marker will disappear from the graph. Some additional notes: 1 Exclude and Hide operate only on the row state of the first observation in the sample. but does after creating the chart. the sample will still appear on the chart. Sample is excluded after creating the Sample is included in the calculation of the limits. but does chart. and deleted after creating chart.124 7 Control Charts Excluded. does not appear on the graph. and Deleted Samples Excluded. Nothing will change on the output by excluding a sample with the graph open. appears in the graph. while the first observation is not hidden. Sample is both excluded and hidden Sample is not included in the calculation of the limits. The sample marker will disappear from the graph. The sample marker will disappear from the graph. Data set is subsetted with Sample Sample is not included in the calculation of the limits.3 Excluded. the sample label will still appear on the axis. but does chart. it appears on the graph. not appear on the graph. but limits remain the same. but will not flag if a sample is hidden. Hidden. not appear on the graph. not appear on the graph. The Control Chart platform in JMP-SE implements a variety of control charts: . Data set is subsetted with Sample Sample is not included in the calculation of the limits. and Deleted Samples Sample is excluded before creating the Sample is not included in the calculation of the limits. but chart. the deleted before creating chart.

XBar-. Coating.Charts For quality characteristics measured on a continuous scale.s Example. To replicate this example.11. Sample as the Sample Label variable.or S-chart. IR-. in the Quality Control sample data folder. • Choose the Graph > Control Chart > XBar command. R-. r-.jmp. • Phase Control Charts for X -. • Specify the column. • The Sample Size option should automatically change to Sample Grouped by Sample Label. The IR selection gives two additional chart types: • Individual Measurement charts display individual measurements • Moving Range charts display moving ranges of two or more successive measurements. is that they update dynamically as data is added or changed in the table. . • Specify Weight 2 as the Process variable. and S-charts. different from other platforms in JMP-SE. c-. np-. p-. if you are charting individual measurements. c-. • Click OK. This time the quality characteristic of interest is the Weight 2 column.charts One feature special to Control Charts. • Individual and Moving Range charts. Or. np-. and u. An X -chart and an S chart for the process are shown in Figure 7. the individual measurement chart shows above its corresponding moving range chart. and u-charts. • p-. • Run charts display data as a connected series of points.7 Control Charts Shewhart Control Charts for Variables 125 7 Quality Control with Control Charts • X -. a typical analysis shows both the process mean and its variability with a mean chart aligned above its corresponding R. X . • Select the chart types of XBar and S.and S-charts with varying subgroup sizes This example uses the same data as example 1. R-. and S. Shewhart Control Charts for Variables Control charts for variables are classified according to the subgroup summary statistic plotted on the chart: • X -charts display subgroup means (averages) • R-charts display subgroup ranges (maximum – minimum) • S-charts display subgroup standard deviations.

jmp. Run Charts Run charts display a column of data as a connected series of points.11 X and S charts for Varying Subgroup Sizes Weight 2 has several missing values in the data. and the zone and/or test option is selected. as neither are valid on charts with unequally sized samples. . 3. Although. If the samples change while the chart is open and they become equally sized. each sample has the same number of observations. 5. so you may notice the chart has uneven limits. Note: Although they will turn on and appear checked. samples 1. The following example is a Run chart for the Weight variable from Coating. and 7 each have a missing value. the zones and/or tests will be applied immediately and appear on the chart.126 7 Control Charts Shewhart Control Charts for Variables Figure 7. no zones or tests will appear on the chart until all samples are equally sized.

a line is drawn through the center value of the column. Run charts can also plot the group means when a sample label is given. Otherwise. When Use Median is selected. . and 3 PM. these charts should be interpreted with care. both the overall mean and median are saved. Because the pickles are sensitive to acidity and produced in large vats. and acidity measurements. the mean is used. Moving ranges are computed for the number of consecutive measurements you enter in the Range Span box. time. The data table records day. When saving limits to a file. Example.7 Control Charts Shewhart Control Charts for Variables 127 7 Quality Control with Control Charts Figure 7. 2. the median is used as the center line. either on the dialog or through a script.13 creates Individual Measurement and Moving Range charts with date labels on the horizontal axis. The dialog in Figure 7. Moving Range Chart Type displays moving ranges of two or more successive measurements. Individual Measurement and Moving Range Charts The Pickles. Individual Measurement charts are appropriate when only one measurement is available for each subgroup sample.jmp data in the Quality Control sample data folder contains the acid content for vats of pickles. Because moving ranges are correlated. high acidity ruins an entire pickle vat. Individual Measurement Charts Individual Measurement Chart Type displays individual measurements.12 Run Chart When you select the Show Center Line option in the Run Chart drop-down. The acidity in four vats is measured each day at 1. The default range span is 2.The center line is determined by the Use Median setting of the platform drop-down.

• Select both Individual Measurement and Moving Range chart types. • Click OK. • Specify Date as the Sample Label variable. • Specify Acid as the Process variable. . The individual measurement and moving range charts shown in Figure 7.128 7 Control Charts Shewhart Control Charts for Variables Figure 7.13 Launch Dialog for Individual Measurement and Moving Range Chart To complete this example.14 monitor the acidity in each vat produced. • Choose the Graph > Control Chart > IR command.

rather than the Average Moving Range. Another type of data is count data. where the variable of interest is a discrete count of the number of defects or blemishes per subgroup. Since the counts are measured per subgroup. Shewhart Control Charts for Attributes In the previous types of charts. like variables charts. measurement data was the process variable. as they are based on binomial and poisson models. the number of defects is counted non-conforming The subgroups are a The subgroups vary in The subgroups are a The subgroups vary in constant size size constant size size np-chart p-chart c-Chart u-chart . the limits on the Individuals chart use the Median Moving Range as the sigma. For discrete count data.7 Control Charts Shewhart Control Charts for Attributes 129 7 Quality Control with Control Charts Figure 7. This data is often continuous. Attribute charts.14 Individual Measurement and Moving Range Charts for Pickles Data Note: If you choose a Median Moving range chart. are classified according to the subgroup sample statistic plotted on the chart: Table 7. it is important when comparing charts to determine whether you have similar number of items in the subgroups between the charts.4 Determining which Attribute Chart to use Each item is judged as either conforming or For each item. attribute charts are applicable. and the charts are based on continuous theory.

follow these steps: • Choose the Graph > Control Chart > NP command. it was deemed nonconforming or defective. • Change the Constant Size to 400. p. Thus. p-Charts Again. Since each subgroup for a p-chart consists of N items.jmp data. and an item is judged as either conforming or nonconforming. • np-charts display the number of nonconforming (defective) items in constant sized subgroup samples. Figure 7. • u-charts display the number of nonconformities (defects) per unit in subgroup samples that can have a varying number of inspection units. the maximum number of nonconforming items in subgroup i is Ni. • Choose # defects as the Process variable. The washers were inspected for finish defects such as rough galvanization and exposed steel. .and np-Charts Example. and an item is judged as either conforming or nonconforming. The example here illustrates an np-chart for the number of defects. using the Washers. • Click OK. which would allow for varying sample sizes.130 7 Control Charts Shewhart Control Charts for Attributes • p-charts display the proportion of nonconforming (defective) items in subgroup samples which can vary in size.15 np-Chart Example. we can specify a sample size variable. the defect count represents how many washers were defective for each lot of size 400. np-Charts The Washers. Since each subgroup for a np-chart consists of Ni items. To replicate this example. the maximum number of nonconforming items in a subgroup is N.jmp data in the Quality Control sample data folder contains defect counts of 15 lots of 400 galvanized washers. If a washer contained a finish defect. • c-charts display the number of nonconformities (defects) in a subgroup sample that usually consists of one inspection unit.

Note:When you generate a u-chart. The number of boxes inspected (per day) is the subgroup sample size. • Choose # defects as the Process variable. The u-chart. Figure 7. the y-axis. • Choose Lot as the Sample Label variable. • Choose Lot Size as the Sample Size variable. JMP-SE launches the Poisson Fit in Distri- bution and gives a Poisson-specific capability analysis. Avg and limits are all different since they are now based on proportions u-Charts The Braces.16 p-Chart Note that although the points on the chart look the same as the np-chart. Follow these steps or submit the JSL script below: • Choose the Graph > Control Chart > P command. which can vary. and select Capability. The upper and lower bounds vary according to the number of units inspected. shown here. is monitoring the number of brace defects per subgroup sample size. The chart shown here illustrates a p-chart for the proportion of defects.7 Control Charts Shewhart Control Charts for Attributes 131 7 Quality Control with Control Charts Note: This data contains all constant sample sizes.jmp data in the Quality Control sample data folder records the defect count in boxes of automobile support braces. A box of braces is one inspection unit. • Click OK. .

• Choose # defects as the Process variable.jmp data in the Quality Control sample data folder. • Choose Date as the Sample Label.17 u-Chart Example.jmp data in the Quality Control sample data folder. • Choose Box Size as the Sample Size. Example 10. . follow these steps or submit the JSL below. Prior to shipment.132 7 Control Charts Shewhart Control Charts for Attributes Figure 7. and select Capability. Since the manufacturer is interested in the average number of flaws per shirt. follow these steps or submit the JSL below. • Open the Shirts. However. c-Charts for Noncomformities per Unit In this example. the number of flaws found in each box is divided by ten and then recorded. c-charts can also be used to monitor the average number of defects per inspection unit. Note:When you generate a c-chart. • Click OK. • Choose # Defects as the Process variable. each shirt is inspected for flaws. a clothing manufacturer ships shirts in boxes of ten. u-Charts To replicate this example. • Choose Box as the Sample Label. • Open the Braces. JMP-SE launches the Poisson Fit in Distribu- tion and gives a Poisson-specific capability analysis. • Choose Unit size as the Unit Size variable. • Choose the Graph > Control Chart > C command. made up of one or more units. To replicate this example. • Choose the Graph > Control Chart > U command. c-Charts c-charts are similar to u-charts in that they monitor the number of nonconformities in an entire subgroup. they require constant subgroup sizes.

u-. Figure 7. found in the Quality Control sample data folder. row by row. set of limits. for each level of the specified Phase variable. On the dialog for X -. r-. • Click OK. a Phase variable button appears. • Select Graph > Control Chart > XBar. and Phase as the Phase. phases might correspond to time periods during which a new process is brought into production and then put through successive changes. zones. to identify to which phase each row belongs. . Saving to a limits file reveals the sigma and specific limits calculated for each phase. Phases generate. Example Open Diameter. DAY as the Sample Label.18 c-Chart Phases A phase is a group of consecutive observations in the data table. Presummarized. p-. For example. c-.7 Control Charts Phases 133 7 Quality Control with Control Charts • Click OK. a new sigma. np-. and resulting tests. • Choose DIAMETER as the Process. IR-. and Levey-Jennings charts. If a phase variable is specified.JMP. This data set contains the diameters taken for each day. the phase variable is examined. both with the first prototype (phase 1) and the second prototype (phase 2). s-.

134 7 Control Charts Phases Figure 7.20 Phase Control Chart .19 Launch Dialog for Phases The resulting chart has different limits for each phase Figure 7.

and the arms extended backward on the x-axis. you compare the points with limits that compose a V-mask.21 Illustration of a V-Mask for a Two-Sided Cusum Chart upper arm vertex lower arm d h the rise in the arm corresponding to the distance (d) from origin to vertex 1 unit k.21. as in Figure 7. Figure 7. To interpret a two-sided Cusum chart. the limits can be either in the form of a V-mask or a horizontal decision interval. which detect a shift in one direction from a specified target mean. two-sided charts allow specification in terms of error probabilities α and β. the rise in the arm corresponding to one sampling unit Shifts in the process mean are visually easy to detect on a cusum chart because they produce a change in the slope of the plotted points. Points crossing the lower arm signal an increasing process mean.7 Control Charts Cumulative Sum (Cusum) Charts 135 7 Quality Control with Control Charts Cumulative Sum (Cusum) Charts Cumulative Sum (Cusum) charts display cumulative sums of subgroup or individual measurements from a target value. In cusum charts. Both charts can be specified in terms of geometric parameters (h and k described in Figure 7. The origin of a V-mask is the most recently plotted point. There are major differences between cusum charts and other control (Shewhart) charts: • A Shewhart control chart plots points based on information from a single subgroup sample. On a cusum chart. JMP cusum charts can be one-sided. the cumulative sum sequence is updated and the origin is relocated at the newest point. The point where the slope changes is the point where the shift occurs. and points crossing the upper arm signal a downward shift. Cusum charts are graphical and analytical tools for deciding whether a process is in a state of statistical control and for detecting a shift in the process mean. each point is based on information from all samples taken up to and including the current subgroup. or two-sided to detect a shift in either direction. A V-mask is formed by plotting V-shaped limits. . A condition is out-of-control if one or more of the points previously plotted crosses the upper or lower arm of the V-mask.21). As data are collected. horizontal control limits define whether a point signals an out-of-control condition. • On a Shewhart control chart.

Beta is the probability of a Type II error and is available only when you specify Alpha. Lucas (1976) comments that a V-mask detects a 1σ shift about four times as fast as a Shewhart control chart. A cusum chart is more efficient for detecting small shifts in the process mean. a one-sided scheme is used and no V-mask appears. If it is not checked. or from an economic design. including appropriate options and specifications as shown here. H is the decision interval. Note: The following items pertain only to cusum charts: Two Sided requests a two-sided cusum scheme when checked. Data Units specifies that the cumulative sums be computed without standardizing the subgroup means or individual values so that the vertical axis of the cusum chart is scaled in the same units as the data. When you click H. Launch Options for Cusum Charts When you choose Graph > Control Charts > Cusum.26). If an H value is specified. Choose H as a multiple of the standard error. H is the vertical distance h between the origin for the V-mask and the upper or lower arm of the V-mask for a two-sided scheme. a decision interval is displayed. Note: Data Units requires that the subgroup sample size be designated as constant.136 7 Control Charts Cumulative Sum (Cusum) Charts • The control limits on a Shewhart control chart are commonly specified as 3σ limits. Beta specifies the probability of failing to discover that the specified shift occurred. which lets you enter the process variable specifications. from error probabilities. the limits are determined from average run length. . the Control Charts Launch dialog appears. For a one-sided scheme. Specify Stats appends the panel shown here to the Control Charts Launch dialog. On a cusum chart. You also enter a value for the increase in the lower V-mask per unit change on the subgroup axis (See Figure 7. the Beta entry box is labeled K.

σ. respectively. Cusum Chart Options Cusum charts have these options (in addition to standard chart options). You enter the shift value in the same units as the data. Show Points shows or hides the sample data points. Delta specifies the absolute value of the smallest shift to be detected as a multiple of the process standard deviation or of the standard error. σ0. By default.7 Control Charts Cumulative Sum (Cusum) Charts 137 7 Quality Control with Control Charts Target is the target mean (goal) for the process or population. and n is the (common) subgroup sample size. The target mean must be scaled in the same units as the data. S0. Sigma specifies a known standard deviation. You can use Sigma instead of the Alpha option on the Control Charts Launch dialog. depending on whether the shift is viewed as a shift in the population mean or as a shift in the sampling distribution of the subgroup mean. Enter Head Start as a multiple of standard error. Shift is the minimum value you want to detect on either side of the target mean. . σ represents the process standard deviation. the Control Chart platform estimates sigma from the data. Connect Points connects the sample points with a straight line. for a one-sided cusum scheme (S0 is usually zero). Head Start specifies an initial value for the cumulative sum. Delta is an alternative to the Shift option (described next). for the process standard deviation. You can choose either Shift or Delta. The relationship between Shift and Delta is given by ∆ δ = ----------------------(σ ⁄ ( n)) where δ represents Delta. ∆ represents the shift. and you interpret it as a shift in the mean of the sampling distribution of the subgroup mean.

Subgroup samples of four cans are selected and weighed every hour for twelve hours. Show ARL displays the average run length (ARL) information. is 0.jmp file from the Quality Control sample data folder. To create a Cusum chart for this example. µ0. .10 ounces. The process is set so that the average weight of a filled can. • Click the Two Sided check box if it is not already checked. σ0.10 ounces. • Specify hour as the Sample Label. Show Parameters displays a Parameters table (see Figure 7. The observations are sorted so that the values of hour are in increasing order. open the Oil1 Cusum. A machine fills 8-ounce cans of two-cycle engine oil additive. A two-sided cusum chart is used to detect shifts of at least one standard deviation in either direction from the target mean of 8. is 8.26) that summarizes the Cusum charting parameters.jmp data table contains one value of weight along with its associated value of hour. • Choose the Graph > Control Chart > CUSUM command. • Specify weight as the Process variable. Show Shift shows or hides the shift you entered.05 ounces. Previous analysis shows that the standard deviation of fill weights. Example 1.138 7 Control Charts Cumulative Sum (Cusum) Charts Mask Color displays the JMP color palette for you to select a line color for the V-mask. or center line. The filling process is believed to be in statistical control. Connect Color displays the JMP color palette for you to select a color for the connect line when the Connect Points option is in effect. Center Line Color displays the JMP color palette for you to select a color for the center line. Each observation in the Oil1 Cusum. • Click the H radio button and enter 2 into the text box. Two-Sided Cusum Chart with V-mask To see an example of a two-sided cusum chart. The Control Chart platform assumes that the data are sorted in increasing order. Show V Mask shows or hides the V-mask based on the parameters (statistics) specified on the Control Charts Launch dialog when Cusum is selected as the Chart Type.

the chart in Figure 7. . Because none of the points cross the arms of the V-mask. • Set Sigma to the standard deviation of 0. Figure 7.22. The finished dialog should look like the one in Figure 7.jmp Data You can interpret the chart by comparing the points with the V-mask whose right edge is centered at the most recent point (hour=12).23 appears. • Enter a Delta value of 1.23 Cusum Chart for Oil1 Cusum.05. there is no evidence that a shift in the process has occurred. Figure 7.1.7 Control Charts Cumulative Sum (Cusum) Charts 139 7 Quality Control with Control Charts • Click Specify Stats to open the Known Statistics for CUSUM chart tab.22 Dialog for Cusum Chart Example When you click OK. • Set Target to the average weight of 8.

To do this example. The center line and V-mask adjust to reflect the process condition at that point.140 7 Control Charts Cumulative Sum (Cusum) Charts A shift or out-of-control condition is signaled at a time t if one or more of the points plotted up to the time t cross an arm of the V-mask. The Cusum chart in Figure 7. • Specify hour as the Sample Label. The resulting output should look like the picture in Figure 7. You can move the origin of the V-mask by using the hand to click a point. A one-sided Cusum Chart can be used to identify data approaching or exceeding the side of interest. An upward shift is signaled by points crossing the lower arm.25.05.1. • Set Target to the average weight of 8. Anything 0. Figure 7. where the machine fills 8-ounce cans of engine oil. One-Sided Cusum Chart with no V-mask Consider the data used in Example 1.25 ounces beyond the mean of 8. The cusum chart automatically updates when you add new samples. .24 is the previous chart with additional points. and a downward shift is signaled by points crossing the upper arm. • Choose the Graph > Control Chart > CUSUM command. The time at which the shift occurred corresponds to the time at which a distinct change is observed in the slope of the plotted points. and not so concerned about under-filling. Consider also that the manufacturer is now concerned about significant over-filling in order to cut costs.1 is considered a problem. • Click Specify Stats to open the Known Statistics for CUSUM chart tab. • Enter a Delta value of 1.24 Updated Cusum Chart for the OIL Data Example 2. • Specify weight as the Process variable. • Set Sigma to the standard deviation 0.25 into the text box. • Deselect the Two Sided check box. • Click the H radio button and enter 0.

is the average run length for detecting a shift the size of the specified Delta • ARL(0). sometimes denoted ARL1.26. The parameters report summarizes the charting parameters from the Known Statistics for CUSUM chart tab on the Control Chart Launch dialog. Also note that no V-mask appears with One-Sided Cusum charts. is the in-control average run length for the specified parameters (Montogomery (1985)).25 One-Sided Cusum Chart for the OIL Data Notice that the decision interval or horizontal line is set at the H-value entered (0. Show ARL.7 Control Charts Cumulative Sum (Cusum) Charts 141 7 Quality Control with Control Charts Figure 7.26 Show Parameters and Show ARL Options . Figure 7. adds the average run length (ARL) information to the report. An additional chart option.25). sometimes denoted ARL0. The average run length is the expected number of samples taken before an out-of-control condition is signaled: • ARL (Delta). The Show Parameters option in the Cusum chart popup menu shows the Parameters report in Figure 7.

.

These indicate how and to what degree each point in the series is correlated with earlier values in the series and can be used to identify the type of model appropriate for describing and predicting (forecasting) the evolution of the time series.1 Launch Dialog . the platform displays graphs of the autocorrelations and partial autocorrelations of the series. Optionally. For this example. autoregressive integrated moving-average.yN of observations taken over a series of equally-spaced time periods. The model types include • ARIMA. evenly spaced. and without missing values. assign Passengers as Y. Introduction The data for the next examples are in the Seriesg. A time series is a set y1. Time Series and Time as X. This dialog allows you to specify the number of lags to use in computing the autocorrelations and partial autocorrelations. The time series variable is Passengers and the time ID is Time. sorted ascending. analyze. . y2. it must be continuous. It also lets you specify the number of future periods to forecast using each model fitted to the data. Figure 8. In addition. Note: The Time Series Launch dialog requires that one or more continuous variables be assigned as the time series. .. which is used to label the time axis. you can specify a time ID variable. The analysis begins with a plot of the points in the time series. Select Analyze > Time Series to display the Time Series Launch dialog (Figure 8. often called Box-Jenkins models • Smoothing Models. several forms of exponential smoothing and Winter’s method. Time ID..8 Time Series The Time Series platform lets you explore.jmp table found in the Time Series sample data folder (Box and Jenkins 1976). and forecast univariate time series. If a time ID variable is specified.1).

The series now has an acceptable appearance for modeling. Time ID. its autocorrelation graph. A column containing a count of Log Passengers is already included in the table. .2 Initial Time Series Report The graph shows that the series has an increasing spread over time. increasing variances are transformed using logarithms. and its partial autocorrelation graph.144 8 Time Series Introduction The first thing you see is a graph showing the time series. Figure 8. This should be accounted for before modeling the series. In general. Time Series and Time as X. Again select Analyze > Time Series Assign Log Passengers as Y.

Enter a 1 beside q.3 Log Passenger Series Since the autocorrelation graph decreases slowly and steadily. a reasonable guess for a model is an MA(1). . which is initially closed.8 Time Series Introduction 145 8 Time Series Figure 8. To try this model. JMP Student Edition estimates the model and displays a model summary. is the residuals. and a forecast graph. parameter estimates. however. Click Estimate. Moving Average Order. The most important graph. Select ARIMA from the platform menu. but the partial autocorrelation graph drops off drastically after lag 1.

5 MA(1)Model Results .4 Model Results Open the Residuals node to reveal a graph and autocorrelation plots for the model residuals. Figure 8.146 8 Time Series Introduction Figure 8.

where n is the number of observations. in the Model Comparison table. so an MA(1) is not an appropriate model. Time ID for the x-axis. In fact. one variable used for labeling the time axis Y. The Forecast Periods box allows you to set the number of periods into the future that the fitted models are forecast.7). select one or more columns in the column selector list and click a role button. select it in the role box and click Remove. This is the maximum number of periods between points used in the com- putation of the correlations. Click Estimate. Similar unsatisfactory results appear. A commonly used rule of thumb for the maximum number of lags is n/4. You set the number of lags for the autocorrelation and partial autocorrelation plots in the Autocorrelation Lags box. To cast a column into a role. like others in JMP. The plot. It must be more than one but less than the number of rows. note that JMP is accumulating a list of models. click OK to see the analysis for each time series variable versus the time ID. assign columns for analysis with the dialog in Figure 8. or row number if no time ID is specified (Figure 8. Time Series for the y-axis. The Time Series Platform First. Moving Average Order. drag variables from the column selector list to one of the following role boxes: X. By default. JMP uses 25 lags and 25 forecast periods The Time Series Graph The Time Series platform begins with a plot of each times series by the time ID. an MA(2).8 Time Series The Time Series Platform 147 8 Time Series The expected reduction in spikes did not occur. highlight points with the cursor or brush tool. Some reflection is necessary. has features to resize the graph. Or. However.6 Model Comparison Table Examine the R2 for the two models in this table. Select ARIMA from the platform menu. After assigning roles. Figure 8. along with appropriate fit statistics. can be run in the same way. and label points.1. The selector list at the left of the dialog shows all columns in the current table. one or more time series variables. Enter a 2 beside q. the MA(2) is a worse fit than the MA(1). A second model. . To remove an unwanted variable from an assigned role.

and cover additional platform features. 151). You can select the model fitting commands repeatedly. The platform popup menu. The ARIMA and Smoothing Model commands are for fitting various models to the data and producing forecasts.7 Time Series Plot of Seriesg (Airline Passenger) Data By default. graphs of the autocorrelation and partial autocorrelation (Figure 8. After the first model has been fit. . also has fitting commands and options for displaying additional graphs and statistical tables.148 8 Time Series Time Series Commands Figure 8.5) of the time series are also shown. Time Series Commands The popup menu next to the time series name has the commands shown here. discuss statistical results.” p. The first three items in this menu control the descriptive and diagnostic graphs and tables. The following sections describe options and model fits. but can be hidden with commands from the platform popup menu on the Time Series title bar. The result of each new fit is appended to the report. a summary of all the models is inserted just above the first model report (an example is shown in “Model Comparison Table. discussed next. These are typically used to determine the nature of the model to be fitted to the series.

like the one shown previously in Figure 8. In addition.7. The solid black lines represent ± 2 standard errors for approximate 95% confidence limits . often called the sample autocorrelation function. The Graph command on the platform popup menu has a submenu of controls for the time series plot with the following commands. By definition. the first autocorrelation (lag 0) always has length 1. confidence curves show twice the large-lag standard error (± 2 standard errors). • Time Series Graph hides or displays the time series graph. The autocorrelation plot for the Seriesg data is shown on the left in Figure 8. Autocorrelation The Autocorrelation command alternately hides or displays the autocorrelation graph of the sample. The plot on the right in Figure 8. • Show Points hides or displays the points in the time series graph.8 shows the partial autocorrelation function for the Seriesg data. You can examine the autocorrelation and partial autocorrelations plots to determine whether the time series is stationary (meaning it has a fixed mean and standard deviation over time) and what model might be appropriate to fit the time series. • Connecting Lines hides or displays the lines connecting the points in the time series graph.8 Time Series Time Series Commands 149 8 Time Series Graph The Time Series platform begins by showing a time series plot.8. Partial Autocorrelation The Partial Autocorrelation command alternately hides or displays the graph of the sample partial autocorrelations. • Mean Line hides or displays a horizontal line in the time series graph that depicts the mean of the time series. This graph describes the correlation between all the pairs of points in the time series with a given separation in time or lag.

The ARIMA.150 8 Time Series Modeling Reports Figure 8. as discussed previously in their respective sections. and smoothing models begin with the following report tables. using a Kalman filter to compute the likelihood function.8 Autocorrelation and Partial Correlation Plots Number of Forecast Periods The Number of Forecast Periods command displays a dialog for you to reset the number of periods into the future that the fitted models will forecast. a report of the results of the fit and a forecast is added to the platform results. These commands also produce statistics and residuals that allow you to ascertain the adequacy of the model you have elected to use. Each general class of models has its own dialog. The initial value is set in the Time Series Launch dialog. The fit of each model begins with a dialog that lets you specify the details of the model being fit as well as how it will be fit. All existing and future forecast results will show the new number of periods with this command. Each time you select a model. You can select the modeling commands repeatedly. The models are fit by maximizing the likelihood function. seasonal ARIMA. . Modeling Reports The time series modeling commands are used to fit theoretical models to the series and use the fitted model to predict (forecast) future values of the series.

Schwartz’s Bayesian Criterion [SBC or BIC] are goodness of fit statistics. 154. whether all the roots of φ ( z ) = 0 lie outside the unit circle. Model Summary Table Each model fit generates a Model Summary table. Smaller values are better fits.” p. • Sum of Squared Errors is the sum of the squares of the prediction errors. SSE. and (1. 0). the random shocks • Akaike’s Information Criterion [AIC]. Each row corresponds to a different model. which summarizes the statistics of the fit. This is the sample estimate of the variance of the random shocks at. 0. Smaller values of these criteria indicate better fit. • DF is the number of degrees of freedom in the fit. • Invertible indicates whether the moving average operator is invertible. The Model Comparison table shown above summarizes the ARIMA models (1. . In the formulae below. n is the number of nonmissing observations and k is the number of fitted parameters in the model. The numerical values in the table are drawn from the Model Summary table for each fitted model.” p. You can use it to compare several models fitted to the same time series. (0. • Variance Estimate is the unconditional sum of squares (SSE) divided by the number of degrees of freedom. 0. 154. • Standard Deviation is the square root of the variance estimate. 0. described in the section “ARIMA Model. That is. • RSquare and RSquare Adj are also goodness of fit statistics. 1). 1) respectively. • Stable indicates whether the autoregressive operator is stable.8 Time Series Modeling Reports 151 8 Time Series Model Comparison Table The Model Comparison table summarizes the fit statistics for each model. where values closer to 1 indicate a better fit. That is. whether all the roots of θ ( z ) = 0 lie outside the unit circle. This is a sample estimate of the standard deviation of at. Note: The φ and θ operators are defined in the section “ARIMA Model. n – k. • –2LogLikelihood is minus two times the natural log of the likelihood function evaluated at the best-fit parameter estimates. SSE / (n – k). detailed in the online help.

In those models. These are described below for each model type. a value below 0. Some models contain an intercept or mean term.152 8 Time Series Modeling Reports Parameter Estimates Table There is a Parameter Estimates table for each selected fit. If the hypothesis is true. The Parameter Estimates table has these terms: • Term lists the name of the parameter. The definition of the constant estimate is given under the description of ARIMA models. for models that contain an intercept or mean term. • t Ratio lists the test statistics for the hypotheses that each parameter is zero. • Prob>|t| lists the observed significance probability calculated from each t-ratio. Factor 1 is nonseasonal and Factor 2 is seasonal.” p. • Std Error lists the estimates of the standard errors of the parameter estimates. then this statistic has an approximate Student’s t-distribution. . This is only shown for multiplicative models. Looking for a t-ratio greater than 2 in absolute value is a common rule of thumb for judging significance because it approximates the 0. Each type of model has its own set of parameters. • Estimate lists the parameter estimates of the time series model.05 (or sometimes 0. 154. which gives the estimates for the time series model parameters. The Parameter Estimates table also gives the Constant Estimate. • Factor (Seasonal ARIMA only) lists the factor of the model that contains the parameter.01) is interpreted as evidence that the parameter is significantly different from zero. In the multiplicative seasonal models. a t-ratio greater (in absolute value) than the computed value. the related constant estimate is also shown. • Lag lists the degree of the lag or backshift operator that is applied to the term to which the parameter is multiplied. Often. given a true hypothesis. It is the probability of getting. by chance alone.05 significance level. They are used in constructing tests and confidence intervals. The definition of the constant estimate is given under “ARIMA Model. They are described in the sections on specific time series models. It is the ratio of the parameter estimate to its standard error.

Iteration History The model parameter estimation is an iterative procedure by which the log-likelihood is maximized by adjusting the estimates of the parameters. To the left of the separating line the one-step-ahead forecasts are shown overlaid with the input data points. The Forecast plot shows the values that the model predicts for the time series. These are the actual values minus the one-step-ahead predicted values. the autocorrelation and partial autocorrelation of these residuals are shown.8 Time Series Modeling Reports 153 8 Time Series Forecast Plot Each model has its own Forecast plot. This can be useful for diagnosing problems with the fitting procedure. If it is. To the right of the line are the future values forecast by the model and the confidence intervals for the forecasts. In addition. The data and confidence intervals can be toggled on and off using the Show Points and Show Confidence Interval commands on the model’s popup menu. Residuals The graphs under the residuals section of the output show the values of the residuals based on the fitted model. . These can be used to determine whether the fitted model is adequate to describe the data. the points in the residual plot should be normally distributed about the zero line and the autocorrelation and partial autocorrelation of the residuals should not have any significant components for lags greater than zero. You can control the number of forecast values by changing the setting of the Forecast Periods box in the platform launch dialog or by selecting Number of Forecast Periods from the Time Series drop-down menu. The iteration history for each model you request shows the value of the likelihood function for each iteration. It is divided by a vertical line into two regions. Attempting to fit a model which is poorly suited to the data can result in a large number of iterations that fail to converge on an optimum value for the likelihood.

These displays are described in the section “Time Series Commands. they are applied to the residual series (the one-step-ahead model predictions minus the input series). The model can be rewritten as φ ( B )w t = δ + θ ( B )a t where the constant estimate δ is given by the relation .” p. 2 p 2 q The a t are assumed to be independent and normally distributed with mean zero and constant variance. with the following options for that model: Show Points hides or shows the data points in the forecast graph. The ARIMA command performs a maximum likelihood fit of the specified ARIMA model to the time series. For a response series { y i } . the autoregressive operator and the moving average operator and are written φ ( B ) = 1 – φ 1 B – φ 2 B – … – φ p B and θ ( B ) = 1 – θ 1 B – θ 2 B – … – θ q B a t are the sequence of random shocks. the general form for the ARIMA model is: φ ( B ) ( w t – µ ) = θ ( B )a t where t is the time index B is the backshift operator defined as B y t = y t – 1 w t = ( 1 – B ) y t is the response series after differencing d µ is the intercept or mean term. however. 148. Save Columns creates a new data table with columns representing the results of the model. φ ( B ) and θ ( B ) . respectively.154 8 Time Series ARIMA Model Model Report Options The title bar for each model you request has the popup menu shown to the right. ARIMA Model An AutoRegressive Integrated Moving Average (ARIMA) model predicts future values of a time series by a linear combination of its past values and a series of errors (also known as random shocks or innovations). Residual Statistics controls which displays of residual statistics are shown for the model. Show Confidence Interval hides or shows the confidence intervals in the forecast graph.

The Moving Average Order is the order (q) of the differencing operator θ ( B ) . if p and d are zero. 155 8 Time Series The ARIMA command displays the Specify ARIMA Model dialog. You can check the Model Summary table to see if the resulting fitted model is stable and invertible. the corresponding letters are often dropped. s ( t ) is one of the s time-varying seasonal terms. You might want to uncheck this box if the fitter is having difficulty finding the true optimum or if you want to speed up the fit. a t are the random shocks. An ARIMA model is commonly denoted ARIMA(p. For example. The Intercept check box determines whether the intercept term µ will be part of the model. then model would be denoted MA(q).d. β t is the time-varying slope term. Smoothing models represent the evolution of a time series by the model: y t = µ t + β t + s ( t ) + a t where µ t is the time-varying mean term. the fitting procedure will constrain the autoregressive parameters to always remain within the stable region and the moving average parameters within the invertible region. or q are zero. which allows you to specify the ARIMA model you want to fit.8 Time Series Smoothing Models δ = φ ( B )µ = µ – φ 1 µ – φ 2 µ – … – φ p µ . Smoothing Models JMP offers a variety of smoothing techniques. The results appear when you click Estimate.q). If the Constrain fit check box is checked.d. The Confidence Intervals box allows you to set the confidence level between 0 and 1 for the forecast confidence bands. If any of p. . Use the Specify ARIMA Model dialog for the following three orders that can be specified for an ARIMA model: 1 2 3 4 The Autoregressive Order is the order (p) of the polynomial ϕ ( B ) operator. The Differencing Order is the order (d) of the differencing operator.

The smoothing equations are written in terms of model parameters called smoothing weights. Smoothing Model Dialog The Smoothing Model dialog appears in the report window when you select one of the smoothing model commands. These ARIMA equivalents are used to estimate the smoothing weights and provide forecasts. the trend smoothing weight ϕ. the trend damping weight δ. s – 1 are the estimates of the s ( t ) . The estimators for these time-varying terms are L t smoothed level that estimates µ t T t is a smoothed trend that estimates β t S t – j for j = 0. The dialog also lets you to specify what type of constraint you want to enforce on the smoothing weights during the fit. The dialogs for seasonal smoothing models include a Periods Per Season box for setting the number of periods in a season. Each smoothing model defines a set of recursive smoothing equations that describes the evolution of these estimators. 1. …. Unconstrained allows the parameters to range freely. Stable Invertible constrains the parameters such that the equivalent ARIMA model is stable and invertible. the seasonal smoothing weight. the level smoothing weight γ. Custom expands the dialog to allow you to set constraints on individual smoothing weights. they have the common property that larger weights give more influence to recent data while smaller weights give less influence to recent data. . You may not be able to specify the equivalent ARIMA model using the ARIMA command because some smoothing models intrinsically constrain the ARIMA model parameters in ways the ARIMA command will not allow.156 8 Time Series Smoothing Models Models without a trend have βt = 0 and nonseasonal models have s ( t ) = 0 . They are α. Each smoothing model has an ARIMA model equivalent. The constraints are: Zero To One keeps the values of the smoothing weights in the range zero to one. The Confidence Intervals box allows you to set the confidence level for the forecast confidence bands. While these parameters enter each model in a different way (or not at all).

Simple Exponential Smoothing The model for simple exponential smoothing is y t = µ t + α t . In that case. none of the weights would be estimated from the data although forecasts and residuals would still be computed. 1)(0. 1 with θ = 1 – α . The smoothing equation. 2 2 The moving average form of the model is . Fixed. 1.8 Time Series Smoothing Models 157 8 Time Series Each smoothing weight can be Bounded. This model is equivalent to an ARIMA(0. In this case. The example shown here has the Level weight (α) fixed at a value of 0. 1) model where ( 1 – B )y t = ( 1 – θ B )α t with θ = 1 – α . the results of the fit appear in place of the dialog. The smoothing equations. 1. When you click Estimate.3. or Unconstrained as determined by the setting of the popup menu next to the weight’s name. This model is equivalent to an ARIMA(0. the value of the Trend weight is allowed to move within the range 0 to 1 while the Level weight is held at 0. defined in terms of a single smoothing weight α are L t = αy t + ( 1 – α )L t – 1 and T t = α ( L t – L t – 1 ) + ( 1 – α )T t – 1 . The moving average form of the model is ∞ yt = at + ∑ j–1 αa t – j Double (Brown) Exponential Smoothing The model for double exponential smoothing is y t = µ t + β 1 t + a t .3 and the Trend weight (γ) bounded by 0 and 1. Note that you can specify all the smoothing weights in advance by using these custom constraints. 1 = θ 2. is defined in terms of a single smoothing weight α. 1. Lt = αyt + (1 – α)Lt-1. 1)1 model ( 1 – B ) y t = ( 1 – θ B ) a t where θ 1.

0)S model where we define θ 1 = θ 1. 2. and θ 3 = – θ 1.158 8 Time Series Smoothing Models ∞ yt = at + ∑ j=1 ( 2α + ( j – 1 )α )a t – j 2 Linear (Holt) Exponential Smoothing The model for linear exponential smoothing is y t = µ t + β t t + a t . 1 θ 2. 1. and ϕ are L t = αy t + ( 1 – α ) ( L t – 1 + ϕT t – 1 ) and T t = γ ( L t – L t – 1 ) + ( 1 – γ )ϕT t – 1 This model is equivalent to an ARIMA(1. 1. 1 . s . s = θ 2. The smoothing equations in terms of smoothing weights α. The smoothing equations in terms of smoothing weights α and δ are L t = α ( y t – S t – s ) + ( 1 – α )L t – 1 and S t = δ ( y t – L t – s ) + ( 1 – δ )ϕS t – s This model is equivalent to a seasonal ARIMA(0. The smoothing equations defined in terms of smoothing weights α and γ are L t = αy t + ( 1 – α ) ( L t – 1 + T t – 1 ) and T t = γ ( L t – L t – 1 ) + ( 1 – γ )T t – 1 This model is equivalent to an ARIMA(0. 2 2 The moving average form of the model is ∞ yt = at + ∑ j=1 ( α + jαγ )a t – j Damped-Trend Linear Exponential Smoothing The model for damped-trend linear exponential smoothing is y t = µ t + β t t + a t . 2) model where ( 1 – B ) y t = ( 1 – θ B – θ 2 B )a t with θ = 2 – α – αγ and θ 2 = α – 1 . 1. 1)(0. 2) model where ( 1 – ϕ B ) ( 1 – B )y t = ( 1 – θ 1 B – θ 2 B )a t with θ 1 = 1 + ϕ – α – αγϕ and θ 2 = ( α – 1 )ϕ . γ. s . θ 2 = θ 2. 2 The moving average form of the model is ∞ yt = αt + ∑ j=1 ⎛ α + αγϕ ( ϕ – 1 )⎞ ⎜ ---------------------------------------⎟ α t – j ϕ–1 ⎝ ⎠ j Seasonal Exponential Smoothing The model for seasonal exponential smoothing is y t = µ t + s ( t ) + a t .

The moving average form of the model is ∞ yt = at + ∑ j=1 ψj at – j ⎧ α for jmods ≠ 0 ⎪ where ψ = ⎨ ⎪ ⎩ α + δ ( 1 – α ) forjmods = 0 Winters Method (Additive) The model for the additive version of Winter’s method is y t = µ t + β t t + s ( t ) + a t . and This model is equivalent to a seasonal ARIMA(0. s+1 ⎛ ⎞ i ⎜1 – ( 1 – B ) ( 1 – B )y t = ⎜ ∑ θi B ⎟ at ⎟ ⎝ ⎠ i=1 2 . and θ 3 = ( 1 – α ) ( δ – 1 ) .8 Time Series Smoothing Models 159 8 Time Series so ( 1 – B ) ( 1 – B )y t = ( 1 – θ 1 B – θ 2 B – θ 3 B s 2 s+1 )a t with θ 1 = 1 – α . T t = γ ( L t – L t – 1 ) + ( 1 – γ )T t – 1 . θ 2 = δ ( 1 – α ) . 0)s model The moving average form of the model is ∞ yt = at + ∑ j=1 Ψj at – j where ⎧ α + jαγ for jmods ≠ 0 ψ = ⎨ ⎩ α + jαγ + δ ( 1 – α ) forjmods = 0 . γ. s+1)(0. and δ are Lt = α ( yt – St – s ) + ( 1 – α ) ( Lt – 1 + Tt – 1 ) S t = δ ( y t – L t ) + ( 1 – δ )S t – s . 1. The smoothing equations in terms of weights α. 1.

.

Introduction For a short tour of the Multivariate platform. Open Solubility. The platform begins by showing a standard correlation matrix. The points are highlighted on all other plots and are selected in the data table. When the report appears. The Multivariate platform popup menu gives the additional correlations options and other techniques for looking at multiple variables such as • a scatterplot matrix with normal density ellipses • inverse. . partial.9 Correlations and Multivariate Techniques The Multivariate platform specializes in exploring how many variables relate to each other. you see correlations and a scatterplot matrix.jmp from the sample data folder. Select Analyze > Multivariate to bring up the launch dialog. and pairwise correlations • a covariance matrix • nonparametric measures of association • simple statistics (such as mean and standard deviation) All plots and the current data table are linked. or the outlier distance plot. You can highlight points on any scatterplot in the scatterplot matrix.

In addition. shown here. including nonparametric correlations (se below for instances of each kind. Principal Components would confirm this. .162 9 Correlations and Multivariate Techniques Introduction From here. Select Principal Components > on Correlations from the platform’s drop-down list.785 and 0. the last four variables are similarly correlated. The principal components report. corresponding to the eigenvalues of 4. Note that the first two variables (1-Octonol and Ether) are correlated with each other. indicates that there are two strong directions of variation. you can calculate several different kinds of correlations. This suggests that the variability in these six variables could be explained in fewer dimensions. We can express 95% of the variation of these six dimensions in only two dimensions.945.

In most of the following analysis options.jmp sample data table. and Simple Statistics > Univariate. which exclude rows that are missing on either of the variables under consideration.9 Correlations and Multivariate Techniques Launch the Platform and Select Options 163 9 Correlations and Multivariate Statistics Launch the Platform and Select Options When you choose Analyze > Multivariate. Inverse Correlations and Partial Correlations The inverse correlation matrix (Inverse Corr table). Many of the following examples use the Solubility. a missing value in an observation causes the entire observation to be deleted. In the inverse correlation. If the multiple correlation is zero. Correlations Multivariate The Correlations Multivariate option gives the Correlations table. The following sections describe the tables and plots offered by the Multivariate platform. the diagonal inverse . without regard to missing values in other columns. The exceptions are in Pairwise Correlations. The diagonal elements of the matrix are a function of how closely the variable is a linear function of the other variables. which is a matrix of correlation coefficients that summarizes the strength of the linear relationships between each pair of response (Y) variables. the diagonal is 1/(1 – R2) for the fit of that variable by all the other variables. This correlation matrix only uses the observations that have nonmissing values for all variables in the analysis. shown at the top in the next figure. provides useful multivariate information. The platform popup menu shown here lists additional correlation options and other techniques for looking at multiple variables. which calculates its statistics column-by-column. a standard correlation matrix and scatterplot matrix appears first.

The correlation of the variables is seen by the collapsing of the ellipse along the diagonal axis. By default.164 9 Correlations and Multivariate Techniques Launch the Platform and Select Options element is 1. the variables are uncorrelated. Scatterplot Matrix To help you visualize the correlations. If the variables are bivariate normally distributed. If the scatterplots are not showing. If the multiple correlation is 1. The cells of the scatterplot matrix are size-linked so that stretching a plot from any cell resizes all the scatterplot cells. as shown in Figure 9. . This is the negative of the inverse correlation matrix scaled to unit diagonal. then the inverse element becomes infinite and is reported missing. The scatterplot matrix is shown by default. If the ellipse is fairly round and is not diagonally oriented. a 95% bivariate normal density ellipse is imposed on each scatterplot. select Scatterplot Matrix from the platform popup menu.1. this ellipse encloses approximately 95% of the points. a scatterplot for each pair of response variables displays in a matrix arrangement. The partial correlation table (Partial Corr table) shows the partial correlations of each pair of variables after adjusting for all the other variables.

1 Example of a Scatterplot Matrix Two clusters of correlations: The first two variables and the next four. . By default they are 95% ellipses. Density Ellipses toggles the display of the density ellipses on the scatterplots constructed by the α level that you choose. In addition. Show Correlations shows the correlation of each histogram in the upper left corner of each scatShow Histogram draws histograms in the diagonal of the scatterplot matrix.9 Correlations and Multivariate Techniques Launch the Platform and Select Options 165 9 Correlations and Multivariate Statistics Figure 9. you can toggle the counts that label each bar with the Show Counts command. The popup menu next on the Scatterplot Matrix title bar button lets you tailor the matrix with color and density ellipses and by setting the α-level. These histograms can be specified as Horizontal or Vertical. terplot.

Figure 9.2 Reorder Scatterplot Matrix Drag cell diagonally Covariance Matrix The Covariance Matrix command displays the covariance matrix for the analysis. the columns reorder as shown in Figure 9. as illustrated previously by the two groups showing in Figure 9. Ellipse Color lets you select from a palette of colors to change the color of the ellipses. if you drag the cell of the column labeled 1-octanol diagonally down one cell. . you clearly see the variables cluster into groups based on their correlations. For example. You can reorder the scatterplot matrix columns by dragging a diagonal (label) cell to another position on the diagonal. When you look for patterns in the whole scatterplot matrix with reordered columns.1.166 9 Correlations and Multivariate Techniques Launch the Platform and Select Options Show Correlations Show Counts Show Histogram (Horizontal in this case) Ellipse α lets you select from a submenu of standard α-levels or select the Other command and specifically set the α level for the density ellipses.2.

3. Figure 9. These values match the ones that would be produced using the Distribution platform. using all available values. and so on) for each column. regardless of values in other col- umns. The count values differ if any pair has a missing value for either variable. Univariate Simple Statistics are calculated on each column. .3 Pairwise Correlations Report Simple Statistics The Simple Statistics submenu allows you to display simple statistics (mean.9 Correlations and Multivariate Techniques Launch the Platform and Select Options 167 9 Correlations and Multivariate Statistics Pairwise Correlations The Pairwise Correlations table lists the Pearson product-moment correlations for each pair of Y variables. These are values produced by the Density Ellipse option on the Fit Y by X platform. as shown in Figure 9. standard deviation. These statistics can be calculated in two ways that differ when there are missing values in the data table. The Pairwise Correlations report also shows significance probabilities and compares the correlations with a bar chart.

168 9 Correlations and Multivariate Techniques Launch the Platform and Select Options Multivariate Simple Statistics are calculated by dropping any row that has a missing value for any column in the analysis. Hoeffding’s D is a statistical scale that ranges from –0. the Nonparametric Measures of Association table is shown. See “Computations and Statistical Details. A pair is discordant if the observation with the larger value of X has the smaller value of Y. for computational information. The Nonparametric Measures of Association report also shows significance probabilities for all measures and compares them with a bar chart similar to the one in Figure 9.The statistic approximates a weighted sum over observations of chi-square statistics for two-by-two classification tables.3. Kendall’s Tau is based on the number of concordant and discordant pairs of observations. The Nonparametric submenu offers these three nonparametric measures: Spearman’s Rho is a correlation coefficient computed on the ranks of the data values instead of on the values themselves.5 to 1. Nonparametric Correlations When you select Nonparametric Correlations from the platform popup menu. A pair is concordant if the observation with the larger value of X also has the larger value of Y. These are the statistics that are used by the Multivariate platform to calculate correlations.” p. and detects more general departures from independence. with large positive values indicating dependence. 169. There is a correction for tied pairs (pairs of observations that have equal values of X or equal values of Y). .

the correlation is 1 or –1. n is the number of observations. If a weight variable is specified. 0 if z = 0 . Note that sgn ( z ) is equal to 1 if z > 0 . The formula ∑ sgn ( x i – xj ) sgn ( y i – y j ) i<j τ b = --------------------------------------------------------------( T0 – T1 ) ( T0 – T2 ) computes Kendall’s τb where T0 = ( n ( n – 1 ) ) ⁄ 2 . Average ranks are used in case of ties. depending on whether the variables are positively or negatively related. and ∑ ( ( ui ) ( ui – 1 ) ) ⁄ 2 . ∑ (x – x) ∑ (y – y) 2 2 If there is an exact linear relationship between two variables. it is denoted as r and computed as ∑ (x – x)(y – y) r = --------------------------------------------------------. Kendall. and Kendall’s τb ranges from –1 to 1. or tied. it is ignored. Otherwise they are discordant. Computations are then performed on the ranks of the data values. Kendall’s τb Coefficients Kendall’s τb coefficients are based on the number of concordant and discordant pairs. The ti (the ui) are the number of tied x (respectively y) values in the ith group of tied x (respectively y) values. If there is no linear relationship. or Hoeffding correlations. Nonparametric Measures of Association For the Spearman. the data are first ranked. the correlation tends toward zero. T1 = T2 = ∑ ( ( ti ) ( ti – 1 ) ) ⁄ 2 . A pair of rows for two variables is concordant if they agree in which variable is greater. For response variables X and Y. and –1 if z < 0 . Spearman’s ρ (rho) Coefficients Spearman’s ρ correlation coefficient is computed on the ranks of the data using the formula for the Pearson’s correlation previously described. .9 Correlations and Multivariate Techniques Computations and Statistical Details 169 9 Correlations and Multivariate Statistics Computations and Statistical Details Pearson Product-Moment Correlation The Pearson product-moment correlation coefficient measures the strength of the linear relationship between two variables..

A point that is tied on its x value or y value. • The observations are then re-ranked according to the values of the second variable. A point tied on both x and y contributes 1/4 to Qi.170 9 Correlations and Multivariate Techniques Computations and Statistical Details Computations proceed in the following way: • Observations are ranked in order according to the value of the first variable. Thus. i. Inverse Correlation Matrix The inverse correlation matrix provides useful multivariate information. from the zero model.e. For no-intercept and hidden-intercept models. are a function of how closely the variable is a linear function of the other variables. Hoeffding’s D Statistic The formula for Hoeffding’s D (1948) is ( n – 2 ) ( n – 3 )D 1 + D 2 – 2 ( n – 2 )D 3 D = 30 ⎛ ----------------------------------------------------------------------------------------⎞ where ⎝ n( n – 1)( n – 2 )(n – 3)( n – 4) ⎠ D1 = Si ( Qi – 1 ) ( Qi – 2 ) . it is ignored. D2 = Si ( Ri – 1 ) ( Si – 1 ) ( Si – 2 ) D3 = ( Ri – 1 ) ( Si – 2 ) ( Qi – 1 ) The Ri and Si are ranks of the x and y values. and the Qi (sometimes called bivariate ranks) are one plus the number of points that have both x and y values less than the ith points. The diagonal elements of the inverse correlation matrix.. from the mean model. . with 1 indicating complete dependence. but not on both. 2 1 – Ri where Ri2 is the coefficient of variation from the model regressing the ith explanatory variable on the other explanatory variables. Note that the definition of R2 changes for no-intercept models. JMP uses the R2 from the uncorrected Sum of Squares. contributes 1/2 to Qi if the other value is less than the corresponding value for the ith point. if the correlation matrix is ii denoted R and the inverse correlation matrix is denoted R-1. Specifically. a large rii indicates that the ith variable is highly correlated with any number of the other variables. When there are no ties among observations. • The number of interchanges of the first variable is used to compute Kendall’s τb. the D statistic has values between –0. the diagonal element is denoted r and is computed as r ii 1 = VIF i = --------------. rather than the corrected sum of squares. If a weight variable is specified. sometimes called the variance inflation factors (VIF).5 and 1.

and Charting Data This chapter shows you how to use JMP to interact with the rest of the world. the Chart. Select Method from the list of variables.10 Importing.7 on page 181. or web authoring tools. Open the data file Denim. Select Graph > Chart from the menu bar. This brings up a dialog like the one in Figure 10.jmp. Select Starch Content (%) from the list of columns. Select Thread Wear Measured from the list of columns. Click the Categories. presentation managers. Exporting. Click on the Statistics button and select Mean from the drop-down list. In this introduction. Introduction After starting JMP Student Edition. Click on the Statistics button and select Max. No statistics package is useful unless its results can be communicated to others. some of the variables’ statistics are charted. so this chapter also describes copying and pasting results from JMP Student Edition into word processors. Details about this data are found in “About the Data. In addition to techniques for accomplishing these tasks. 21 in the “The Distribution Platform” chapter. . produce a bar chart showing the mean starch content and the maximum thread wear for each of the wash methods. it illustrates ways of importing data from various formats into JMP for analysis. and Overlay Plot platforms are documented.” p. Using the Chart Platform To begin with. shown later in this chapter. Graphs and charts are usually used to summarize results. Therefore. X Levels button.

1 Bar Chart The Chart platform makes it easy to format this chart.172 10 Importing. or even change to another chart type. . and Charting Data Introduction Click OK. Simple charts are almost always the most effective. Right-Click on Mean(Starch Content (%)) in the legend to the right of the plot. do not feel that they all must be. The bars for Mean(Starch Content (%)) change to a needle chart. A bar chart like the one in Figure 10. The colors of the bars for starch content change to the selected color. Figure 10. Please note that merely because JMP Student Edition allows for a great many chart options to be changed. and choose a color from the resulting palette. With Mean(Starch Content (%)) still selected: From the drop-down list in the title bar next to Chart. Exporting.1 appears.2. select Overlay Color. select Y Options > Needle Plot. as shown in Figure 10.

X Level role. and Method to the Categories. Exporting. Assign Thread Wear Measured and Starch Content (%) to the Y role. From the drop-down list in the title bar next to Chart.10 Importing. Therefore. two charts are produced: • A plot of starch content and thread wear against wash method. Using the Overlay Plot Platform In this example. Select Graph > Overlay Plot from the menu bar. The entire chart changes to a line plot. • A plot of starch content versus thread wear for each level of wash method. select Y Options > Line Chart. charting options can be applied to individual levels as well as entire charts.2 Half Needle Chart Click somewhere in the blank area above the legend to deselect all columns. . and Charting Data Introduction 173 10 Graphs Figure 10.

and Charting Data Introduction Click OK. From the drop-down list in the title bar next to Overlay Plot. This produces a plot with both variables plotted on the y-axis. select Y Options > Connect Points. To connect the points and produce the plot shown in Figure 10. Exporting.3 Final Overlay Plot . Figure 10.174 10 Importing.3.

JMP Student Edition gives additional information about the file in the File Open dialog. The dialog shows the table notes. Select Starch Content (%) from the list of columns and click the Y button. Windows The Files of Type selection filters the list of files displayed in the dialog. JMP journal files.10 Importing. if they exist. SAS transport files. If *. . Again select Graph > Overlay Plot from the menu bar. This works as long as the file has the structure indicated by its name.* is chosen from the Files of Type menu. one for each level of the Method variable. Select Method from the list of columns and click the By button. JMP Script files. Importing Data The File > Open command displays a specialized open file dialog used to locate a file to open and tell JMP Student Edition the file format of the incoming file. The graphs shown here have been reduced in size by holding down the control or key and dragging on the corner of one graph. JMP looks at the type of file given by the 3-character extension appended to its file name and opens it accordingly. Select Size of Load from the list of columns and click the X button.4 shows an Open Data File dialog when the Files of Type drop-down list is changed from the default to JMP data table. This produces three separate plots. text files with any column delimiter. and flat-file database files. They all resize. If applicable. The example in Figure 10. and Excel files. JMP Student Edition directly reads JMP data tables. Exporting. and Charting Data Importing Data 175 10 Graphs To produce the plot of starch content versus load size for each level of wash method. The Open command then reads the file into a JMP Student Edition data table.

176 10 Importing.4 The Open Data File Dialog to Read a JMP table Macintosh The Open window allows you to open any type of file into JMP. . Exporting. If you select a text document to open. and Charting Data Importing Data Figure 10. the Open As menu appears in the window (see the next section for details on text importing).

10 Importing. You then see the dialog shown in Figure 10. Macintosh On the Macintosh. This is adequate for a rectangular text file with no missing fields. • Data (Using Preferences) opens the file and uses default rules (set in the preferences panel) to interpret end of field and end of line delimiters to create a JMP data table All Platforms JMP Student Edition attempts to discern the arrangement of text data. • Data (Using Preview) presents a dialog similar to the one shown in Figure 10. Exporting. Windows Under Microsoft Windows. that lets you designate delimiters and other information. This displays the Open As menu in the lower part of the dialog. Choose Text from the Files of Type drop-down list to import the data based on your current preference settings. Then choose one of the three text commands: • Text opens the file in a simple text editing window • Data (Best Guess) opens the data in a format that JMP thinks is appropriate based on the contents of the file. and an end-of-line delimiter . and Charting Data Importing Data 177 10 Graphs Importing Text Files You import text be doing one of three things. a consistent field delimiter.5. Select the Data With Preview checkbox to see a preview of an incoming text file.5 for specification of delimiters and other import details. select a text file to open. depending on the operating system you are working with. and shows a preview of the resulting data table.

and height. If no column names are indicated. age. and so on. including spaces embedded within the quotes. end-of-line delimiters. are ignored and treated as part of a text string. In Figure 10. Other text delimiters. Column 2. and the ability to set how many rows and columns will be read are additional options presented in the dialog.5 Import Text File . Exporting. data types. The initial settings in the delimited import dialog are taken from the current Preferences file. and Charting Data Importing Data Note: If double-quotes are encountered when importing text data. One or more end-of-field delimiters. the column names are name.178 10 Importing. JMP changes the delimiter rules to look for an end double-quote. and the first two rows of data.5. preferences are set that indicate the incoming table contains column headers to be used as the column names. The dialog also shows the column names. sex. the Name fields are called Column 1. Figure 10. the option to Strip enclosing quotes.

JMP Student Edition opens all the worksheets in the workbook. Excel worksheet and workbooks are imported simply by choosing Excel Files from the Files of Type (Windows) or Enable (Macintosh) lists as shown in Figure 10. Importing Microsoft Excel Files JMP Student Edition has the ability to directly import Microsoft Excel worksheets and workbooks under Macintosh and Microsoft Windows.6 Excel Open Choice JMP Student Edition can also import Excel workbooks that contain several data tables inside them. click the Try Fixed Width button to specify the separations of each column to be imported. After selecting Excel Files(*. and Charting Data Importing Data 179 10 Graphs Fixed-Width Text If your data is fixed width (that is. each variable uses a set number of columns in the text file). Exporting. Figure 10.10 Importing. .xls) from the Open dialog and double-clicking on the desired workbook.6.

7) Figure 10. The Chart Platform The Chart platform computes and plots data and statistics about the data. complete the following steps after bringing up the Chart launch dialog (Figure 10.180 10 Importing. Exporting. For that reason. Select Edit > Copy. Don’t worry too much about getting the orientation and chart type correct initially — they can be changed after the chart has been generated. From the section labeled Options. use the drop-down list to select the orientation of the chart (vertical or horizontal) and the type of chart (bar. To copy results into another program. what is going to be reported should be known before the Chart platform is used.8) and select the statistic to be charted. In essence. It is used to report the results from other explorations. and Charting Data Results from Platforms Results from Platforms The results from JMP Student Edition’s platforms can be cut and pasted into other programs using the system’s clipboard. the Chart platform is not intended as an exploratory device. through standard cut and paste facilities. select Edit > Paste. Hold down the Shift key and click on each part of the report that needs to be copied. Unlike the statistical platforms in JMP. line. In the word processor. or point chart) to be generated.7 Chart Launch Dialog Select the data column in the column list. Note that axes frequently need to be selected in addition to the graphs they accompany. Click the Statistics button (revealing the menu shown in Figure 10. needle. To plot descriptions of data. . the plots in the Chart platform do not “bristle with interactivity” to the degree that other platforms do. pie. Select the Selection tool ( ).

either in separate windows or overlaid in the same window.9 Chart Example Many of the options on the launch dialog can be changed using the platform popup menu.9. or a Grouping variable to generate separate graphs for each level of the column. X Levels is shown in Figure 10. Freq. However. Exporting. some options can be applied to individual charts. plotting the mean of Starch Content (%) using Method as Categories.10 Importing. An example bar chart. and Charting Data The Chart Platform 181 10 Graphs Optionally. Platform options affect all charts in the report window. include an X. Weight. Click OK.8 Plottable Statistics Figure 10. . and By options work as in other platforms. Level column to be plotted on the horizontal axis. Figure 10.

When this menu is accessed through a chart legend. • Std Error Bars overlays needles at plus or minus one standard error from the mean. • Point Chart shows only the plot points. • Line Chart replaces a bar chart with a line chart and connects each point with a straight line. and Charting Data The Chart Platform Single-Chart Options To apply the following options to individual charts. When accessed through the platform popup menu. • Show Points toggles the point markers on a line or needle chart off and on. Exporting. leaving a point chart. without connecting them. • Connect Points toggles the line connecting points on and off. • Overlay Marker assigns plot points a marker to identify them in overlaid charts. • Overlay Color assigns color to a variable to identify it when overlaid with other charts. .10.182 10 Importing. the commands apply only to the individual chart. right-click on a chart legend to see the popup menu shown in Figure 10. • Pen Style allows the choice of a line style from the palette shown in Figure 10.10. • Overlay Pattern assigns bars a fill pattern to identify them in overlaid charts. the commands apply to all charts.10 Chart Options • Bar Chart displays a bar for each level of the chart variables. Figure 10. Choose the Show Points option to show or hide the points. without any legends highlighted. • Needle Chart replaces each bar with a line drawn from the axis to the plotted value. The default chart is a bar chart.

The Preferred Size is set in the Preferences. Outlined mode is often used in presentations. increment.10 Importing. which can also be shown by double-clicking in the Y axis. • Marker Drawing Mode lets you adjust the way markers are drawn. Exporting. • Size/Scale allows changes in axes and plot frames. The marker size applies to the plot point and its associated rows. and draws tailored reference lines or a grid on the plot. • Marker Size allows selection of the marker from a palette of six point sizes that range from dot to very large. and redraw the plot.11 Frame Options • Background Color colors the background of the plot with the color chosen from the JMP Student Edition color palette. minimum. The Frame Size command displays a dialog to enter the exact pixel size for the plot frame. Figure 10. X Scale is only active on platforms where the X axis is numeric (the X axis for the Charts platform is categorical). and Charting Data The Chart Platform 183 10 Graphs Frame Options Frame options allow control of the plot frame’s elements as a whole. • Add Graphics Script displays a text entry box to enter JSL commands. usually to tailor the graphics output in ways not provided with commands and options. Y Scale shows the standard axis scale dialog. and tick marks for the axis. Fast mode is useful for graphs with large number of points. . • DisplayBox lists the commands to conveniently select and deselect the plot area. The axis scale dialog allows the specification of the maximum. See the JMP Scripting Language Guide for documentation of JSL commands. • Border lets you turn border lines on or off. so that they re-draw quickly after adjustments.

” p.” p. These options affect highlighted bars. Each chart can have its own chart type. • Y Options accesses the options described in “Single-Chart Options. 182. which are identical to their Overlay equivalents in Figure 10. or save the JSL commands for the analysis to a window or a file.10.” p. By default. • Level Options accesses the color. Figure 10. the axis notation only shows for the last chart displayed if the charts are not overlaid. with one as a bar chart and one as a needle chart. The commands affect only the highlighted level and its associated rows in the data table. For example.12 Different Chart Types • Vertical changes a horizontal bar chart or a pie chart to a vertical bar chart. When Overlay is not checked. Platform Options After the charts appear. the platform shows duplicate axis notation for each chart. Exporting.) . These options affect the variable whose legend is highlighted. • Separate Axes duplicate the axis notation for each chart when there are multiple charts. • Overlay displays a single overlaid chart when the chart has more than one Y (statistics) variable.184 10 Importing. marker. 184. Right-clicking on a highlighted legend shows the commands for the Colors and Markers palettes. Separate Axes is only enabled on the menu when there are multiple Y variables that are not overlaid. • Script has a submenu of commands available to all platforms that redo the analysis.12 has two overlaid variables. which can be overlaid. their drop-down menu has the following options. 66. • Horizontal changes a vertical bar chart or a pie chart to a horizontal bar chart • Pie changes a horizontal or vertical chart into a pie chart. the chart shown in Figure 10. and Charting Data The Chart Platform Level Options Click on a value in the legend to highlight it. (See “Script Submenu. and pattern palettes options described in “Level Options.

Connect Points can be activated when Show Points is not. the plots show separately. The section “Single-Plot Options. There is also a single-plot options menu for each Y variable. (See “Script Submenu. Plots initially appear overlaid with the Connect Points option in effect.” p. Exporting.” p. If one or more plot legends are highlighted. which show when the Y variable legend beneath the plot is right-clicked. . When Separate Axes is selected. Each Y variable’s plot can be modified by right-clicking on the variable name to bring up a menu. described below. When Overlay option is turned off. regardless if there are missing values between them. Platform Options Platform options affect every plot in the report window. the options affects all those plots. • Range Plot connects the lowest and highest points at each X value with a line with bars at each end. • Ungroup Plots creates a separate chart for each level of a grouping variable • Arrange Plots allows you to specify the number of plots in each row • Script has a submenu of commands available to all platforms that redo the analysis or save the JSL commands for the analysis to a window or a file. the X axes for other Y variables show tick marks but show no scale values. and Range option cannot be selected at the same time. Note: The Needle option. 66. When one of these options is selected at the platform level. and Charting Data The Overlay Plot Platform 185 10 Graphs The Overlay Plot Platform The Overlay Plot platform overlays numeric Y variables with a single numeric or character X variable. The individual plot options are the same as those in the Y Options submenu at the platform level. it affects all plots in the report if no legends are highlighted. • Needle draws a vertical line from each point to the X axis. 186. • Connect Through Missing connects adjacent points in the plot. on the last plot in the window. • Y Options has a submenu of options that apply to all variables and plots in the report window when selected from the main platform window. allowing for greater flexibility in plot displays.10 Importing.) Single-Plot Options Each Y variable is labeled beneath the plot. showing its name and symbol. • Show Points alternately shows or hides points. describes Y options for each individual variable. • Uniform Y Scale makes the Y scales the same on grouped plots. • Separate Axes lets the X axis scale values be printed only once. the values of the X variable appear in ascending order. • Connect Points is a toggle that alternately connects the points with lines. Optionally. • Overlay overlays plots for all columns assigned the Y role. with points plotted and connected in that order. The Overlay Plot platform has platform plotting options accessed by the popup menu icon on the Overlay Plot title bar.

186

10 Importing, Exporting, and Charting Data The Overlay Plot Platform

• Step joins the position of the points with a discrete step by drawing a straight horizontal line from each point to the X value of the following point, and then a straight vertical line to that point. Step, without showing points, is illustrated in Figure 10.13. Note: Only one of Connect Points, Needle, and Step can be chosen at a time. Figure 10.13 Overlay Step Plot

• Function Plot plots a formula (stored in the Y column) as a smooth curve. To use this function, store a formula in the Y column that is a function of a single X column. For example, the following column contains a formula involving the sine function. When used in an overlay plot, the function is plotted as a curve rather than individual points.

Note: Overlay Plot normally assumes you want a function plot when the Y column contains a formula. However, formulas that contain random number functions are more frequently used with simulations, where function plotting is not often wanted. Therefore, the Function Plot option is off (by default) when a random number function is present, but on for all other functions. • Connect Color displays the standard JMP color palette (see Figure 10.10) for assigning colors to lines that connect points. • Overlay Marker assigns markers to plotted points using the standard JMP marker palette (see Figure 10.10). • Overlay Marker Color lets you select the color of the overlay marker. • Line Style and Line Width let you adjust the appearance of the lines drawn on the plot.

10 Importing, Exporting, and Charting Data The Overlay Plot Platform

187 10 Graphs

11

Full Factorial Designs

Designing Experiments

A full factorial design contains all possible combinations of a set of factors. This is the most conservative design approach, but it is also the most costly in experimental resources. The full factorial designer supports both continuous factors and categorical factors with up to nine levels. In full factorial designs, you perform an experimental run at every combination of the factor levels. The sample size is the product of the numbers of levels of the factors. For example, a factorial experiment with a two-level factor, a three-level factor, and a four-level factor has 2 x 3 x 4 = 24 runs. Factorial designs with only two-level factors have a sample size that is a power of two (specifically 2f where f is the number of factors). When there are three factors, the factorial design points are at the vertices of a cube as shown in the diagram below. For more factors, the design points are the vertices of a hypercube. Full factorial designs are the most conservative of all design types. There is little scope for ambiguity when you are willing to try all combinations of the factor settings. Unfortunately, the sample size grows exponentially in the number of factors, so full factorial designs are too expensive to run for most practical purposes.

Introduction

The following example, adapted from Meyer et al. (1996) and Box, Hunter, and Hunter (1978), shows a five-factor reactor example. To follow along with this example, open the folder Sample Data that was installed when you installed JMP. Within this folder, open Design Experiment > Reactor 32 Runs.jmp. Suppose you have used the screening designer to investigate the effects of five factors on the percent reaction of a chemical process. The factors (Feed Rate, Catalyst, Stir Rate, Temperature, and Concentration) are all two-level continuous factors. 1 Select DOE > Full Factorial Design. 2 Click the red triangle icon on the Full Factorial Design title bar and select Load Responses. 3 Open Reactor Response.jmp to load the responses by opening the Sample Data folder that was installed with JMP. In the Sample Data folder, open Design Experiment > Reactor

Response.jmp.

4 Click the red triangle icon on the Full Factorial Design title bar and select Load Factors. 5 Open Reactor Factors.jmp to load the responses by opening the Sample Data folder that was installed with JMP. In the Sample Data folder, open Design Experiment > Reactor Factors.jmp.

190

11 Full Factorial Designs Introduction

The completed dialog is shown in Figure 11.1. Figure 11.1 Full-Factorial Example Response and Factors Panels

6 Click Continue to see Output Options panel. A full factorial design includes runs for all combinations of high and low factors for the five variables, giving 32 runs. 7 Click Make Table. The design data table (Figure 11.3) contains a run for every combination of high and low values for the five variables. Since there are five variables, there are 25=32 runs. This covers all combinations of a five factors with two levels each. Initially, the table has an empty Y column named Percent Reacted for entering response values when the experiment is complete. The values in your table may be different from those shown below. Figure 11.2 25 Factorial Reactor Data

To see the completed experiment and continue following this example, open the folder Sample Data that was installed when you installed JMP. Within this folder, open Design Experiment > Reactor 32

Runs.jmp.

Figure 11. 3 Click the red triangle icon on the Percent Reacted title bar and select Normal Quantile Plot.3 Reactor 32 Runs. 2 The probability to enter a factor (Prob to Enter) in the model should be 0.05. . Then click OK. The results are shown in Figure 11.1.4. 3 The probability to remove a factor (Prob to Leave) should be 0.jmp Begin the analysis with a quick look at the data before fitting the factorial model. 1 Select Analyze > Distribution. 1 Click the red triangle icon next to the Fit Model script and select Run Script. Columns.4 Distribution of Response Variable for Reactor Data Start the formal analysis with a stepwise regression. 2 Highlight Percent Reacted and click Y. and brings up the Stepwise control panel.11 Full Factorial Designs Introduction 191 11 DOE: Full Factorial Designs Figure 11. The data table has a script stored with it that automatically defines an analysis of the model that includes main effects and all two-factor interactions.

6 Starting Model For Stepwise Process The mixed stepwise procedure removes insignificant main effects and adds important interactions.192 11 Full Factorial Designs Introduction Figure 11. The end result is shown in Figure 11.6. make sure the menu beside Direction specifies Mixed. To do this. 6 Click Go. Note that the Feed Rate and Stir Rate factors are no longer in the model. The Model Specification window that appears is automatically set .7 Model After Mixed Stepwise Regression 7 Click the Make Model button.5 Run JSL Script for Stepwise Regression 4 A useful way to use the Stepwise platform is to check all the main effects in the Current Estimates table. Figure 11. Figure 11. 5 Check the boxes for the main effects of the factors as shown in Figure 11.7.

1 Open the Prediction Profiler by clicking the red triangle on the Response Percent Reacted title bar and selecting Factor Profiling > Profiler. The figure on the right in Figure 11.11 Full Factorial Designs Introduction 193 11 DOE: Full Factorial Designs up with the appropriate effects (Figure 11. Figure 11. All effects selected by the stepwise process are highly significant.9 shows the actual by predicted plot for the model.9 shows a table of model coefficients and their standard errors (labeled Parameter Estimates). as shown in Figure 11. which is more than an order of magnitude smaller than the range of predictions.8 Fitting a Prediction Model 8 Click Run Model to see the analysis for a candidate prediction model (Figure 11. The size of the random noise as measured by the RMSE is only 3.9). .10.3311%.9 Actual by Predicted Plot and Prediction Model Estimates The factor Prediction Profiler also gives you a way to compare the factors and find optimal settings. The figure on the left in Figure 11. This is strong evidence that the model has good predictive capability. Figure 11.8). The predicted model covers a range of predictions from 40% to 95% reacted.

Figure 11.11 shows the profiler’s initial display. The reaction is unfeasible economically unless the Percent Reacted is above 90%. .12 Viewing the Maximum Desirability The plot of Desirability versus Percent Reacted shows that the goal is to maximize Percent Reacted. Figure 11.11 Viewing the Profiler 2 Click the red triangle on the Prediction Profiler title bar and select Maximize Desirability to see the profiler in Figure 11.10 Selecting the Profiler Figure 11. Desirability increases linearly as the Percent Reacted increases.194 11 Full Factorial Designs Introduction Figure 11. therefore the Desirability for values less than 90% decreases and finally becomes zero.12.

11 Full Factorial Designs Creating a Factorial Design

195 11 DOE: Full Factorial Designs

The maximum Desirability is 0.945 when Catalyst and Temperature are at their highest settings and Concentration is at its lowest setting. Percent Reacted increases from 65.5 at the center of the factor ranges to 95.875 at the most desirable setting.

**Creating a Factorial Design
**

To start a full factorial design, select DOE > Full Factorial Design, or click the Full Factorial Design button on the JMP Starter DOE page. Then, follow the steps below.

**Entering Responses and Factors
**

To enter responses, follow the steps in Figure 11.13. Then, enter factors as shown in Figure 11.14 Figure 11.13 Entering Responses Click to enter lower and upper limits and importance weights. 4 To enter one 1 response at a time, click then select a goal type: Maximize, Match Target, Minimize, or None.

2 Double-click to edit the response name, if desired.

3 Click to change the response goal, if desired.

Tip: To quickly enter multiple responses, click the N Responses button and enter the

number of responses you want.

196

11 Full Factorial Designs Creating a Factorial Design

Figure 11.14 Entering Factors in a Full Factorial Design To enter factors, click either the Continuous button or the Categorical button and select a factor type, level 2 - 9.

Double-click to edit the factor name.

Click to enter values or change the level names. To remove a level, click it, press the delete key on the keyboard, then press the Return or Enter key on the keyboard.

When you finish adding factors, click Continue.

**Selecting Output Options
**

Use the Output Options panel to specify how you want the output data table to appear:

• Run Order—Lets you designate the order you want the runs to appear in the data table when it is created. Choices are:

Keep the Same—the rows (runs) in the output table appear as they do in the Design panel. Sort Left to Right—the rows (runs) in the output table appear sorted from left to right. Randomize—the rows (runs) in the output table appear in a random order. Sort Right to Left—the rows (runs) in the output table appear sorted from right to left.

• Number of Center Points—Specifies additional runs placed at the center of each continuous factor’s range. • Number of Replicates—Specify the number of times to replicate the entire design, including centerpoints. Type the number of times you want to replicate the design in the associated text box. One replicate doubles the number of runs.

11 Full Factorial Designs Creating a Factorial Design

197 11 DOE: Full Factorial Designs

**Making the Table
**

When you click Make Table, the table shown in Figure 11.15 appears. Figure 11.15 Factorial Design Table The name of the table is the design type that generated it. values in the Pattern column describe the run each row represents for continuous factors, a plus sign represents high levels for continuous factors, a minus sign represents low levels level numbers represent values of categorical factors

This script allows you to easily fit a model using the values in the design table.

12

Screening Designs

Designing Experiments

Screening designs are arguably the most popular designs for industrial experimentation. They examine many factors to see which have the greatest effect on the results of a process. Compared to other design methods, screening designs require fewer experimental runs, which is why they are cheap. Thus, they are attractive because they are a cheap and efficient way to begin improving a process. Often screening designs are a prelude to further experiments. It is wise to spend only about a quarter of your resource budget on an initial screening experiment. You can then use the results to guide further study. The efficiency of screening designs depends on the critical assumption of effect sparsity. Effect sparsity results because real-world processes usually have only a few driving factors; other factors are relatively unimportant. To understand the importance of effect sparsity, you can contrast screening designs to full factorial designs: • Full factorial designs consist of all combinations of the levels of the factors. The number of runs is the product of the factor levels. For example, a factorial experiment with a two-level factor, a three-level factor, and a four-level factor has 2 x 3 x 4 = 24 runs. • By contrast, screening designs reduce the number of runs by restricting the factors to two (or three) levels and by performing only a fraction of the full factorial design. Each factor in a screening design is usually set at two levels to economize on the number of runs needed, and response measurements are taken for only a fraction of the possible combinations of levels. In the case described above, you can restrict the factors to two levels, which yield 2 x 2 x 2 = 8 runs. Further, by doing half of these eight combinations you can still assess the separate effects of the three factors. So the screening approach reduces the 24-run experiment to four runs. Of course, there is a price for this reduction. This chapter discusses the screening approach in detail, showing both pros and cons. It also describes how to use JMP’s screening designer, which supplies a list of popular screening designs for two or more factors. These factors can be continuous or categorical, with two or three levels. The list of screening designs you can use includes designs that group the experimental runs into blocks of equal sizes where the size is a power of two.

Introduction

Suppose an engineer wants to investigate a process that uses an electron beam welding machine to join two parts. The engineer fits the two parts into a welding fixture that holds them snugly together. A voltage applied to a beam generator creates a stream of electrons that heats the two parts, causing them

The Length of this fused area is the depth of penetration of the weld.1 Screening Design Response With Match Target Goal 3 Click the Lower Limit text edit area and enter 0.17 inches To begin this example.22.17 inches with a lower bound of 0.12 as the lower limit (minimum acceptable value).12 and an upper bound of 0. which is a current that affects the intensity of the beam After each processing run. This example has one categorical factor (Operator) and two continuous factors (Speed and Current).1. For this study. Click the Goal text edit area and choose Match Target. and Current. This depth of penetration is the response for the study.22 as the upper limit (maximum acceptable value). but the goal of this process is to get a target value of 0. the engineer cuts the part in half. The ideal depth of the fused region is 0. which are the factors for the study: • Operator. .17 inches. 5 Add two continuous factors by typing 2 in the Continuous box and clicking the associated Add button.). select DOE > Screening Design from the main menu. as shown in Figure 12. 2 The default goal for the single default response is Maximize. there is a single default response called Y. as shown in Figure 12. Then click the Upper Limit text edit area and enter 0. Speed. 7 Set high and low values for Speed to 3 and 5 rpm. who is the technician operating the welding machine • Rotation Speed. Note that in the Responses panel. and assign Mary and John as values for the categorical factor called Operator. Change the default response as follows: 1 Double-click the response name and change it to Depth (In. The goals of the study are to: • find which factors affect the depth of the weld • quantify those effects • find specific factor settings that predict a weld depth of 0. which is the speed at which the part rotates under the beam • Beam Current.2. the engineer wants to explore the following three inputs. Set high and low values for Current to 150 and 165 amps. Figure 12.200 12 Screening Designs Introduction to fuse. 4 Add the categorical factor by clicking the Add button beside 2-Level Categorical. This reveals an area where the two parts have fused. The engineer wants to study the welding process to determine the best settings for the beam generator to produce the desired depth in the fused region. 6 Double-click the factor names and rename them Operator.

Figure 12. 9 Select Full Factorial in the list of designs. click Make Table to create a JMP table that contains the specified design.3.2 Screening Design with Two Continuous and One Categorical Factor 8 Click Continue. The Pattern variable shows the coded design runs. The table uses the names for responses. .4 appears. 10 View the table produced in this example by selecting Help (View on the Macintosh) > Sample Data Directory > Design Experiment > DOE Example 1. The table in Figure 12. or click the Screening Design button on the JMP Starter DOE page.3 List of Screening Designs for Two Continuous and One Categorical Factors When the design details are complete. follow the steps below.12 Screening Designs Creating a Screening Design 201 12 DOE: Screening Designs Figure 12. Figure 12. Then. and levels you specified.4 The Design Data Table Creating a Screening Design To start a screening design. select DOE > Screening Design. and then click Continue. factors.jmp. as shown in Figure 12.

Tip: To quickly enter multiple responses. or None. or that there is no goal. • The Match Target goal supports the objective when the best value for a responses is a specific target value. Match Target. click the N Responses button and enter the number of responses you want. and enter an asymmetric target value. the best value is usually the largest possible. follow the steps in Figure 12. 3 Click to change the response goal. 2 Double-click to edit the response name. such as when the response is impurity or defects. click then select a goal type: Maximize.5): • For responses such as strength or yield. if desired. you can tell JMP that your goal is to obtain the maximum or minimum value possible. The default target value is assumed to be midway between the lower and upper limits. Figure 12. The following description explains the relationship between the goal type (step 3 in Figure 12.5.5 Entering Responses Click to enter lower and upper limits and importance weights. .202 12 Screening Designs Creating a Screening Design Entering Responses To enter responses. A goal of Maximize supports this objective.5) and the lower and upper limits (step 4 in Figure 12. if desired. Note: If your target range is not symmetric around the target value. open the response’s Column Info dialog by double-clicking the column name. 4 To enter one 1 response at a time. to match a specific value. you can alter the default target after you make a table from the design. Minimize. such as with part dimensions. • The Minimize goal supports an objective of having the best objective be the smallest value. In the data table. Specifying Goal Types and Lower and Upper Limits When entering responses.

6. Click to enter factor values. operator. Entering Factors After entering responses. Double-click to edit the factor name. click it. Highlight the factor and click the Remove Selected button to remove a factor in the list. Choosing a Design The list of screening designs you can use includes designs that group the experimental runs into blocks of equal sizes where the size is a power of two. and gender. the order is the sorting sequence. If the values are numbers. enter factors. when designing experiments. Space Filling Design. To remove a level. press the delete key on your keyboard. Below is a description of factor types from which you can choose when creating screening designs: • Continuous Continuous factors have numeric data types only. After your responses and factors are entered. In theory. • Categorical Categorical factors (either numerical or categorical data types) have no implied order. then specifying importance is unnecessary.12 Screening Designs Creating a Screening Design 203 12 DOE: Screening Designs Understanding Importance Weights When computing overall desirability. Types of Factors In general.6 Entering Factors To enter factors.5) as the weight of each response. Mixture Design. then press the Return or Enter key on your keyboard. The Factors panel’s appearance depends on the design you select. This process is described below. Examples of categorical factors are machine. With two responses you can give greater weight to one response by assigning it a higher importance value. JMP uses the value you enter as the importance weight (step 4 in Figure 12. If the values are character. The settings of a categorical factor are discrete and have no intrinsic order. the order is the numeric magnitude. and Response Surface Design. click Continue. in Figure 12. you can enter different types of factors in the model. you can set a continuous factor to any value between the lower and upper limits you supply. type the number of factors and click Add. . Entering factors is the same in Screening Design. Figure 12. If there is only one response. Highlight the type of screening design you would like to use and click Continue.

. this is 2k where k is the number of factors. The samples size is the product of the levels of the factors. For two-level designs. Like the full factorial. fractional factorials are designed by deciding in advance which interaction effects are confounded with the other interaction effects. This causes some of the higher-order effects in a model to become nonestimable.8 is a geometric representation of a two-level factorial. This means that the estimates of the effects are uncorrelated. These higher-order interactions are assumed to be zero. But because running experiments costs time and money. These designs are orthogonal. Full factorial designs allow the estimation of interactions of all orders up to the number of factors. Two-Level Fractional Factorial A fractional factorial design also has a sample size that is a power of two. The fraction of the full factorial is 2-p.204 12 Screening Designs Creating a Screening Design Figure 12. you typically only run a fraction of all possible levels. If price is no object. The trade-off in screening designs is between the number of runs and the resolution of the design.7 Choosing a Type of Screening Design The screening designer provides the following types of designs: Two-Level Full Factorial A full factorial design contains all combinations of the levels of the factors. Their p-values change slightly. which must be assumed to be zero for the main effects to be meaningful. including interaction effects to the mth degree. Most empirical modeling involves first. because the estimate of the error variance and the degrees of freedom are different. the number of runs is 2k – p where p < k. In fact. This provides a good estimate of everything. fractional factorial designs are orthogonal. This can be expensive if the number of factors is greater than 3 or 4. the values of the other estimates remain the same. you can run several replicates of all possible combinations of m factor levels. • Resolution = 4 means that main effects are not confounded with other main effects or two-factor interactions. few experimenters worry about interactions higher than two-way interactions. Resolution Number: The Degree of Confounding In practice. If k is the number of factors. However.or second-order approximations to the true functional relationship between the factors and the responses. Experiments can therefore be classified by resolution number into three groups: • Resolution = 3 means that main effects are confounded with one or more two-way interactions. If you remove an effect in the analysis. The figure to the left in Figure 12. An effect is nonestimable when it is confounded with another effect. two-factor interactions are confounded with other two-factor interactions.

The main effects are orthogonal and two-factor interactions are only partially confounded with main effects. 1. -1 1. listed below. 1 L18 John L18 Chakravarty L18 Hunter L36 1 3 8 11 7 6 4 12 . Mixed-Level Designs If you have qualitative factors with three values. The figure on the right in Figure 12. One useful characteristic is that the sample size is a multiple of four rather than a power of two. Table 12. -1. –1 -1. All the fractional factorial designs are minimum aberration designs. then none of the classical designs discussed previously are appropriate. JMP offers complete factorials and specialized orthogonal-array designs. 1. This is different from resolution-three fractional factorial where two-factor interactions are indistinguishable from main effects. between two-factor interactions.12 Screening Designs Creating a Screening Design 205 12 DOE: Screening Designs • Resolution ≥ 5 means there is no confounding between main effects. For DOE experts. 24-run. Figure 12.8 is geometric representation of a two-level fractional factorial design. and 28-run Plackett-Burman designs. -1. -1 1. For pure three-level factorials. -1 1. 1 Plackett-Burman Designs Plackett-Burman designs are an alternative to fractional factorials for screening.8 Representation of Full Factorial (Left) and Two-Level Fractional Factorial (Right) Designs –1. or between main effects and two-factor interactions. 1. there are 20-run. a stepwise regression approach can allow for removing some insignificant main effects while adding highly significant and only somewhat correlated two-factor interactions. JMP offers fractional factorials.1 Design Two–Level Factors Three–Level Factors -1. For mixed two-level and three-level designs. In cases of effect sparsity. –1. However. the minimum aberration design of a given resolution minimizes the number of words in the defining relation that are of minimum length. There are no two-level fractional factorial designs with sample sizes between 16 and 32 runs.

The danger is that several active effects with mixed signs will cancel and still sum to near zero and give a false negative. Some of these designs are not balanced. subject to randomizing the runs. These are of the form factorOdd and factorEven where factor is a factor name.7). A Cotter design begins with a run having all factors at their high level. Click the Platform icon. For k factors. there are 2k + 2 runs. if you would like to make a Cotter design: 1 Immediately after entering responses and factors (and before clicking Continue). This completes the Cotter design. To alter the setting for all screening designs: 1 2 3 4 Select File > Preferences. testing the parameters on these combinations is equivalent to testing the combinations on the original effects. AOdd estimates the sum of odd terms involving A. . if you have three factors. and C: Figure 12. click the red triangle icon in the Screening Design title bar. When you use JMP to generate a Cotter design. Cotter designs are easy to set up. JMP does not include a Cotter design in the list of available screening designs (Figure 12. You believe in this so strongly that you are willing to bet that if you add up a number of effects. How to Run a Cotter Design By default.206 12 Screening Designs Creating a Screening Design If you have fewer than or equal to the number of factors for a design listed in the table. the sum will show an effect if it contains an active effect. Uncheck the box beside Supress Cotter Designs. Suppose you believe in effect sparsity— that very few effects are truly nonzero. you can use that design by selecting an appropriate subset of columns from the original design. JMP also includes a set of extra columns to use as regressors. Changing the setting via the red triangle menu applies only to the current design. A. and you believe there may be interactions.9 AOdd = A + ABC BOdd = B + ABC COdd = C + ABC AEven = AB + AC BEven = AB + BC CEven = BC + AC Because these columns in a Cotter design make an orthogonal transformation. AEven estimates the sum of the even terms involving A. The design is similar to the “vary one factor at a time” approach many books call inefficient and naive. even though they are all orthogonal. However. 2 Select Supress Cotter Designs. and the others high. many statisticians discourage their use. Because Cotter designs have a false-negative risk. In the example of factors listed above. Then follow k runs each with one factor in turn at its low level. They are constructed by adding up all the odd and even interaction terms for each factor. The next run sets all factors at their low level and sequences through k more runs with one factor high and the rest low. and so forth. B. Cotter Designs Cotter designs are used when you have very few resources and many factors. For example. Click DOE to highlight it.

The price of reducing the number of runs from 32 to eight is effect aliasing (confounding).11). . Figure 12. • Aliasing of Effects—Shows the confounding pattern for fractional factorial designs.10). It is necessary to construct the two additional factors in terms of the first three factors.12 Screening Designs Creating a Screening Design 207 12 DOE: Screening Designs Displaying and Modifying the Design After you select a design type. a full factorial with five factors requires 25 = 32 runs. Aliasing of Effects To see which effects are confounded with which other effects. Eight runs can only accommodate a full factorial with three two-level factors. Figure 12. Confounding is the direct result of the assignment of new factor values to products of the coded design columns. click the disclosure button ( on Windows and on the Macintosh) to reveal the Aliasing of Effects panel.10 Display and Modification Options • Change Generating Rules—Controls the choice of different fractional factorial designs for a given number of factors. • Coded Design—Shows the pattern of high and low values for the factors in each run. click the disclosure buttons ( on Windows and on the Macintosh) to display the design and show modification options using the Display and Modify Design panel to tailor the design (Figure 12. It shows effects and confounding up to two-factor interactions (Figure 12.11 Generating Rules and Aliasing of Effects Panel For example.

the values for Temperature are the product of the values for Feed Rate and Concentration. and Stir Rate). with second order aliasing as “1 5” (Feed Rate and Concentration). Figure 12. and third order confounding as “1 2 3” (Feed Rate. Thus. Figure 12.12 Show Confounding Patterns 3 Enter the order of confounding you want to see (Figure 12.11. . Temperature appears as “4”. This means that you can’t tell the difference of the effect of Temperature and the synergistic (interactive) effect of Feed Rate and Concentration. 2 Select Show Confounding Pattern (Figure 12. To create this table: 1 Click the red triangle at the bottom of the Aliasing of Effects area. Viewing the Confounding Pattern JMP can create a data table that shows the aliasing pattern for a specified level. Catalyst. Figure 12. In the example shown in Figure 12.14 shows the third level alias for the five-factor reactor example.13 Enter Order 4 Click OK. The effect names begin with C (Constant) and are shown by their order number in the design.13). This is characteristic of resolution-three designs.12).208 12 Screening Designs Creating a Screening Design For example. all the main effects are confounded with two-factor interactions.

The check marks for Temperature show it is a function of Feed Rate. the last column (Concentration) is the product of the second and third columns. and Stir Rate are all combinations of high and low values (a full factorial design). The checkmarks for Concentration show it is a function of Catalyst and Stir Rate.16 shows how the last two columns are constructed in terms of the first three columns. Figure 12.12 Screening Designs Creating a Screening Design 209 12 DOE: Screening Designs Figure 12. it changes the choice of different fractional factorial designs for a given number of factors. which represent Feed Rate. If you check the options as shown in Figure 12. .15 Default Coded Designs Changing the Coded Design In the Change Generating Rules panel. and Stir Rate. Catalyst. Temperature is now the product of Feed Rate and Catalyst.16 and click Apply. changing the checkmarks and clicking Apply changes the coded design. Concentration is a function of Feed Rate and Stir Rate. The Change Generating Rules table in Figure 12. Similarly. Plus signs designate high levels and minus signs represent low levels. As shown in Figure 12. rows for the first three columns of the coded design. The fourth column (Temperature) of the coded design is the element-by-element product of the first three columns. the Coded Design panel changes. so the fourth column of the coded design is the element by element product of the first two columns. Catalyst. and Stir Rate). The first three columns of the coded design remain a full factorial for the first three factors (Feed Rate. Catalyst. each row represents a run.15.14 The Third Level Alias for the Five-Factor Reactor Example Understanding the Coded Design In the coded design panel.

click Make Table. Choices are: Keep the Same—the rows (runs) in the output table appear as they do in the Design panel. Sort Left to Right—the rows (runs) in the output table appear sorted from left to right. including centerpoints. .16 Modified Coded Designs and Generating Rules Specifying Output Options Use the Output Options panel to specify how you want the output data table to appear.210 12 Screening Designs Creating a Screening Design Figure 12. • Number of Center Points—Specifies additional runs placed at the center points. • Number of Replicates—Specify the number of times to replicate the entire design. Randomize within Blocks—the rows (runs) in the output table will appear in random order within the blocks you set up. One replicate doubles the number of runs. Sort Right to Left—the rows (runs) in the output table appear sorted from right to left. Randomize—the rows (runs) in the output table appear in a random order. When the options are correctly set up.17 Select the Output Options • Run Order—Lets you designate the order you want the runs to appear in the data table when it is created. Type the number of times you want to replicate the design in the associated text box. Figure 12.

The column called Pattern shows the pattern of low values denoted “–” and high values denoted “+”.18 The Design Data Table The name of the table is the design type that generated it. The data table contains a script labeled Model. Pattern is especially useful as a label variable in plots. Continuing the Analysis After creating and viewing the data table. Figure 12. you can now run analyses on the data. the high and low values you specified are displayed for each run. you have a data table that outlines your experiment.19 Running the Model Script . This script allows you to easily fit a model using the values in the design table. Right-click it and select Run Script to run a fit model analysis (Figure 12.19). Figure 12.12 Screening Designs Creating a Screening Design 211 12 DOE: Screening Designs Viewing the Table After clicking Make Table. In the table.

20 An Actual-by-Predicted Plot To show labels in the graph (on the right in Figure 12.20. but this does not mean that the experiment has failed. and RMSE appear below the plot. Figure 12. The last column of the table has the p-values for each effect.212 12 Screening Designs Creating a Screening Design The next sections describe some of the parts of the analysis report that appears when you click Run Model. In Figure 12. At this stage the results are not clear. Viewing an Actual-by-Predicted Plot When the model contains no interactions. appears at the top of the Fit Model report. the mean line falls inside the bounds of the 95% confidence curves. It means that some follow-up runs are necessary. The model p-value. shown on the left in Figure 12. it is impossible to determine which two-factor interactions are important without performing more experimental runs. Viewing a Scaled Estimates Report When you fit the model. R2. select all points. JMP displays a Scaled Estimates report (Figure 12. Because of the confounding between two-factor interactions and main effects in this design. the RMSE is 14. but the Catalyst effect is large enough to be interesting if it is real.199. This suggests that effects other than the main effects of each factor are important. The pattern variable displayed in the data table serves as the label for each point. . The RMSE is an estimate of the standard deviation of the process noise assuming that the unestimated effects are negligible. In this case.20. right-click the graph. which tells you that the model is not significant. and select Row Label.20). None of the factor effects are significant. an actual-by-predicted plot.21) as a part of the Fit Model report. The Scaled Estimates report displays a bar chart of the individual effects embedded in a table of parameter estimates. which is much larger than expected.

you might also want to have complete 32-run factorial experimental data and analysis. . For comparison.Figure 12.21 Example of a Scaled Estimates Report If this scaled estimates report were not merely an example. you would then want to augment the design.

.

The Box-Behnken design. Central Composite Design fractional factorial points Box-Behnken Design axial points center points . If a minimum or maximum response exists inside the factor region. for which all but one factor are set at zero (midrange) and that one factor is set at outer (axial) values.13 Response Surface Designs Response surface designs are useful for modeling a curved quadratic surface to continuous factors. is an alternative to central composite designs. • Axial (or star) points. The most popular response surface design is the central composite design. It combines a two-level fractional factorial and two other kinds of points: • Center points. Three distinct values for each factor are necessary to fit a quadratic function. This is sometimes useful when it is desirable to avoid these points due to engineering considerations. for which all the factor values are at the zero (or midrange) value. illustrated in the figure to the left below. a response surface model can pinpoint it. One distinguishing feature of the Box-Behnken design is that there are only three levels per factor. illustrated in the figure on the right below. The price of this characteristic is the higher uncertainty of prediction near the vertices compared to the central composite design. so the standard two-level designs cannot fit curved surfaces. Another important difference between the two design types is that the Box-Behnken design has no points at the vertices of the cube defined by the ranges of the factors.

and Sulfur used to manufacture the tennis balls. In real life.1 shows the completed Response panel and Factors panel. Navigate to the Sample Data folder that was installed when you installed JMP. Figure 13.2.jmp. Figure 13. you would conduct the experiment and then enter the responses into the data table. To see this example data table.216 13 Response Surface Designs Introduction Introduction The Bounce Data. Let’s pretend this happened and use a finalized data table called Bounce Data. and select Design Experiment > Bounce Data. open the Sample Data folder that was installed when you installed JMP. The bounciness varies with amounts of Silica. Navigate to the Sample Data folder that was installed when you installed JMP. and select Design Experiment > Bounce Response. 3 Load the responses by clicking the red triangle icon on the Response Surface Design title bar and selecting Load Responses.jmp. Figure 13. The experimenter wants to collect data over a wide range of values for these variables to see if a response surface can find a combination of factors that matches a specified bounce target.3.1 Response and Factors For Bounce Data After the response data and factors data are loaded.2 Response Surface Design Selection The Box-Behnken design selected for three effects generates the design table of 15 runs shown in Figure 13. the Response Surface Design Choice dialog lists the designs in Figure 13. The objective of this experiment is to match a standardized target value (450) of tennis ball bounciness. 2 Load factors by clicking the red triangle icon on the Response Surface Design title bar and selecting Load Factors.jmp sample data file has response surface data inspired by the tire tread data described in Derringer and Suich (1980). and select Design Experiment > Bounce Factors. Silane.jmp. To follow this example: 1 Select DOE > Response Surface Design. .jmp.

with parameter estimates for all response surface and crossed effects in the model.jmp data table.3 JMP Table for a Three-Factor Box-Behnken Design After obtaining the Bounce Data. All main effect terms are significant as well as the two interaction effects involving Sulfur.5. The data table contains a script labeled Model. 2 Right-click Model and select Run Script to start a fit model analysis. Figure 13. The standard Fit Model analysis results appear in tables shown in Figure 13. run a fit model analysis on the data.jmp (Figure 13.3).13 Response Surface Designs Introduction 217 13 DOE: Response Surface Designs 1 Open to the Sample Data folder that was installed when you installed JMP.4 JMP Statistical Reports for a Response Surface Analysis of Bounce Data The Response Surface report also has the tables shown in Figure 13. and select Design Experiment > Bounce Data. . The prediction model is highly significant with no evidence of lack of fit. 3 Click Run Model. Figure 13.4.

218

13 Response Surface Designs Creating a Response Surface Design

Figure 13.5 Statistical Reports for a Response Surface Analysis

Provides a summary of the parameter estimates

Lists the critical values of the surface factors and tells the kind of solution (maximum, minimum, or saddlepoint). The solution for this example is a saddlepoint. The table also warns that the critical values given by the solution are outside the range of data values. Shows eigenvalues and eigenvectors of the effects. The eigenvector values show that the dominant negative curvature (yielding a maximum) is mostly in the Sulfur direction. The dominant positive curvature (yielding a minimum) is mostly in the Silica direction.

**Creating a Response Surface Design
**

Response Surface Methodology (RSM) is an experimental technique invented to find the optimal response within specified ranges of the factors. These designs are capable of fitting a second-order prediction equation for the response. The quadratic terms in these equations model the curvature in the true response function. If a maximum or minimum exists inside the factor region, RSM can find it. In industrial applications, RSM designs involve a small number of factors. This is because the required number of runs increases dramatically with the number of factors. Using the response surface designer, you choose to use well-known RSM designs for two to eight continuous factors. Some of these designs also allow blocking. Response surface designs are useful for modeling and analyzing curved surfaces. To start a response surface design, select DOE > Response Surface Design, or click the Response Surface Design button on the JMP Starter DOE page. Then, follow the steps below: • “Entering Responses and Factors,” p. 219 • “Choosing a Design,” p. 219 • “Specifying Axial Value (Central Composite Designs Only),” p. 220 • “Specifying Output Options,” p. 221 • “Viewing the Design Table,” p. 221 • “Continuing the Analysis, If Needed,” p. 222

13 Response Surface Designs Creating a Response Surface Design

219 13 DOE: Response Surface Designs

**Entering Responses and Factors
**

The steps for entering factors in a response surface design are unique to this design. To add factors, follow the step in Figure 13.6. Figure 13.6 Entering Factors into a Response Surface Design

Click Continue to proceed to the next step.

Choosing a Design

Highlight the type of response surface design you would like to use and click Continue. Figure 13.7 Choose a Design Type

The Response Surface designer provides the following types of designs: Box-Behnken Designs The Box-Behnken design has only three levels per factor and has no points at the vertices of the cube defined by the ranges of the factors. This is sometimes useful when it is desirable to avoid extreme

220

13 Response Surface Designs Creating a Response Surface Design

points due to engineering considerations. The price of this characteristic is the higher uncertainty of prediction near the vertices compared to the central composite design. Central Composite Designs The response surface design list contains two types of central composite designs: uniform precision and orthogonal. These properties of central composite designs relate to the number of center points in the design and to the axial values: • Uniform precision means that the number of center points is chosen so that the prediction variance at the center is approximately the same as at the design vertices. • For orthogonal designs, the number of center points is chosen so that the second order parameter estimates are minimally correlated with the other parameter estimates.

**Specifying Axial Value (Central Composite Designs Only)
**

When you select a central composite (CCD-Uniform Precision) design and then click Continue, you see the panel in Figure 13.8. It supplies default axial scaling information. Entering 1.0 in the text box instructs JMP to place the axial value on the face of the cube defined by the factors, which controls how far out the axial points are. You have the flexibility to enter the values you want to use. Figure 13.8 Display and Modify the Central Composite Design

• Rotatable makes the variance of prediction depend only on the scaled distance from the center of the design. This causes the axial points to be more extreme than the range of the factor. If this factor range cannot be practically achieved, it is recommended that you choose On Face or specify your own value. • Orthogonal makes the effects orthogonal in the analysis. This causes the axial points to be more extreme than the –1 or 1 representing the range of the factor. If this factor range cannot be practically achieved, it is recommended that you choose On Face or specify your own value. • On Face leaves the axial points at the end of the -1 and 1 ranges. • User Specified uses the value entered by the user, which can be any value greater than zero. Enter that value into the Axial Value text box. If you would like to inscribe the design, click the box beside Inscribe. When checked, JMP re-scales the whole design so that the axial points are at the low and high ends of the range (the axials are –1 and 1 and the factorials are shrunken based on that scaling).

13 Response Surface Designs Creating a Response Surface Design

221 13 DOE: Response Surface Designs

**Specifying Output Options
**

Use the Output Options panel to specify how you want the output data table to appear. When the options are correctly set up, click Make Table. Figure 13.9 Select the Output Options

• Run Order—Lets you designate the order you want the runs to appear in the data table when it is created. Choices are: • Keep the Same—the rows (runs) in the output table will appear as they do in the Design panel. • Sort Left to Right—the rows (runs) in the output table will appear sorted from left to right.

• Randomize—the rows (runs) in the output table will appear in a random order. • Sort Right to Left—the rows (runs) in the output table will appear sorted from right to left. • Randomize within Blocks—the rows (runs) in the output table will appear in random order

within the blocks you set up. • Number of Center Points—Specifies additional runs placed at the center points. • Number of Replicates—Specify the number of times to replicate the entire design, including centerpoints. Type the number of times you want to replicate the design in the associated text box. One replicate doubles the number of runs.

**Viewing the Design Table
**

Now you have a data table that outlines your experiment, as described in Figure 13.10.

222

13 Response Surface Designs Creating a Response Surface Design

Figure 13.10 The Design Data Table The column called Pattern identifies the coding of the factors. It shows all the codings with “+” for high, “–” for low factor, “a” and “A” for low and high axial values, and “0” for midrange. Pattern is suitable to use as a label variable in plots because when you hover over a point in a plot of the factors, the pattern value shows the factor coding of the point. Runs are in a random order. This script allows you to easily fit a model using the values in the design table. There are two center points per replicate.

The name of the table is the design type that generated it.

The Y column is for recording experimental results.

**Continuing the Analysis, If Needed
**

After creating and viewing the design table, running the experiment, and recording your results in the design table’s Y column, run a fit model analysis on the data. The data table contains a script labeled Model. Right-click it and select Run Script (Figure 13.11) to fit the model. Figure 13.11 Running the Script

After clicking Run Model in the dialog, (Figure 13.12), review the analysis.

13 Response Surface Designs Creating a Response Surface Design 223 13 DOE: Response Surface Designs Figure 13.12 Fitting the Model .

.

power. . and power. JMP computes the third. The calculations assume that there are equal numbers of units in each group. estimate of within-group variance. if they are balanced. “Will I detect the group differences I am looking for. the result is a plot of the other two. This capability is available for the single sample. difference to detect. given my proposed sample size. and alpha level?” In this type of analysis. If you supply two values. given that the true effect size is at least a certain size. The Sample Size and Power command determines how large of a sample is needed to be reasonably likely that an experiment or sample will yield a significant result. You can supply either one or two of the three values. and a number-of-parameters adjustment is specified. You can apply this platform to more general experimental designs. If you supply only one of these values. you must give JMP an estimate of the group means and sample sizes in a data table as well as an estimate of the within-group standard deviation (σ). and computes the third for the following cases: • difference between one sample's mean and a hypothesized value • difference between two samples means • differences in the means among k samples • difference between a variance and a hypothesized value • difference between one sample proportion and a hypothesized value • difference between two sample proportions • difference between counts per unit in a Poisson-distributed sample and a hypothesized value. two-sample. Prospective Power Analysis The following five values have an important relationship in a statistical test on means: • Alpha Alpha is the significance level that prevents declaring a zero effect significant more than alpha portion of the time. Using the Sample Size and Power command when doing a prospective analysis helps answer the question. These depend on the significance level—alpha—of the hypothesis test for the effect and the standard deviation of the noise in the response.14 Prospective Power and Sample Size Use the DOE > Sample Size and Power command to answer the question “How many runs do I need?” The important quantities are sample size. and the magnitude of the effect. and k-sample situations. It requires that you enter any two of three quantities. sample size.

The Sample Size and Power command in JMP helps estimate in advance either the sample size needed. When you select DOE > Sample Size and Power. or the effect size expected in the experimental situation where there is a single mean comparison. a two sample comparison. or when comparing k sample means.1 Sample Size and Power Choices One-Sample and Two-Sample Means After you click either One Sample Mean. the panel shown in Figure 14. . or samples) are involved in the experiment.1 appears with button selections for experimental situations. The following sections describe each of these selections and explains how to enter estimated parameter values and the desired computation. the Power and Sample Size dialog in Figure 14.1).226 14 Prospective Power and Sample Size One-Sample and Two-Sample Means • Error Standard Deviation Error Standard Deviation is the unexplained random variation around the means. or Two Sample Means in the initial Sample Size selection list (Figure 14. • Power Power is the probability of declaring a significant result. • Effect Size Effect size is how different the means are from each other or from the hypothesized value.2 appears and asks for the anticipated experimental values. As an example. power expected. The values you enter depend on your initial choice. Figure 14. • Sample Size Sample size is how many experimental units (runs. consider the two-sample situation.

• Sample Size Sample size is the total number of observations (runs. as shown here. This is usually done by .14 Prospective Power and Sample Size One-Sample and Two-Sample Means 227 14 DOE: Power And Sample Size Figure 14. the power and sample size platform constructs a plot that shows the relationship between those two values: • power as a function of sample size. experimental units. • Error Std Deviation Error Std (Standard) Deviation is the true residual error. For single sample problems this is the difference between the hypothesized value and the true value. Sample Size. The Power and Sample Size dialog asks for the values depending the first choice of design: • Alpha Alpha is the significance level.2 Initial Power and Sample Size Dialogs for Single Mean (left) and Two Means (right) The Two Sample Means choice in the initial Power and Sample Size dialog always requires values for Alpha and the error standard deviation (Error Std Dev). but the total over all groups. The power and sample size platform then calculates the missing item. in a three-factor two-level design with all three two-factor interactions. usually 0. In a multi-factor balanced design. • Difference to Detect Difference to detect is the smallest detectable difference (how small a difference you want to be able to declare statistically significant). the number of extra parameters is five—two parameters for the extra main effects. Even though the true error is not known. given a sample size • effect size as a function of sample size. which you need to adjust to real units. or samples). Sample size is not the number per group. and Power. Computed sample size numbers can have fractional values.05. If there are two unspecified fields. In practice. given specific effect size • power as a function of effect size. and three parameters for the interactions. • Extra Params Extra Params (Parameters) is only for multi-factor designs. it isn’t very important what values you enter here unless the experiment is in a range where there is very few degrees of freedom for error. for a given power. there are other factors with the extra parameters that can be specified here. the power calculations are an exercise in probability that calculates what might happen if the true values were as specified. in addition to fitting the means described in the situation. Leave this field zero in simple cases. For example. This implies willingness to accept (if the true difference between groups is zero) that 5% (alpha) of the time a significant difference will be incorrectly declared. and one or two of the other three values: Difference to detect.

229. See the section. for an illustration of this animation script. You should go for powers of at least 0. Power is equal to alpha when the specified effect size is zero. • Continue Evaluates at the entered values. For example.4. If an experiment requires considerable effort. but the cost is higher in sample size. • Animation Script The Animation Script button runs a JSL script that displays an interactive plot showing power or sample size. leave both Sample Size and Power empty and click Continue. leave Power missing in the dialog and click Continue.3. as shown in the left-hand dialog in Figure 14. shows a range of sample sizes for which the power varies from about 0.” p.4. plan so that the experimental design has the power to detect a sizable effect. . The dialog on the right in Figure 14. the plot on the right in Figure 14. Double click on the horizontal axis to get any desired scale. Change the range of the curve by changing the range of the horizontal axis. “Power and Sample Size Animation for a Single Sample.99998. rounding to 1. shows the power is calculated to be 0. To calculate the power when the sample size is 10.3 A One-Sample Example To see a plot of the relationship of power and sample size. • Back Goes back to the previous dialog. which gives a more typical looking power curve.90 or 0. Single-Sample Mean Suppose there is a single sample and the goal is to detect a difference of 2 where the error standard deviation is 0. when there is one.95.95 if you can afford it. Bigger power is better. has the horizontal axis scaled from 1 to 8.9. • Power Power is the probability of getting a statistic that will be declared statistically significant. The left-hand graph in Figure 14. Figure 14.2 to 0.3.228 14 Prospective Power and Sample Size One-Sample and Two-Sample Means increasing the estimated sample size to the smallest number evenly divisible by the number of groups.

5 Plot of Power by Difference to Detect for a Given Sample Size Power and Sample Size Animation for a Single Sample The Animation Script button on the Power and Sample Size dialog for the single mean displays an interactive plot that illustrates the effect that changing the sample size has on power.4 (the difference to be detected).4 A One-Sample Example Plot When only Sample Size is specified (Figure 14.4. Alpha is 0.6. The probabil- . a plot of power by difference appears. Sample Size is 10.05.14 Prospective Power and Sample Size One-Sample and Two-Sample Means 229 14 DOE: Power And Sample Size Figure 14. and the Difference to Detect is set to 0. Figure 14. and another with mean at 0. In the example shown in Figure 14.5) and Difference to Detect and Power are empty. The animation begins showing a normal curve positioned with mean at zero (representing the estimated mean and the true mean).

leave both Sample Size and Power blank and examine the resulting plot. is shaded in blue on this plot. . Also. the Difference to Detect is the difference between two means. often represented as β in literature.7. The comparison is between two random samples instead of one. as shown in the dialog on the left in Figure 14. To find out how large. Suppose the error standard deviation is 0. The crosshair tool estimates that a sample size of about 35 is needed to obtain a power of 0. 0.7. You can drag the handles over the curves drag to see how their positions affect power.9 (as before). shown on the right in Figure 14. you can click on the values for sample size and alpha showing beneath the plot to change them. Figure 14. To increase the power requires a larger sample.5433.6 Example of Animation Script to Illustrate Power Two-Sample Means The dialogs work similarly for two samples. This is considerably lower than in the single sample because each mean has only half the sample size.9.230 14 Prospective Power and Sample Size One-Sample and Two-Sample Means ity of committing a Type II error (not detecting a difference when there is a difference). Leave Power blank and click Continue to see the power calculation. the desired detectable difference is 1. and the sample size is 16.

The next example considers a situation where 4 levels of means are expected to be about 10 to 13.236. When a sample size of 16 is entered the power calculation is 0. In this case it is the square root of (–1.95.9. This confirms that a sample size of 16 looks acceptable. the power and sample size calculations produce the power curve shown on the right in Figure 14. which is the square root of 5.5)2+ (–0. .5)2+(0.5)2. calculated as square root of the sum of squared deviations from the grand mean. If both Sample Size and Power are left blank. and the Error Std Dev is 0.7 Plot of Power by Sample Size to Detect for a Given Difference k-Sample Means The k-Sample Means situation can examine up to 10 kinds of means. as shown in the dialog on the left in Figure 14.8. Notice that the difference in means is 2.5)2+(1.8.14 Prospective Power and Sample Size k-Sample Means 231 14 DOE: Power And Sample Size Figure 14.

nist.05 and power of 0. Suppose you want to detect an increase of 55 for a baseline variance of 100. entered as the Baseline Variance. To indicate direction of change.99. When you click Continue.8 Prospective Power for k-Means and Plot of Power by Sample Size One-Sample Variance The One-Sample Variance choice on the Power and Sample Size dialog (Figure 14.gov/div898/handbook.232 14 Prospective Power and Sample Size One-Sample Variance Figure 14. you enter two of the items and the Power and Sample Size calculations determines the third. In the dialog. select either Larger or Smaller from the Guarding a change menu. You can access the NIST manual examples at http://www. the computed result shows that you need a sample size of 170. alpha level. As with previous dialogs. The examples throughout the rest of this chapter use engineering examples from the online manual of The National Institute of Standards and Technology (NIST).1) determines sample size for detection of a change in variance. The computations then show whether the true variance is larger or smaller than its hypothesized value. and direction of change you want to detect. with an alpha of 0. . Enter these items as shown in Figure 14.itl.9.The usual purpose of this option is to compute a large enough sample size to guarantee that the risk of accepting a false hypothesis (β) is small. An example is when the variance for resistivity measurements on a lot of silicon wafers is claimed to be 100 ohm-cm and a buyer is unwilling to accept a shipment if variance is greater than 55 ohm-cm for a particular lot. specify a baseline variance.

10). the Baseline Proportion you enter is the average of the two sample proportions. For a one-sample situation. enter a negative amount in the Difference to Detect box. The dialogs are the same except you enter Baseline Proportion and also specify either a one-sided or two-sided test. Figure 14. .9 Sample Size and Power Dialog To Compare Single-Direction One-Sample Variance One-Sample and Two-Sample Proportions The dialogs and computations to test power and sample sizes for proportions are similar to those for testing sample means. When there are two samples. not the standard deviation. but the computations to determine sample size and test proportions use a normal approximation. the Baseline Proportion is the average of a known baseline proportion and the single sample proportion. Note: Remember to enter the variance in the Baseline Variance box. The sampling distribution for proportions is actually binomial. as indicated on the dialogs (Figure 14.14 Prospective Power and Sample Size One-Sample and Two-Sample Proportions 233 14 DOE: Power And Sample Size If you want to detect a change to a small variance.

10 Default Power and Sample Dialog for One-Sample and Two-Sample Proportions Enter average of baseline and one-sample proportions.08.or two-sided) Testing proportions is useful in production lines.The manager does not want to stop the process unless it has degenerated greater than 16% defects (6% above the 10% known baseline defective). Enter 1 or 2 to indicate the type of test (one. or average of two-sample proportions.11 Dialog To Compare One Proportion to a Baseline and Sample Size Plot . For example. where proportion of defects is part of process control monitoring.11 shows the entries in the Sample Size and Power dialog to detect a given difference between an observed proportion and a baseline proportion. The Baseline Proportion in this example is 0. Use the grabber tool (hand) to move the x-axis and show a specific range of differences and sample sizes. Figure 14. The example process is monitored with a one-sided test at 5% alpha and a 10% risk (90% power) of failing to detect a change of that magnitude.234 14 Prospective Power and Sample Size One-Sample and Two-Sample Proportions Figure 14. leave both Difference to Detect and Sample Size blank. Figure 14.11. which is the average of the baseline (10%) and the proportion above the baseline (6%). To see the plot on the right in Figure 14. suppose a line manager wants to detect a change in defective units that is 6% above a known baseline of approximately 10% defective. and the computed sample size of approximately 77.

The process meets the target if there are less than 48 defects (6 defects per wafer in a sample of 8 wafers).14 Prospective Power and Sample Size Counts per Unit 235 14 DOE: Power And Sample Size Counts per Unit The Counts per Unit selection calculates sample size for the Poisson-distributed counts typical when you can measure more than one defect per unit. Figure 14. In this kind of situation. As an example. Enter the Difference to Detect in terms of the baseline count per unit (defects per unit). which is the chance of detecting a change larger than 2 (6 defects per wafer). Although the number of defects observed in an area of a given size is often assumed to have a Poisson distribution. consider a wafer manufacturing process with a target of 4 defects per wafer and you want to verify that a new process meets that target. Enter a power of 0. Enter an alpha of 0.1 to be the chance of failing the test if the new process is as good as the target.9. Click the Counts per Unit button. Select DOE > Sample Size and Power. Questions of interest are: • Is the defect density within prescribed limits? • Is the defect density greater than or less than a prescribed limit? Enter alpha and the baseline count per unit.128 (Figure 14. 5 Click Continue to see the computed sample size of 8. The test is for one-sided (one-tailed) change. Then enter two of the remaining fields to see the calculation of the third. The computed sample size is expressed in those units. A unit can be an area and the counts can be fractions or large numbers. the area and count are assumed to be large enough to support a normal approximation. alpha is sometimes called the producer’s risk and beta is called the consumer’s risk.12).12 Dialog For Counts Per Unit Example 1 2 3 4 .

2 Click the Sigma Quality Level button. 3 Enter 50 for the number of defects and 1.5.236 14 Prospective Power and Sample Size Sigma Quality Level Sigma Quality Level Use the Sigma Quality Level feature. Figure 14. The results. by entering any two of the following three quantities: • number of defects • number of opportunities • Sigma quality level When you click Continue.4.13 Sigma Quality Level Example 1 If you want to know how many defects reduce the Sigma Quality Level to “six-sigma” for 1. 4 Click Continue. use the Sample Size and Power feature to compute the Sigma quality level for 50 defects in 1. .000 opportunities.14). The computation (window to the right in Figure 14. as shown in the window to the left in Figure 14. accessed by selecting DOE > Sample Size and Power.13. enter 6 as the Sigma Quality Level and leave the Number of Defects blank (window to the left in Figure 14.13.000 as the number of opportunities.000 opportunities: 1 Select DOE > Sample Size and Power.3. are a Sigma quality level of 5. as shown in the window on the right in Figure 14.000. the sigma quality calculator computes the missing quantity using the formula Sigma Quality Level = NormalQuantile(1 – defects/opportunities) + 1.14) shows that the Number of Defects cannot be more than approximately 3.000. As an example.000.

14 Sigma Quality Level Example 2 .14 Prospective Power and Sample Size Sigma Quality Level 237 14 DOE: Power And Sample Size Figure 14.

.

89 ARIMA 143. 221 AIC 151 AIC 104 Akaike’s Information Criterion 104.jmp 216–217 Bounded 157 Box Plots command 62. 151 aliasing effects 207 All Graphs command 65 All Pairs. 227 Alpha Amalyze 21 analysis of variance report 47.jmp 216 Bounce Factors. 89 Analyze menu 22. 219 See also Response Surface designs Box-Jenkins model see ARIMA Braces. 154–156 Arrange Plots 186 assigning importances (of responses) 195. 92 table 53. 181 producing 171 Bar Chart command 183 bar chart of correlations 167 Bartlett’s test 64 BIC 151 bivariate normal density ellipse 164 blue diamond disclosure icon 53 Border 184 Bounce Data. 60. 210. 79 Analyze Toolbar 13–14 animation scripts 228 Annotate tool 28 annotating 28 resizing and repositioning 28 ANOVA 60 Display Options command 62 JMP INTRO terms 60 one way 46 popup menu 62 report 47 table 53. 65 Box-Behnken designs 215–216.jmp 131 Brown smoothing 157 Brown-Forsythe test 64 By role 38 .Index JMP-SE Symbols “F Ratio” 104 “Prob>F” 104 Numerics –2LogLikelihood 151 95% bivariate normal density ellipse 164 A aberration designs 205 acceptable values See lower limits and upper limits activating toolbars 14 Actual-by-Predicted plots 212 Add button 86 Add Column button 95 Add Graphics Script command 184 additional runs 196. 202 Autocorrelation 149 autocorrelation 148–149 Autocorrelation Lags 147 Autoregressive Order 155 axial points 215 scaling. 72. Tukey Kramer command 62 Alpha 225. central composite designs 220 B Background Color command 184 Backward 103 bar chart 172. 45.

66 reports 66 Contingency Table command 67 continuous factors 203 continuous variables 29. 104 Cp 104 . 125–126 Color or Mark by Column command 49 colors and markers 49 Colors command 184 Column Info 96 Columns command in reports 34 Compare Means command 50.jmp 107. 66 analysis 45. 212 resolution numbers 204 confounding pattern 208 Connect Color command 187 Connect Means command 66 Connect Points option 174. 186 Connect Through Missing 186 Connecting Lines 149 constant estimate 152 Constrain fit 155 contingency table 54. 62 comparison circles 50. 30. 63 interpretation 63 Comparison Circles command 66 Confid Curves Fit command 59 Confid Curves Fit option 52 Confidence Interval command 24. 183. 46 graphs and reports 30 popup menu 30 Contrast dialog 84 contrasts 84 Control Charts c 132 Individual Measurement 127 Moving Range 127 np 130 p 130 R 125 S 125 Shewhart 124–133 u 131 XBar 125 Copy command 18. 37 Confidence Intervals 155–156 confidence intervals in ANOVA 50 in linear regression 52 mean 23. 181 Frame Options 183 Level Options 184 Platform options 185 Single-Chart Options 182 chart platform 172 changing all levels 173 chart types 185 Chi Square statistic 35 clipboard 180 Close command 19 Coating. 219–220 See also response surface designs Chakravarty 205 changing individual levels 172 Chart launch dialog 181–182 Chart platform 171. 180 corrected total 89 correlation 161–170 correlation coefficient 53 correlation matrix 163 Correlation of Estimates command 95 Correlations Multivariate 163 Cotter designs 206 count 34 Count Axis command 31 counts per unit (power and sample size) 235 covariance 161–170 Covariance Matrix 166 Cp 41.240 Index JMP-SE C C Total 89 Capability Analysis command 39 with Control Charts 115 Categorical factors 203 categorical probabilities testing 36 categorical variables 29 graphs and reports 34 Caustic Soda 21 CCD See central composite designs c-Chart 132 CDF plot command 33 center points central composite designs 215 response surface designs 215 central composite designs 215. 36 score 37 selecting level 37 confidence limits in linear regression 59 confounding 207.

Index JMP-SE 241 dragging in 46 Difference to Detect option 227. 86 crossed effect 86 crosstabs table 66 cumulative distribution function 33 cumulative logistic probability plot 67 cumulative probabilities 34 Current Estimates table 102 Custom 156 Custom Test command 94 cut and paste 18. Student’s t command 62 Edit Formula 96 effect aliasing 207 eigenvalue 218 eigenvector 218 size 226 sparsity 199. 22. 171 details 21 Density Axis command 31 Density Ellipse 164–165. 195. 45. 171. 152. 180 Index disclosure icon 53 DispayBox command 184 Display Options command 62. 79. 167 Density Ellipse command 53. 157 evolution 156 Expanded Estimates command 93 . 66 defects 235 Denim. 230 Differencing Order 155 Direction 102 Cpk 41 Cpm 41 Cross button 82. 61 density functions 33 descriptive statistics 22 design resolutions 204 designs aberration 205 Box-Behnken 215–216. 199 full factorials 204 minimum aberration 205 mixed-level 205 orthogonal screening designs 204 surface designs 220 orthogonal arrays 205 Plackett-Burman 205 response surface 215 screening 199 uniform precision 220 desirability values 202 DF 151 DFE 104 dialog boxes Distribution platform 16–17. 24 launching 22 report 25 DOE simple examples 199 double exponential smoothing 157 drag 147.jmp 21. 21 graphs 29 launch dialog 17. 71. 205–206 effect details 93 Effect Leverage Pairs 97 Effect Leverage personality 88 Effect Screening personality 88 Effect Test table 91 effects nonestimable 204 orthogonal 220 eigenvalue of effect 218 eigenvector of effect 218 Ellipse Alpha 166 Ellipse Color 166 Enter All 103 Entered 103 equal variances in t test 47 Error Bars command 62 error SS 90 error standard deviation 226–227 Estimate 103. 28. 65 D damped-trend linear exponential smoothing 158 data table opening 15 Data Table Window 39. 155. 219 central composite 215 fractional factorials 204 full factorial 189. 29. 166 dummy variables 93 Dunnett’s comparisons 62 Durbin-Watson Test 96 E Each Pair.

170 Holt smoothing 158 homogeneity of variances 64 honestly significant difference 62 Horizontal command 185 Horizontal Layout command 17. 202 independent variables 55 Individual Confidence Interval 96 . 181 F Ratio 47 F test 90 Factor 152 Factor Profiling option 193 factorial designs fractionals 204 full 189.jmp 99 fitting lines 51 fitting personality 85. 58 Fit Model dialog 79–80.242 Index JMP-SE exponential smoothing see Smoothing Models extra parameters 227 full factorial designs 189. 175 Graph toolbar 13–14 Graphs tab 13 Group By command 54. 61 group variances homogeneity 64 grouping variable 54. 87 Fixed 157 Forecast Periods 147. 195. 18. 180 File tab 13 File/Edit toolbar 13–14 Fit Distribution 26. 204 three level 205 Factorial Sorted macro 87 Factorial to Degree macro 86 factors categorical 203 continuous 203 key factors 199 false negatives 206 Fat Plus (selection) tool 18. 86 F “F Ratio” 104 G general linear model 79 Go 101. 30 hypothesized means specifying 25 I identifying key factors 199 importance of responses 195. 61. 199. 22. 44 Fit Line command 52. 24 bar position 23 bar widths 22 red bracket (box plot) 32 using 22 Histogram command 30 Hoeffding’s D 168. 173. 85 platform 79 Fit Model platform 100 examining results 81 launching 79 Save 96–97 Fit Polynomial command 59 Fit Special command 61 Fit Y By X platform 45 launching 45 Fitness. 204 examples 189 Full Factorial macro 83. 58 Fit Mean command 51. 195. 54. 199. 103 goal types 195. 202 goals matching targets 202 minimizing and maximizing 202 Goodness of Fit 33 Grand Mean command 65 Graph 149 Graph menu 171. 153 Forecast plot 153 Formula command 56 Formula Editor 56 Forward 102 fractional factorial designs 204 Freq button 86 frequencies table 34 frequency 34 H hand tool in Distribution platform 22 with Distribution platform 31 help system 11 Help tool 12 histogram 16.

170 Invertible 151 Index J JMP Starter 12–13 JSL (JMP Scripting Language) animation scripts 228 K Keep the Same command 196. about 12 Inverse Corr table 163 inverse correlation 163. 90 error 90 table 60 Lag 152 Lasso tool 48 leaf values 32 least squares means 82. 221 Kendall’s Tau 168 Kendall’s tau-b 169 k-Sample Means (power and sample size) 231 linear contrasts 84 linear exponential smoothing 158 linear regression 51. 202 maximizing goals 202 mean 16. adding 82 interactions 206 high-order 204 Intercept 155 intercept 152 interquartile range 31 Introduction sections. 93 least squares regression 51. 58. 58 confidence limits 59 Lock 103 Log function 57 Log10 function 57 Logistic platform 55 Logistic Plot command 68 logistic regression 67 Lognormal 33 Long-term sigma 39 Lot Number column 21 Lower Spec Limit 39 LSMeans 82. 71 interpreting the scatterplot 73 launching 72 preparing the data 71 matching target goals 195. 79 legends with colors and markers 49 Level Midpoints command 38 Level Numbers command 38 Level Options command 185 level smoothing weight 156 Levene’s test 64 leverage plot 91–92 whole model 92 .Index JMP-SE 243 likelihood ratio tests 36 Line Chart command 183 Line of Fit command 65 Line Style 187 Individual Measurement Chart 127 inertia of Scroller tool 82 Inscribe option 220 interaction effect. 93 LSMeans Plot 82 LSMeans Plot command 93 LSMeans Student’s t command 93 LSMeans Table command 93 M macros 83 Macros drop-down list 86 Make Model 103–104 Mallow’s Cp criterion 104 Marker Drawing Mode 184 Marker Size command 184 Markers command 184 marking points 48 matched pairs 47 plot interpretation 75 scatterplot 73 Matched Pairs platform 47. 30 confidence interval 30. 36 specifying hypothesized 25 test 35 testing 24 L L18 Chakravarty 205 L18 Hunter 205 L18 John 205 L36 205 Label column 211. 93 LSMeans Contrast command 84. 18. 202 maximize responses 195. 210. 61. 222 lack of fit 59.

32 Normal Quantile Plot command 25. 45 Moments command 30 Moments table 36 More Moments command 30 mosaic plot 54.244 Index JMP-SE Mean CI Lines command 65 Mean Confidence Interval 96 Mean Error Bars command 65 Mean Error Bars option 48 Mean Line 149 Mean Lines command 65 Needle Chart command 183 Needle option 186 Needle Plot command 172 nested effect 86 New Column means one and two sample 226 Means and Std Dev command 48 means diamonds 31. 217. 50 Means Diamonds command 62. 35 Normal 33 Normal 26 normal density ellipse 164 normal quantile plot 25. 202 minimizing goals 202 minimum aberration designs 205 missing value 163 missing values 167 Mixed 103 mixed-level designs 205 Mixture Response Surface macro 87 Model Comparison table 151 model effects 86 Model script Model Specification dialog 211. 64 Normal Quantiles command 38 normality 25 np-Chart 130 Number of Forecast Periods 150 number of runs screening designs 204 O O’Brien’s test 64 OC Curves 118 Oil1 Cusum. 18. 62 Means/Std Dev/Std Err command 62 median 16. 66 Mosaic Plot command 67 Moving Average Order 155 Moving Range Chart 127 MSE 104 multiple comparison tests 62–63. 65 Means Dots command 62 Means/Anova/t test command 50. 163 Multivariate platform 161–170 command 57 nominal logistic regression see Logistic platform nominal variables 29 nominal/ordinal by continuous fit see Logistic platform nonconforming unit 43 nonestimable effects 204 Nonparametric Correlations 168 Nonparametric Measures of Association table 168 nonparametric tests 25. 210. 202 nDF 104 one way analyses 47 one way ANOVA 46 one-sample and two-sample means 226 one-sample proportion (power and sample size) 233 one-sample variance (power and sample size) 232 opening data tables 15 order of runs 196.jmp 138 On Face option 220 N N Responses button 195. 31. 222 model sum of squares 90 Model Summary table 151. 31–32 Median rank scores 64 Method column 21 Minimal Report personality 88 minimize responses 195. 155 Modeling tab 13 modeling type 29. 221 ordinal variables 29 orthogonal array designs 205 orthogonal designs screening designs 204 surface designs 220 Orthogonal option 220 Other 166 outlier box plot 30–31 . 93 multiple regression example 99–105 Multivariate 161.

jmp 127 Pie command 185 Plackett-Burman designs 205 Plot Actual by Predicted 96 Plot Actual by Quantile command 65 Plot Effect Leverage 96 Plot Quantile by Actual command 65 Plot Residual By Predicted 96 Plot Residual By Row 96 Plot Residuals command 53 . 91 Parameter Estimates table 152 parameters. q parameters 155 Pairwise Correlations 163 Pairwise Correlations table 167 Parameter 103 parameter estimates 60 with fitted lines 52 Parameter Estimates Table 60. 81. 35 Probability Labels command 65 process capability ratio 41 product-moment correlation 167. 211.Index JMP-SE 245 plots Actual-by-Predicted 212 Point Chart command 183 points axial 215 colors and markers 49 Points command 65 Points Jittered command 66 Points Spread command 66 Poisson-distributed counts 235 polynomial effect 87 Polynomial to Degree macro 87 power analyses 226 in statistical tests on means 226 one-sample and two-sample means 227–228 Power Analysis command 93 power and sample size calculations 225–237 animation 229 counts per unit 235 k-sample means 231 one-sample and two sample proportions 233 one-sample mean 228 one-sample variance 232 sigma quality level 236 two-sample means 230 Ppk Capability Labeling 40 Predicted Values 96 predicted values saving 58–59 prediction variances 220 Prediction Formula 96 prerequisites to using JMP INTRO 11 Press 96 Print command 18 printing reports 18 Prob Axis command 31 Prob Scores command 38 Prob to Enter 102 Prob to Leave 102 Prob>|t| 152 “Prob>F” 104 probabilities testing 26. 187 Overlay option 185–186 Overlay Pattern command 183 Overlay Plot platform 173. 185 Connect Points option 174 platform options 186 single-plot options 186 Y Options 174 Y Options command 186 P p value 35. 201. 47. d. 169 Pen Style command 183 Periods Per Season 156 personality 99 Pickles. 169 proportions (power and sample size) 233 Pumice Stone 21 pure error 90 Index outliers 31 outside effect 86 overlap marks 50 Overlay Color command 183 Overlay Marker Color 187 Overlay Marker command 183. extra 227 Partial Autocorrelation 149 partial autocorrelation 148 Partial Corr table 164 partial correlation 164 Paste command 19. 85 p. 180 Pattern column 197. 222 patterns confounding 208 p-Chart 130 Pearson Chi Square test 36 Pearson correlation 167.

202 lower limits 202 upper limits 202 results annotating 28 revealing columns in reports 34 RMSE 193. 210. 221 screening designs 204 quantile-quantile plot 31 quantiles 32 Quantiles command 30.jmp 21.jmp 189 red bracket (box plot) 32 red triangle popup menus 17 Redo Analysis command 66 regressor columns 206 Remove 147 Remove 85 Remove All 103 Remove button 82 Remove Fit command 53 reports setting titles 80 requesting additional runs 196. 210. 171 details 21 sample means 226 Sample Size and Power command 225 sample sizes example comparing one proportion to baseline and sample size plot 234 example comparing single-direction one-sample variances 232 example with counts per unit 235 one and two sample means 227 . 212 root mean square error 59. 89 Rotatable option 220 Row Colors command 49 Row Markers command 49 row states 23 RSM (Response Surface Methodology) 218 RSquare 104. 221 re-running an analysis 82 rescaling designs 220 Residual Statistics 154 Residuals 96 residuals 53. 79. 219 desirability values 202 goals 195. 29. 62 quartiles 31 R r 53 R2 59. 71. 45.jmp 189–190 Reactor Response. 210. 48. 221 order they appear in table 196.246 Index JMP-SE p-value 36 Q q-q plot 31 Quantile Box Plot command 32 examples 216–223 introduction 218 purpose 215 reports 217 Response Surface Effect macro 87 Response Surface Methodology (RSM) 218 response surface models 87 responses custom designs 202. 210. 210.jmp 189 Reactor Factors. 221 randomizing runs 196. 88 runs additional 196. 89 R2 adjusted 59. 221 Range option 186 Range Plot command 186 Ranks Averaged command 38 Ranks command 38 R-Chart 125 Reactor 32 Runs. 89 Random Effect 99 Randomize within Blocks 210. 153 plotting 53 saving 58 resolution numbers 204 resolutions of designs 204 response surface designs S sample autocorrelation function 149 sample data Denim. 221 requesting additional 196. 151 Rsquare 89 Adj 89 RSquare Adj 104 Run Charts 114 Run Model 100 Run Model button 84.

error 226 Standardize command 38 standardized values 38 star points 215 starting JMP INTRO 12 statistical tests 34 Statistics 171 Statistics button 181 Std Dev Lines command 62. 221 Sort Right to Left 196. 186 prospective power analysis 226 screening designs 189 Sand Blasted? column 21 Save 40 Save Centered command 65 Save Columns 154 Save commands 37 Save Normal Quantiles command 65 Save Predicted Values 96 Save Script 38 Save Script for All Objects command 66 Save Script to Data Table command 66 Save Script to Report command 66 Save Script to Script Window command 66 Save Standardized command 65 SBC 151 scaling axial 220 designs 220 scatterplot 45–46.Index JMP-SE 247 shortest half 32 Show Center Line 127 Show Confidence Interval 153–154 Show Correlations 165 Show Histogram 165 Show Points 149. 155–159 smoothing weight 156 Sort Left to Right 196. 58 Scatterplot Matrix 164 scatterplot matrix 163 S-Chart 125 Schwartz’s Bayesian Criterion 151 score confidence intervals 37 screening designs 199 design types 204 Script 38 Script submenu 66. 30 testing 35 standard deviation.jmp 132 Short Term. 25. 185–186 scripts animation 228 generating the analysis model Model script See Model table property scripting See JSL Scroller tool 81 seasonal exponential smoothing 158 seasonal smoothing weight 156 Select Columns list 86 select rows in data table 161 selecting and marking points 48 selecting report items 18. 153–154 Show Points command 58. Grouped by fixed subgroup size 40 Index sigma 39–40 Sigma Quality 41 sigma quality level (power and sample size) 236 signed-rank test 35 significance probability 167 stepwise regression 99 simple exponential smoothing 157 single-sample means (power and sample sizes) 228 Size/Scale commands 184 Smoothing Model dialog 156 smoothing models 143. 210. 180 selection tool 18. 18. 210. 205–206 Spearman’s Rho 169 Spearman’s Rho 168 Spec Limits 40 Specified Sigma 40 specifying hypothesized means 25 Split command 71 Split command selecting rows 71 SS 104 SSE 104 Stable 151 Stable Invertible 156 Stack 38 Standard Deviation 151 standard deviation 16.jmp 143 Set Alpha Level command 51. 65 Std Dev Lines option 48 Std Err Bars command 31 Std Error 152 . effect 199. 64 setting titles windows and reports 80 Shewhart Control Charts 124–133 Shirts. 180 Separate Axes command 185–186 Seriesg. Grouped by Column 40 Short Term. 183. 221 sparsity.

89 sums of squares 89 surface designs See response surface designs T t Ratio 152 t statistic 35 t test 25.248 Index JMP-SE Std Error of Individual 97 Std Error of Predicted 96 Std Error of Residual 97 StdErr Prob 34 Stem and Leaf command 32 Step 101. 230 two-sample proportion (power and sample size) 233 two-way contingency table 54 U u-Chart 131 Unconstrained 156–157 UnEqual Variances command 64 Ungroup Plots 186 command 24. 35–36. 35 dialog 25 Test Probabilities command 27. ordinal. and nominal 29 . 103 Step command 186 Step History table 102 stepwise regression 100 Control panel 102–103 Stop 103 Studentized Residuals 96 Sum of Squared Errors 151 Summary of Fit table 59. 36 table 36 Test Std Dev command 35 testing a mean 24 testing for independence 55 testing probabilities 26 scaling estimated values 27 Tests command 67 Thread Wear column 21 Thread Wear Measured column 21 Time ID role 147 Time Series 143 Time Series Graph 149 Time Series platform 143–159 ARIMA 154–155 uniform precision designs 220 Uniform Scaling 38 Uniform Y Scale 186 Univariate 163 Upper Spec Limit 39 Use Median 127 User Defined option 220 using histograms 22 V values target 202 Van der Waerden 64 variables categorical 29 continuous. 46–47 report 46 two sample 46–47 Tables menu Split command 71 Tables tab 13 Tables toolbar 14 Tag Line option 28 Target 39 target values 202 Term 152 Test Mean commands 148–150 example 143–148 smoothing models 155–159 Time Series Plot 148 Time Series role 147 titles setting in windows and reports 80 toolbars 13 showing and hiding 14 Tools toolbar 13–14 trade-off in screening designs 204 trend 156 Try Fixed Width 179 Tukey-Kramer HSD 62 tutorial examples DOE 199–201 full factorial designs 189 multiple regression 99–105 response surface designs 216–223 time series 143–148 two-level categorical 200 two-level fractional factorials 204 two-level full factorials 204 two-sample and one-sample means 226.

jmp 130 Weibull 33 Weight button in Fit Model 85 weight. importance 195. 35 Zero To One 156 . 202 Welch ANOVA 64 Westgard Rules 122 Where 38 whiskers 31 whole model 92 Whole Model Test table 68 Wilcoxon rank scores 64 Wilcoxon signed-rank test 25. Columns button 22 Z statistics 43 z test 25. 79–80. Hsu’s MCB command 62 With Control. Dunnett’s command 62 word processing program with cut and paste 28. 35 Window menu 23.Index JMP-SE 249 Index modeling type 29 standardized values 65 Variance Estimate 151 variance of prediction 220 variances equality in t test 47 Vertical command 185 W-Z Washers. 180 X role 147 X-Axis Proportional command 66 XBar Chart 125 Y button in Fit Model 85 Y Options command 185 Y role 147 Y. 26. 84 windows setting titles 80 Winter’s method 159 With Best. 82.

.

KEITH PACKARD DISCLAIMS ALL WARRANTIES WITH REGARD TO THIS SOFTWARE. TORT (INCLUDING NEGLIGENCE).’S LICENSOR’S HAS BEEN ADVISED OF THE POSSIBILITY OF SUCH DAMAGES. DATA OR PROFITS.’S LICENSORS MAKE NO WARRANTIES. IN NO EVENT SHALL NEIL HODGSON BE LIABLE FOR ANY SPECIAL. SAS INSTITUTE INC. Inc.’S LICENSORS DO NOT WARRANT.Notices Technology License Notices The ImageMan DLL is used with permission of Data Techniques.’S LICENSOR) BE LIABLE TO YOU FOR ANY CONSEQUENTIAL. THE ABOVE LIMITATIONS MAY NOT APPLY TO YOU. SAS INSTITUTE INC.00. . WHETHER IN AN ACTION OF CONTRACT. NEGLIGENCE OR OTHER TORTIOUS ACTION. THE ABOVE EXCLUSION MAY NOT APPLY TO YOU. INDIRECT OR CONSEQUENTIAL DAMAGES OR ANY DAMAGES WHATSOEVER RESULTING FROM LOSS OF USE. PRODUCT LIABILITY OR OTHERWISE WILL BE LIMITED TO $50. AND THE LIKE) ARISING OUT OF THE USE OR INABILITY TO USE THE SOFTWARE EVEN IF SAS INSTITUTE INC. NEIL HODGSON DISCLAIMS ALL WARRANTIES WITH REGARD TO THIS SOFTWARE. XRender is Copyright © 2002 Keith Packard. CURRENTNESS OR OTHERWISE. INCLUDING WITHOUT LIMITATION THE IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE.org>. LOSS OF BUSINESS INFORMATION.’S LICENSOR’S LIABILITY TO YOU FOR ACTUAL DAMAGES FOR ANY CAUSE WHATSOEVER. THE ENTIRE RISK AS TO THE RESULTS AND PERFORMANCE OF THE SOFTWARE IS ASSUMED BY YOU. EXPRESS OR IMPLIED. AND REGARDLESS OF THE FORM OF THE ACTION (WHETHER IN CONTRACT. EMPLOYEES OR AGENTS (COLLECTIVELY SAS INSTITUTE INC. ARISING OUT OF OR IN CONNECTION WITH THE USE OR PERFORMANCE OF THIS SOFTWARE. GUARANTEE OR MAKE ANY REPRESENTATIONS REGARDING THE USE OR THE RESULTS OF THE USE OF THE SOFTWARE IN TERMS OF ITS CORRECTNESS. INCLUDING ALL IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS. INCIDENTAL OR INDIRECT DAMAGES (INCLUDING DAMAGES FOR LOSS OF BUSINESS PROFITS. OFFICERS. BECAUSE SOME STATES DO NOT ALLOW THE EXCLUSION OR LIMITATION OF LIABILITY FOR CONSEQUENTIAL OR INCIDENTAL DAMAGES. ARISING OUT OF OR IN CONNECTION WITH THE USE OR PERFORMANCE OF THIS SOFTWARE.’S LICENSORS AND THEIR DIRECTORS. IN NO EVENT WILL SAS INSTITUTE INC. NEGLIGENCE OR OTHER TORTIOUS ACTION. BUSINESS INTERRUPTION. REGARDING THE SOFTWARE. ACCURACY. IN NO EVENT SHALL KEITH PACKARD BE LIABLE FOR ANY SPECIAL. SAS INSTITUTE INC. INDIRECT OR CONSEQUENTIAL DAMAGES OR ANY DAMAGES WHATSOEVER RESULTING FROM LOSS OF USE. RELIABILITY. WHETHER IN AN ACTION OF CONTRACT. THE EXCLUSION OF IMPLIED WARRANTIES IS NOT PERMITTED BY SOME STATES. Scintilla is Copyright 1998-2003 by Neil Hodgson <neilh@scintilla. INCLUDING ALL IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS. DATA OR PROFITS.

- SAS - Regression Using JMP
- JMP User's Guide
- JMP_Guide
- Statistics
- SAS for Time Series
- Sas Debugging Sas Programs
- Applying Data Mining Techniques Using SAS Enterprise Miner
- Jmp Introductory Guide
- JMP Stat Graph Guide
- [Howard Schreier] PROC SQL by Example Using SQL w(BookFi.org)
- R Handout Statistics and data analysis using R
- JMP
- sas eminer
- QC minitab.pdf
- Jmp Doe Guide
- Six Sigma Best Practices
- Numeric.docx
- QMM_2_6_2017.pptx
- Yauri et al , 2012.pdf
- OPRE-6301-02-03
- Data Models lecture
- Chapter 2 Review
- Curso Web y Data Mining 3 Predictive Modeling Using Logistic Regresion
- Root Cause Analysis
- 119316913 Basic Statistics Using SAS
- mth536 mod1 app histogram lesson
- Business Analytics Using SAS
- 2º_Principles of Probability and Statistics Assessment
- CHAPTER 1-2
- 310_spring2012_chapter8.pdf

Skip carousel

- An Introduction of Content Based Image Retrieval Process
- Fomc Minutes 20090128 Epa
- Fomc Minutes 20100623 Epa
- A Mixed Approach of Color Image Feature Enhancement
- Fomc Minutes 20111102 Epa
- Fomc Minutes 20110126 Epa
- Fomc Minutes 20091104 Epa
- Fomc Minutes 20101103 Epa
- Fomc Minutes 20090429 Epa
- Fomc Minutes 20100428 Epa
- Fomc Minutes 20110427 Epa
- Fomc Minutes 20090624 Epa
- Fomc Minutes 20110622 Epa
- A Review on Human Activity Recognition Using HOG Feature
- Fault Detection of A Hot Metal Body
- Study on Contrast Enhancement with the help of Associate Regions Histogram Equalization
- tmp948B
- Web Interaction Content Filtering And Security
- tmp9363.tmp
- Fomc Minutes 20100127 Epa
- tmpCD9E.tmp

Skip carousel

- tmp7391.tmp
- tmp6C8A.tmp
- Tmp 3371
- fcic_docs_moodys_financedefault_20100602.pdf
- The Influence of Corporate Culture on Organizational Commitment
- Tmp 2069
- Detection of DoS Attack Based on MCA and TAM.
- tmpD82B
- Tenth-Grade WASL in Spring 2006
- Business Statistics/Series-2-2011(Code3009)
- ifdp1178
- tmpA623.tmp
- The San Francisco Polar Bear Swim Study
- 68489_1980-1984
- RIEF June
- tmp698D
- Timeliness of Annual Report Releases in Relation to the Direction and Magnitude of Earnings and Share Price Revaluation
- tmp2537.tmp
- tmpADF8
- tmpF695
- tmp57E6.tmp
- frbrich_wp13-3.pdf
- tmpAE76.tmp
- tmp329E.tmp
- tmpE13F.tmp
- Ttr Met Gates
- tmp4E56.tmp
- tmpF678.tmp
- 64449_1965-1969
- 68456_1975-1979

Sign up to vote on this title

UsefulNot usefulClose Dialog## Are you sure?

This action might not be possible to undo. Are you sure you want to continue?

Close Dialog## This title now requires a credit

Use one of your book credits to continue reading from where you left off, or restart the preview.

Loading