Workshop on SPSS

Workshop on SPSS (Hands-On) for Beginners
by Assoc. Prof. Dr. Ernest Cyril De Run

Venue: Computer Laboratory, Faculty of Computer Science and Information Technology, West Campus, UNIMAS.
Date: 26 October 2007 (Friday)

The workshop will introduce participants to basic hands-on use of SPSS. It covers how to key in, compute, and transform data from questionnaires and interviews, and gives tips on using syntax. It also gives a basic explanation of how to use SPSS to run Means, ANOVA, MANOVA, Regression, Factor Analysis, Frequency, Correlation, and Crosstabulation. Please note that the workshop focuses on how to use SPSS and will not discuss the various methods of statistical analysis.

What is SPSS?
How to Open the Program
An Overview of the Program
How to Key in Interview-based Data
How to Key in Questionnaire-based Data and to Transform
Syntax
How to Analyze
   Frequency
   Crosstabulation
   Means
   ANOVA
   MANOVA
   Correlation
   Regression (Enter for confirmation, Stepwise for prediction)
   Exploratory Factor Analysis
SPSS ECDR 26.10.07
What is SPSS?

SPSS is computer software whose name stands for Statistical Package for the Social Sciences. It comes in various versions and with various add-ons. It is software, not a method of analysis. Therefore please do not state in your research paper that you used SPSS to analyze your data. You may state that you used this statistical package to run a particular analysis, such as ANOVA or any other method.

SPSS is widely used statistical and data management software. This is partly because it is simple to use, user friendly, and, unlike SAS, does not require you to write code. You may use code in Syntax, but that's another story. In most cases you can just copy and paste code from the SPSS output into Syntax, so you do not need to write your own code.

The output that SPSS presents is also simple and easy to understand, which is why it is so often copied directly and not properly presented for academic purposes. See the output example in Example of Output. SPSS also allows the use of graphs, which make presentations much clearer.
How to Open the Program

There are at least three ways to open the SPSS program on your computer. They are:

1. From your Desktop, select Start, All Programs, SPSS for Windows, SPSS 15.0 for Windows. A window will appear asking what you would like to do, with a few choices. I normally just click on Cancel. Then I open an existing file that I want or start keying in data.
2. Open an existing file by clicking on it, and SPSS will start. An output document will appear too. I would normally close the document; some people prefer to keep it open to look at the records.
3. From the Desktop, if there is a shortcut, click on it. A window will appear asking what you would like to do, with a few choices. I normally just cancel it and then open an existing file that I want or start keying in data.
An Overview of the Program

Now that you have the SPSS program open, let's look at its components.

Let's start from the bottom and work upwards. At the very bottom, you will see a note, SPSS Processor is Ready. This is a neat feature that tells you what you already know. What is more, when you work on a slow computer, it will tell you that it is processing. Next, you will notice the words Data View and Variable View. Data View shows you the data (numbers or words, depending on which display mode is switched on) and Variable View shows you the inner workings, that is, the meaning of those words or numbers. We will look at Variable View later on. What we are looking at now is Data View.

Data View

Next you will notice an empty grid (where your data will soon be placed). It is bordered by a series of numbers (the rows) and by the word var at the top of each column. The column headings will change once you have keyed in the appropriate names in Variable View later.
The next line up shows the various buttons that you may use. The most important is the Save button. Please do use it regularly. Others are Go To buttons or Insert buttons. The Value Labels button (also known as the toe tag icon) determines whether your screen shows numeric values or their labels, as defined in the Values column in Variable View.

The top line is the all-important line. A few things to know:

1. File, Save. This is extremely important in SPSS, as in all computer programs. Please remember to save continuously while working in SPSS. You may also use Ctrl+S.
2. File, New, (your choice of Data, Syntax, Output, Draft Output, Script). This is used when you wish to open a new file while you are working on a different file. If you choose Data, a new Data file (similar to what you are seeing now) will open, and the same goes for the others.
3. File, Open, (your choice of Data, Syntax, Output, Draft Output, Script). This is used when you wish to open an existing file while you are working on a different file. If you choose Data, a new window will appear from which you choose the existing Data file to open, and the same goes for the others.
4. Edit, Options, Draft Viewer. Make sure that Display Commands in Log is ticked. SPSS will then display the code for all the commands that you use, which you can later incorporate into your Syntax.

The other menus will be discussed as and when we use them.
Variable View

Let's look at Variable View. Click on it.

Let's start from the bottom and work upwards. At the very bottom, you will see the note SPSS Processor is Ready, and you will notice that the top is also the same as in Data View. The only difference is the middle. Each row here describes one column in Data View. Let's look at each column in Variable View.

1. Name. This is the name of the column in Data View. Normally it will coincide with the questions in your questionnaire, so that it is easier to track down once you run an analysis. The name must be unique and start with a letter, and you should keep it to 8 characters or fewer (older versions of SPSS enforce this limit; later versions accept longer names, but short names still make life easier when running analysis later on). You can place a longer explanation in the Label. Click on the appropriate box and type in the name. SPSS is kind of tricky here, especially if you want to use a hyphen, as SPSS thinks it's a minus sign. You can use an underscore instead. Also, don't put spaces between terms.
2. Type. This is the type of data that you will be typing in. There are 8 types (Numeric, Comma, Dot, Scientific notation, Date, Dollar, Custom Currency, String), but the most common are Numeric and String. Numeric means numbers, and String means letters or a mix of letters and numbers. There are implications to this choice. If you choose String, the data cannot be analyzed by numeric operations (e.g. Means). Even if your data is in Ringgit Malaysia, I would still suggest that you use Numeric instead of Dollar.
3. Width. This is the number of characters that SPSS will allow in the column in Data View. This includes dots, commas, spaces, and everything else that is typed in.
4. Decimals. This is the number of decimal places that SPSS will display. Interestingly, SPSS calculates with more decimal places than you need to know, but shows only the number that you ask for.
5. Label. As noted earlier, this is where you may type in text that explains the column. The maximum is 255 characters, but I suggest that you be brief, as the label will appear in your analysis and a long one would make your tables look ugly.
6. Values. This is where you assign meaning to the numbers that you are using. Clicking on the Values box (where the three dots are) opens another window that allows you to key in the meaning of each value. In the Value Labels dialogue box, click in the Value field and type the appropriate number, then click in the Value Label field and type in what that number represents. Always click on Add after that, otherwise it's not kept in the list. You can also change or remove values by clicking the appropriate button.
7. Missing. You may tell SPSS that certain data should be treated as missing by using a numerical code of your choice. This is done by filling in the Discrete missing values fields with those codes. You may also just leave the cell blank in Data View, in which case SPSS will treat it as what is known as SYSTEM MISSING data.
8. Columns. This is how wide the column should be on screen.
9. Align. You can also align your data accordingly. You may choose to align left, right, or center.
10. Measure. This is important, as here you decide what type of data you have. As the saying goes, rubbish in, rubbish out. SPSS does not differentiate between interval and ratio data, so these two are placed together as Scale. The other two levels of measurement remain, which are Ordinal and Nominal.
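The naming rules under Name (item 1) can be checked before you start typing. The following is a minimal sketch in plain Python, not an SPSS feature. It assumes the rules exactly as stated in this handout (starts with a letter, at most 8 characters, no spaces or hyphens) and allows only letters, digits and underscores, which is stricter than SPSS itself. Treating names that differ only in capital letters as duplicates is also an assumption. The function name check_names is made up for this example.

```python
import re

# Rules as stated in the handout: starts with a letter, at most 8 characters,
# no spaces or hyphens. Allowing only letters, digits and underscores is a
# deliberately conservative assumption.
NAME_PATTERN = re.compile(r"[A-Za-z][A-Za-z0-9_]{0,7}")


def check_names(names):
    """Return (name, problem) pairs for every name that breaks a rule."""
    problems = []
    seen = set()
    for name in names:
        if not NAME_PATTERN.fullmatch(name):
            problems.append((name, "invalid format"))
        elif name.lower() in seen:
            # Assumption: names differing only in case count as duplicates.
            problems.append((name, "duplicate"))
        seen.add(name.lower())
    return problems


candidates = ["gender", "age", "mar-status", "member since", "educationlevel", "Gender"]
for name, problem in check_names(candidates):
    print(name, "->", problem)
```

Names that pass the check, such as gender and age, are not printed; only the ones that need renaming are listed.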
How to Key in Interview-based Data

Many have told me that SPSS can't handle open-ended questions in a questionnaire or the transcript of an interview, and that such data requires NuDist or other similar computer programs to analyze it. I beg to differ, as there is more than one way to skin a cat.

Planning for Interview Key In

Refer to the attached Interview Transcripts. I would normally open an individual file for each research question of the interview. In this case there are two research questions: why do they join, and why do they stay on in, a Multi Level Marketing (MLM) company. You will also notice that there are some demographic and ancillary questions that would be nice to have in data form to help analyze the findings. Therefore I would first and foremost arrange the data in the interview according to how I want to key it into SPSS. The first section would be the demographics and ancillary data, and the second section would be the relevant research questions.

Key in Interview Data

Open SPSS. Create variables in Variable View that can represent the demographics. I see the possibility for Gender, Age, Marital Status, Race, Education level, Member of which MLM company, Member since, and Level in the company. For the research question, I would note the answers given by the fifteen respondents and code them accordingly, or even get a second coder or a third person involved if there is disagreement as to how to code them. Refer to Research Methodology books on how to do coding. Once this is done, I am ready to enter the data into SPSS. In many cases I do this on the fly, that is, I code and key in immediately.

Key in Interview Data - Example

Let's do Interview 1. For demographics, I would first key in gender in Name and type in the label, Gender. For Values, I would key in 1 for Male and 2 for Female. Notice that after you type gender in Name, all the other columns are filled in automatically with default settings. After doing so, when you open Data View, you can see that the first column is named gender.
Do this for all the other variables. For age, I normally leave Values blank, as I key in the actual age first and at a later stage transform it into age groups. See SPSS file Interview Demographics.
For the first research question, why do they join MLM companies, I would look at the answer given to the question, code it, and key it in as a Yes/No.
See SPSS file Interview RQ1 for the Variable and Data View.
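The Yes/No coding described above amounts to one column per code, with each respondent marked Yes or No for it. The following is a minimal sketch in plain Python of that layout, not the contents of the Interview RQ1 file. The theme names and the respondents' codes are hypothetical, and using 1 for Yes and 0 for No is an assumption.

```python
def to_yes_no(coded_answers, themes):
    """Turn each respondent's set of codes into one 1/0 value per theme."""
    rows = []
    for codes in coded_answers:
        # 1 = Yes (the respondent mentioned the theme), 0 = No (assumed coding).
        rows.append({theme: 1 if theme in codes else 0 for theme in themes})
    return rows


# Hypothetical themes and codes, for illustration only.
themes = ["income", "friends", "product"]
coded_answers = [{"income"}, {"friends", "product"}, set()]

for row in to_yes_no(coded_answers, themes):
    print(row)
```

Each printed row corresponds to one row in Data View, and each theme to one row in Variable View.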
Then I would delete all the coding and answers for the first research question (go to Variable View, highlight the relevant rows, click Delete). This leaves me with the demographic file. I then save it as another file and proceed to key in the answers / codes for the second research question. See SPSS file Interview RQ2 for the Variable and Data View.
Assignment

Now try by yourself to key in the remaining interviews.
How to Key in Questionnaire-based Data and to Transform

The process of keying in questionnaire-based data is similar, except that in most cases you will be working with Scale instead of Nominal data. The most important thing is to plan everything from the perspective of keying into SPSS, so that when the data comes in you can enter it into SPSS immediately. This means questionnaire design must take into account the limitations of SPSS and the requirements of the method of analysis. The demographics section will be pretty much the same as in the earlier discussion. The only differences will be the data coding for the questionnaire items and perhaps the positioning of the data in SPSS.

Key in Questionnaire Data - Example

See the example questionnaire in the file Example Questionnaire SR & Loyalty. Then refer to the SPSS file where the data has already been keyed in, Example Questionnaire 1.
Look at how the coding in the SPSS file mirrors the questionnaire. In this case, the data for age had already been coded into groups by the researcher. The others are keyed in just as the respondents answered. Please take note that as you key in the data from a questionnaire, you should write the corresponding row number on the questionnaire. This is important for the later stages of checking the data.

Checking for Mistakes

Once you have finished keying in all the data, check whether there were any mistakes in what was keyed in. How? There are two ways.

The first is to select Analyze, Descriptive Statistics, Frequencies. A dialogue box will appear. Select all the variables and transfer them to the Variables box. Then click OK, and the output will appear. Check whether there are any missing data or numbers that should not be in the dataset. As an example, if you used a Likert scale with 5 anchors, then you shouldn't have any numbers other than 1, 2, 3, 4, and 5. So if you find the number 11, or 22, or 6, there must have been a mistake in keying in the data.

The second is to select Analyze, Descriptive Statistics, Descriptives. A dialogue box will appear. Select all the variables and transfer them to the Variables box. Then click OK, and the output will appear. Check that the Minimum and Maximum of each variable fall within the range you expect. With the same 5-anchor Likert scale, the Minimum should not be below 1 and the Maximum should not be above 5. Check also whether any of the Means are extraordinarily large or small.

Correcting Mistakes

If you find something wrong, what do you do?

1. Determine why it was wrong. Was the wrong number keyed in, was the data missing, or was there some other reason?
2. Identify where the data was wrongly keyed in. In which column?
3. Look up the relevant column in Data View. Click on it.
4. Press Ctrl+F and a Find Data dialogue window will appear. Key in the number or item that was wrong to find out which row it is in.
5. Once the row is identified, go back to your bundle of questionnaires, which have been marked by row number, and find the questionnaire for the row that contains the wrong entry. Type in the right number.
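The logic of the check and the search can be sketched outside SPSS as follows. This is plain Python, not SPSS syntax. It is only an illustration of what the Frequencies check and the Ctrl+F search do for one column. The answers are hypothetical, and the function names frequency and find_bad_rows are made up for this example.

```python
from collections import Counter


def frequency(values):
    """Count how often each keyed-in value occurs, smallest value first."""
    return dict(sorted(Counter(values).items()))


def find_bad_rows(values, allowed):
    """Return the 1-based row numbers whose value is not an allowed code."""
    return [row for row, value in enumerate(values, start=1) if value not in allowed]


# Hypothetical column of answers to one 5-anchor Likert item.
answers = [4, 5, 3, 11, 2, 22, 1, 6, 5]

print(frequency(answers))
print(find_bad_rows(answers, allowed={1, 2, 3, 4, 5}))
```

The second line of output lists rows 4, 6 and 8, which are the questionnaires to pull out of the bundle and check.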
Recode

Let's now look at Recode. Let's assume the researcher wants to recode the Education Level of respondents from the current 7 values (1 = SPM, 2 = STPM, 3 = Matriculation, 4 = Diploma, 5 = Undergrad, 6 = Degree, 7 = Master) into only 3 values: those with education up to school level, those with pre-university education, and those with a university education.

The first thing to do is to look at the data itself, to see whether there are sufficient numbers in each group for such a recode. You do this by running a frequency, which will be explained later. After running a frequency and noting that there is sufficient data, you may proceed to Recode.

Click on Transform. You will notice that there are two types of Recode instruction:

1. Recode into Same Variables, and
2. Recode into Different Variables.

The choice is yours, depending on what you intend to do. I nearly always recode into a different variable, as I prefer to leave my initial data intact so that I may return to it at a later stage. So, click on Transform, Recode into Different Variables, and a dialogue box will appear.

Find the variable that we wish to recode, Education Level, and click on it. Transfer it to the Input Variable -> Output Variable box. Once you have done so, the name will be recorded and the box will be renamed Numeric Variable -> Output Variable.
In the Output Variable box you will notice Name and Label, which are the new name and label that you want for this variable. For Name type in newedu, and for Label type in New Education Level. Click on Change.

You will notice that the Numeric Variable -> Output Variable box now shows the old and the new name. Now click on Old and New Values. A new dialogue box will appear, Recode into Different Variables: Old and New Values.
There are two sections in this dialogue box. The first refers to the old data and the other to the new data that you wish to create. For the old data, you are given options for how to pick out the data: a single value, system-missing values, a range, or all other values. For the new data you are given 3 choices: key in a new value, system-missing, or copy the old value.

We want to create 3 values: those with education up to school level, those with pre-university education, and those with a university education. In the old data, school level education refers to values 1 and 2. Since this is a range, key in 1 through 2 in Range in the old data section, and in the new data section type in 1. Click on Add, otherwise it will not be added to the new data. Do the same for values 3 and 4 of the old data, which refer to pre-university education. In the new data section type in 2. Click on Add.
For university level (the remaining values, 5 to 7), you may again use Range, or you may use the option Range, value through HIGHEST, starting from 5. In the new data section type in 3. Click on Add.
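The three recode rules just entered can be sketched as follows. This is plain Python, not the command that SPSS generates. It only illustrates the old-to-new mapping and the idea of keeping the original data intact. The function name recode_education is made up for this example, and treating missing or out-of-range values as missing is an assumption.

```python
def recode_education(old_value):
    """Map the 7 original education codes onto the 3 new groups."""
    if old_value is None:
        return None              # assumption: missing stays missing
    if 1 <= old_value <= 2:      # SPM, STPM -> school level
        return 1
    if 3 <= old_value <= 4:      # Matriculation, Diploma -> pre-university
        return 2
    if old_value >= 5:           # Undergrad, Degree, Master -> university
        return 3
    return None                  # assumption: values below 1 are treated as missing


education = [1, 2, 3, 4, 5, 6, 7, None]
# Recode into a different variable: the original list is left untouched.
newedu = [recode_education(value) for value in education]

print(education)
print(newedu)
```

Printing both lists side by side plays the same role as running a frequency on the old and new variables to confirm the recode.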
Click on Continue, which will close the dialogue box and bring you back to the Recode into Different Variables dialogue box. Click OK. SPSS will process the data, and the Output window will show the command code. You may consider saving the code to use in Syntax later on. Go back to the SPSS data file and look in Variable View.

You will see a new row with the name newedu and the label New Education Level. You will also notice that there is nothing in Values. Click on the Values box and the Value Labels dialogue box will appear, where you key in the new values. Remember that your new data was coded as 1 for education up to school level, 2 for pre-university, and 3 for university education. In the Value Labels dialogue box, type in 1 for Value and school level for Value Label. Click on Add. Then type in 2 for Value and pre-university level for Value Label. Click on Add.
In the Value Labels dialogue box, type in 3 for Value and university level for Value Label. Click on Add.

Click on OK. Run a frequency again to check that the data has been transformed properly.

Assignment

Now try by yourself to recode into a different variable for the following situations.

1. Use the dataset Assignment 1. Recode the current variable named City into a different variable with two (2) values. The first is those from West Malaysia and the second is those from East Malaysia.
2. Use the dataset Assignment 1. Recode the current variable named Age into a different variable with values of your own determination. These must reflect the data.
Compute

Compute is a method where SPSS runs a computation for you in order to create a new variable. Refer to the current dataset, Example newedu.

There is a section there with 11 statements on loyalty. See rows 23 to 33 in Variable View, also shown here in Table 1.
Table 1. Loyalty Items by Rows

Row  Name      Label
23   highprob  There is a high probability that you will dine at this restaurant again.
24   recomend  You have recommended other people to patronize this restaurant.
25   sayptive  You will say positive thing to other people about the service provided by this restaurant.
26   feedback  You will give positive feedback to this restaurant.
27   trynew    You will try the new food or drinks that are recommended by this restaurant.
28   pricrise  You will continue to dine at this restaurant even if the price or service charge is increased somewhat.
29   prefer    You have strong preference on this restaurant.
30   changed   You will keep dining at this restaurant; regardless of everything being changed somewhat.
31   firstcho  This restaurant is the first choice in your mind when you consider having dinner outside.
32   oneofcho  Assume that you have only three choices when you are in need of having dinner, this restaurant must be one of them.
33   regular   You have regularly dined at this restaurant for a long period of time.
Rows 23 to 27 are the variables that make up Behavioral Loyalty. Rows 28 to 30 make up Attitudinal Loyalty. Rows 31 to 33 make up Cognition Loyalty. The average of all the rows gives a measurement of Loyalty.

Let's say we wish to create a variable named Behavioral Loyalty. We know it is the average of rows 23 to 27. Click on Transform, Compute Variable. A Compute Variable dialogue box will appear.
Target Variable refers to the name of the new variable that we wish to create; in this case let's name it behloy.
Numeric Expression refers to the mathematical formula that we intend to use to create this new Target Variable. In this case it is the average of rows 23 to 27, that is, their sum divided by 5. The formula then is (highprob + recomend + sayptive + feedback + trynew) / 5. To place it in the Numeric Expression box, type in ( and then click on the first variable and bring it across to the Numeric Expression box (click on the arrow). Type + and do the same for each of the other variables, then type ). Finally type the divide sign ( / ) followed by the number to divide by to obtain the average.
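The formula above can be sketched outside SPSS for a single respondent. This is plain Python, not the SPSS Compute command. The answers are hypothetical, the function name behavioral_loyalty is made up for this example, and returning a missing result when any item is missing is an assumption of this sketch.

```python
ITEMS = ["highprob", "recomend", "sayptive", "feedback", "trynew"]


def behavioral_loyalty(case):
    """(highprob + recomend + sayptive + feedback + trynew) / 5 for one case."""
    values = [case[name] for name in ITEMS]
    if any(value is None for value in values):
        return None   # assumption: one missing item makes the result missing
    return sum(values) / len(ITEMS)


# Hypothetical answers of one respondent on the five behavioral items.
case = {"highprob": 4, "recomend": 5, "sayptive": 4, "feedback": 3, "trynew": 4}
print(behavioral_loyalty(case))   # (4 + 5 + 4 + 3 + 4) / 5 = 4.0
```

Note that the result is generally not a whole number, which is why the new column shows decimals.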
Click OK. An output will appear. You may consider saving this output for future use.
Open the data file and look in Variable View. You will find a new row with the name behloy. It has no label and no values, so you must enter these. For the label, I suggest Behavioral Loyalty. For the values, you can just copy them from one of the original loyalty variables and paste. To copy, right-click on the Values cell of the original variable and choose Copy. Then right-click on the Values cell of the new variable and choose Paste.
Open Data View and have a look at the behloy column. You will notice that the values are no longer whole numbers but have two decimal places. This is not correct for a Likert scale, so it has to be changed. You could just click on the Decimals column in Variable View and reduce it to 0 decimals. However, I don't prefer this, because when you run a frequency SPSS will still show the different decimal values.
I prefer to open a Microsoft Excel file. Copy all the data for the variable from SPSS and paste it into the Excel file. Highlight all the numbers in the Excel file. Then click on Format, Cells and a Format Cells dialogue box will appear. Select Number and 0 decimal places. Click OK. All the numbers will change to whole numbers. Copy these and paste them back into SPSS.
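What this step does to the numbers is ordinary rounding to the nearest whole number. The following is a minimal sketch in plain Python of that rounding. It is not a replacement for the Excel route described above. The values are hypothetical, the function name round_half_up is made up for this example, and rounding ties (.5) away from zero is an assumption about the intended behaviour.

```python
from decimal import Decimal, ROUND_HALF_UP


def round_half_up(value):
    """Round to the nearest whole number, with ties (.5) going away from zero."""
    return int(Decimal(str(value)).quantize(Decimal("1"), rounding=ROUND_HALF_UP))


# Hypothetical computed scores, for illustration only.
scores = [3.4, 3.5, 3.6, 4.5]
print([round_half_up(score) for score in scores])
```

Python's built-in round is avoided here because it rounds ties to the nearest even number, so 4.5 would become 4 rather than 5.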
Select all the data in SPSS. Copy.
Paste the data in Microsoft Excel.
Select Format, Cells. This is the Format Cells dialogue box. Change the decimals to 0 and click OK.
Copy the data and paste it back into SPSS.

Assignment

Now try by yourself to compute a new variable for the following situations.

1. Use the dataset Example behloy. Compute the relevant loyalty variables into Attitudinal Loyalty and Cognition Loyalty.
2. Use the dataset Example behloy. Compute the various loyalty variables into Overall Loyalty.

See the answer in Example Loyalty.
Syntax

SPSS runs on a programming language that most of us will never use or be familiar with. Nevertheless, knowing some simple tricks of the trade will make life easier, especially when running repetitive analyses. Syntax in SPSS is that programming language. I do not recommend that you learn it, but if you wish to do so you may look in the Help topics in SPSS or in its manuals. When you are working in syntax, you can find out the commands, subcommands, and keywords by pressing F1. For me, and for most researchers, it is sufficient to know how to create the command language and how to run it again and again for your task.

By now you will realize that I have mostly used the commands found in the menus and dialogue boxes. This is because they are easy to use and easier to understand. However, if you need to repeat your analysis, you can save the command language in a Syntax file so that you can run the analysis at a later date or repeat various analyses.

A syntax file is just a file that carries SPSS language commands. You can type or paste syntax into a syntax window that is already open. You can open a new syntax window by choosing File, New, Syntax. To save a syntax file, from the menus choose File, Save. To open a saved syntax file, from the menus choose File, Open, Syntax. Select the syntax file that you want from the dialogue box. If no syntax files are displayed, make sure Syntax (*.sps) is selected in the Files of type drop-down list. Click Open.

How to Get the Commands

As discussed earlier, the normal ways are by reading the manuals and the Help section. I suggest some simpler ways. Whenever you set up an analysis, you will notice that the dialogue box has a Paste button. When you click on the Paste button, a syntax window will open with the syntax for the analysis that you intended to do.

Open the file Example Loyalty. Choose Analyze, Descriptive Statistics, Descriptives. The Descriptives dialogue box will appear. Choose the variables behloy, attloy, cogloy, and allloy that were created earlier. If you click on OK, you will get an Output table. Instead, click on Paste. You will see a Syntax window appear with the commands for the analysis that you wanted to do.
You can save the syntax file as Example Syntax.

Another way to obtain syntax commands is by running the analysis. You will obtain an output. Notice that the top section of the output shows the very same syntax command as the one you saved earlier. Create a new syntax file or open an existing one. Copy the syntax command from the output file and paste it into the syntax file.

Assignment

Now try by yourself to create your own syntax file.
Output

We have been discussing quite a number of matters while looking at the Output file, yet without discussing this rather important file itself. As you may have noticed, every time you do an analysis or any other action in SPSS, an output file will appear. You can close it or leave it open, depending on your personal taste and need. I would normally close it, as I prefer to see only the new syntax commands without the clutter of past work. However, sometimes the past work in itself is essential. Therefore the choice is yours. In the case of an analysis, you will obtain an output. See the example of an output.

You will notice that the output file is divided into two sections. One is an outline of headings and the other is the output itself. There will be the SPSS command syntax and the various tables relevant to the analysis carried out. From the output, you can copy whatever is relevant to your study and paste it into other programs such as MS Word. This is what most students do. Please don't do this, as it indicates a lack of analysis on your part. This is how students normally present such findings.
Gender
                        Frequency  Percent  Valid Percent  Cumulative Percent
Valid  Male                   105     42.2           42.2                42.2
       Female                 144     57.8           57.8               100.0
       Total                  249    100.0          100.0

Race
                        Frequency  Percent  Valid Percent  Cumulative Percent
Valid  Malay                   57     22.9           22.9                22.9
       Chinese                148     59.4           59.4                82.3
       Iban                    15      6.0            6.0                88.4
       Others                  29     11.6           11.6               100.0
       Total                  249    100.0          100.0

Age
                        Frequency  Percent  Valid Percent  Cumulative Percent
Valid  15-24                  173     69.5           69.5                69.5
       25-34                   63     25.3           25.3                94.8
       35-44                   13      5.2            5.2               100.0
       Total                  249    100.0          100.0

Education Level
                        Frequency  Percent  Valid Percent  Cumulative Percent
Valid  SPM                     60     24.1           24.1                24.1
       STPM                    33     13.3           13.3                37.3
       Matriculation            6      2.4            2.4                39.8
       Diploma                 29     11.6           11.6                51.4
       Undergraduate           34     13.7           13.7                65.1
       Degree                  83     33.3           33.3                98.4
       Master                   4      1.6            1.6               100.0
       Total                  249    100.0          100.0
[Bar chart: Gender. Frequency by Gender (Male, Female)]

[Bar chart: Age. Frequency by Age (15-24, 25-34, 35-44)]
[Bar chart: Race. Frequency by Race (Malay, Chinese, Iban, Others)]

[Bar chart: Education Level. Frequency by Education Level (SPM, STPM, Matriculation, Diploma, Undergraduate, Degree, Master)]
Again, please don't do this. It is common in most students' presentations of SPSS findings: a direct cut and paste of the output file. On top of that, the graphs and the tables show the same information, so one of them is redundant. Students do this even when the output is for a regression or a factor analysis. Please note, and we will discuss this, that there are norms of presentation for the various types of analysis.
In this case, a better presentation can still start with cut and paste, but the various tables should then be remodelled into one acceptable table for presentation, such as the following:

Table 1: Respondents' Profile

Variable                          Frequency  Percent
Gender           Male                   105     42.2
                 Female                 144     57.8
Race             Malay                   57     22.9
                 Chinese                148     59.4
                 Iban                    15      6.0
                 Others                  29     11.6
Age              15-24                  173     69.5
                 25-34                   63     25.3
                 35-44                   13      5.2
Education Level  SPM                     60     24.1
                 STPM                    33     13.3
                 Matriculation            6      2.4
                 Diploma                 29     11.6
                 Undergraduate           34     13.7
                 Degree                  83     33.3
                 Master                   4      1.6
Aside from cut and paste, one can also export what was found in the output file to MS Word. This is done by right clicking the mouse and selecting Export.
In the Export format box, select Word/RTF file (*.doc). Then in Export File, click on Browse and select where you wish to save the exported document to.
Click OK.
The document should appear as OUTPUT.DOC.
It is now an MS Word document, and you can create tables from the file instead of cutting and pasting over and over again.

Assignment

1. Now try by yourself to run the above frequency.
2. Then try to run the Export function.
Some Assumptions

Normally Distributed

Nearly all of the analyses discussed here require that the data be normally distributed. You can check this in a Q-Q Plot. Click on Analyze, Descriptives, Q-Q Plots.

Select the variables that you require, in this case apology and age. Note that your Test Distribution is Normal.
Click on OK. The relevant output should appear. See Output QQ Plot.

On the Q-Q Plot, check that the data is normally distributed by noting whether the dots run along the line. These two look acceptable.

Variances are Equal

You can check whether the variances of the groups being compared are equal by looking at Levene's Test for Equality of Variances. This will normally appear whenever you run an analysis that requires it. If the Levene test is significant (the value in Sig. is less than 0.05), the variances of the two samples are significantly different. If the Levene test is not significant (the value in Sig. is more than 0.05), the variances of the two samples are approximately equal.
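For readers who want to see what the dots on a Q-Q Plot represent, here is a minimal sketch in plain Python. It is not how SPSS is implemented. Each sorted observation is paired with the value expected at that rank under a normal distribution with the same mean and standard deviation. The ages are hypothetical, the function name qq_points is made up for this example, and the use of Blom's formula for the plotting positions is an assumption of this sketch.

```python
from statistics import NormalDist, mean, stdev


def qq_points(data):
    """Pair each sorted observation with its expected value under normality."""
    n = len(data)
    fitted = NormalDist(mean(data), stdev(data))
    points = []
    for rank, observed in enumerate(sorted(data), start=1):
        proportion = (rank - 0.375) / (n + 0.25)   # Blom's plotting position (assumed)
        points.append((observed, fitted.inv_cdf(proportion)))
    return points


# Hypothetical ages, for illustration only.
ages = [19, 21, 22, 22, 23, 24, 25, 27, 30, 34]
for observed, expected in qq_points(ages):
    print(observed, round(expected, 1))
```

If the data is close to normal, the observed and expected values in each pair are close to each other, which is what dots running along the line means.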
How to Analyze: Frequency

Open the file Example Loyalty. Let's say you want to know the frequency of your respondents' gender. Select Analyze, Descriptive Statistics, Frequencies. The Frequencies dialogue box will appear.
Select Gender and transfer it to the Variables box.
Click on OK. The Output file will appear. See earlier examples of an output file.
Note

There are three buttons at the bottom of the dialogue box: Statistics, Charts, and Format. When you click on Statistics, a Statistics dialogue box will appear. It has four mini boxes: Percentile Values, Central Tendency, Dispersion, and Distribution.
Percentile Values is used in cases where you want to know groupings by quartiles or cut points, such as when you want to create a new grouping as discussed in Recode. This will allow you to see what the groupings look like. When you click on Charts, you can choose the type of chart that you want; SPSS provides you with a number of choices. Once you have obtained the Output, you may then copy it and use it in other programs such as MS Word.
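The quartiles and cut points reported under Percentile Values are the values that split the sorted data into equal-sized groups. The Python sketch below shows the idea on hypothetical data using the standard library's statistics.quantiles; the interpolation rule it uses is an assumption and may not match the SPSS output to the last decimal.

```python
from statistics import quantiles


def cut_points(values, groups):
    """Return the groups - 1 values that split the data into equal-sized groups."""
    return quantiles(values, n=groups)


# Hypothetical coded responses, for illustration only.
responses = [1, 2, 3, 4, 5, 6, 7]

quartiles = cut_points(responses, 4)      # three cut points, four groups
three_groups = cut_points(responses, 3)   # two cut points, three groups
print(quartiles, three_groups)
```

Cut points found this way can then be used as the boundaries when recoding a variable into a new grouping.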
How to Present the Findings

Refer to the Output discussion.

Assignment
1. Open the file Example Loyalty. Now try by yourself to run a frequency from the above dataset for education level.
2. Determine the Quartiles for this dataset. Create a pie chart for the Quartiles.
3. Determine the cut points for three groups for education level. Create a bar chart for the three groups.
How to Analyze: Crosstabulation

Open the file Example Loyalty. Let's say you want to know the relationship between gender and the use of apology in service recovery. Select Analyze, Descriptive Statistics, Crosstabs. The Crosstabs dialogue box will appear.
You will notice that there are two boxes, one for rows and the other for columns. This is how your data will be shown, so careful planning has to be done in order to present your data nicely. In this case I would rather have Gender in the columns and the use of apology in service recovery in the rows. Why? Because it makes the data easier to read as well as to present later on.
You will also notice that there are three buttons at the bottom of the dialogue box: Statistics, Cells, and Format. In Statistics, I would normally click only on Chi-square, and in most cases even this is ignored.
In Cells, the main issue is whether to click on Percentages by row, column or both. This will depend on what you intend to find out.
If you click on Row, the output will be as follows.
If you click on Column, the output will be as follows.
As you can see, the presentation of the data and its interpretation will differ depending on which percentage is chosen. The researcher must make this decision in line with his or her research question and objectives. In this case, I choose by column. As for Format, I normally just let it be.
Click OK. The Output file will appear. See Crosstab Output.
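The difference between row and column percentages can be checked by hand. The Python sketch below uses the Male and Female counts from the handout's crosstabulation of Apology by Gender (eleven response levels, from No chance to Certain); the function names are assumptions made for this illustration and nothing here is SPSS syntax.

```python
# Counts per apology level, copied from the handout's Apology by Gender crosstab.
male = [0, 0, 0, 4, 4, 8, 18, 11, 22, 13, 25]
female = [1, 1, 5, 7, 7, 19, 24, 10, 25, 19, 26]


def column_percent(column):
    """Percent of each cell within its own column (here, within one gender)."""
    total = sum(column)
    return [round(100 * n / total, 1) for n in column]


def row_percent(*columns):
    """Percent of each cell within its own row (here, within one apology level)."""
    table = []
    for row in zip(*columns):
        total = sum(row)
        table.append([round(100 * n / total, 1) if total else 0.0 for n in row])
    return table


print(column_percent(male)[-1])         # share of men who answered Certain
print(row_percent(male, female)[-1])    # split of the Certain answers by gender
```

Column percentages answer "what share of men gave this answer", while row percentages answer "what share of this answer came from men", which is why the choice depends on the research question.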
How to Present the Findings

A crosstab can be presented in a variety of ways, of which the easiest is to make it into a Table, as shown here for both frequency and percentage. It can just as easily be shown for either one by itself.
Table 1: Crosstabulation of Possibility of Apology by Gender

                                      Male               Female
Apology                         Frequency      %    Frequency      %
No chance                               0    .0%            1    .7%
Very slight possibility                 0    .0%            1    .7%
Slight possibility                      0    .0%            5   3.5%
Some possibility                        4   3.8%            7   4.9%
Fair possibility                        4   3.8%            7   4.9%
Fairly good possibility                 8   7.6%           19  13.2%
Good possibility                       18  17.1%           24  16.7%
Probable                               11  10.5%           10   6.9%
Very probable                          22  21.0%           25  17.4%
Almost sure                            13  12.4%           19  13.2%
Certain, practically certain           25  23.8%           26  18.1%

Assignment
1. Open the file Example Loyalty. Now try by yourself to run a cross tabulation from the above dataset for education level and apology.
2. Prepare a Table to depict your findings.
How to Analyze: Means

Open the file Example Loyalty. Let's say you want to know the Means of the various measurements of loyalty that you have used, from the individual variables to the summations. Select Analyze, Descriptive Statistics, Descriptives. The Descriptives dialogue box will appear.
Select and transfer all the loyalty variables into the Variable(s) box.
Click OK. The output will appear. See Means Output.
Note

There is an Options button. Click on it and it will display the following.
Normally I am satisfied with the default selections, though sometimes there may be a need to test for Distribution. Click Continue.
How to Present the Findings

Means can be presented in a variety of ways, of which the easiest is to make them into a Table as shown here. Or you can just show the summated loyalty variables.

Table 1: Means for Loyalty Variables

Variables                                                          Mean   Std. Dev.
There is a high probability that you will dine at this
  restaurant again.                                                6.43   2.31
You have recommended other people to patronize this restaurant.    5.66   2.26
You will say positive thing to other people about the service
  provided by this restaurant.                                     5.82   2.26
You will give positive feedback to this restaurant.                5.43   2.30
You will try the new food or drinks that are recommended by
  this restaurant.                                                 6.14   2.44
Behavioral Loyalty                                                 5.88   1.97
You will continue to dine at this restaurant even if the price
  or service charge is increased somewhat.                         4.99   2.23
You have strong preference on this restaurant.                     5.51   2.12
You will keep dining at this restaurant; regardless of
  everything being changed somewhat.                               5.06   2.18
Attitude Loyalty                                                   5.20   1.91
This restaurant is the first choice in your mind when you
  consider having dinner outside.                                  4.88   2.31
Assume that you have only three choices when you are in need
  of having dinner, this restaurant must be one of them.           5.50   2.49
You have regularly dined at this restaurant for a long period
  of time.                                                         5.18   2.51
Cognition Loyalty                                                  5.20   2.15
Overall Loyalty                                                    5.50   1.78

Or:

Table 1: Means for Loyalty Variables

Variables             Mean   Std. Dev.
Behavioral Loyalty    5.88   1.97
Attitude Loyalty      5.20   1.91
Cognition Loyalty     5.20   2.15
Overall Loyalty       5.50   1.78
Assignment
1. Open the file Example Loyalty. Now try by yourself to run a Means from the above dataset for all the Service Recovery variables.
2. Prepare a Table to depict your findings.
How to Analyze: t-test

Open the file Example Loyalty. A T-test is normally used when there are only two values in a variable. An Anova is used when there are three or more values in a variable. SPSS offers 3 types of t-test:
1. One Sample T-Test
2. Independent Sample T-Test
3. Paired Samples T-Test

One Sample T-Test
A One Sample T-Test compares the mean score of a sample to a known value. Let's say you want to know whether the respondents' education level is different from the known population mean. In this case, let's say the population mean for education level is 4. Click on Analyze, Compare Means, One Sample T-Test. The following dialogue box will appear.
Click and transfer the Education Level variable to the Test Variable box. Then type in the Test Value, which refers to the known population mean. In this case, let's assume it is 4.
Click OK. The output file will appear. See Output One Sample T Test.
The T value is -1.218, with 248 degrees of freedom. The significance value is 0.224. This means that there is no significant difference between the sample mean and the test value (the significance is more than 0.05).
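The t value that SPSS reports for a One Sample T-Test is the gap between the sample mean and the Test Value, divided by the standard error of the mean. The following Python sketch shows the calculation on hypothetical education codes; the data and the function name one_sample_t are assumptions made for illustration, and the significance value is left to SPSS.

```python
from math import sqrt
from statistics import mean, stdev


def one_sample_t(values, test_value):
    """Return (t, degrees of freedom) for a one-sample t-test."""
    n = len(values)
    standard_error = stdev(values) / sqrt(n)   # sample standard deviation / sqrt(n)
    t = (mean(values) - test_value) / standard_error
    return t, n - 1


# Hypothetical education-level codes, for illustration only.
education = [3, 4, 5, 3, 2, 4, 3, 5, 4, 3]
t_value, df = one_sample_t(education, test_value=4)
print(f"t = {t_value:.3f}, df = {df}")
```

A negative t, as in the handout's output, simply means that the sample mean lies below the Test Value.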
How to Present the Findings

T-Test findings can be presented as a sentence or a Table. If in a sentence, I would say:
T-test findings indicate that the Education Level variable (t = -1.218, p = 0.224) is not significantly different from the population mean.
Or, it could also be presented in Table form as follows:

Table 1: One-Sample Test for Educational Level Variable

                   Test Value = 4
Variable           t        df    Sig. (2-tailed)
Education Level    -1.218   248   .224

Assignment
1. Open the file Example Loyalty. Now try by yourself to run a One Sample T-Test from the above dataset for all the Service Recovery variables, with a Test Value of 5.
2. Prepare a Table to depict your findings.

Independent Sample T-Test
An Independent Samples T Test compares the mean scores of two groups on a given variable. Let's say you want to know whether the means for apology to be used as a service recovery are similar or different between men and women. Click on Analyze, Compare Means, Independent Samples T Test. The dialogue box will appear.
You will see that there is a Test Variable box and Grouping Variable box. Move the dependent variable, in this case Apology, to the Test Variable box. Move the Independent Variable, in this case Gender, to the Grouping Variable.
When you have done so, you will notice that the Define Groups button pops up. Click on it and define your groups.
As you know, we have only two values here, that is 1 and 2. Type them in.
Click on Continue. There is an Options button in the main Independent Samples T Test dialogue box. This is to indicate the Confidence Interval that you wish to use. I normally leave it at 95%.
Click OK. The Output file will appear. See Output for Independent Sample T-Test.
The Output depicts the Means. Here we can see that men score higher than women for the variable apology.
Group Statistics

apology
Gender    N     Mean     Std. Deviation   Std. Error Mean
Male      105   7.5810   1.99413          .19461
Female    144   6.9444   2.39398          .19950
The next output that is important to note is Levene's Test.

Independent Samples Test: Levene's Test for Equality of Variances

apology                        F       Sig.
Equal variances assumed        4.994   .026
Equal variances not assumed
This is important, as one of the assumptions for running this test is that the variances are approximately equal. If the Levene test is significant (the value in Sig. is less than 0.05), this indicates that the variances of the two samples are significantly different. If the Levene test is not significant (the value in Sig. is more than 0.05), this indicates that the variances of the two samples are approximately equal. In this example, the Levene test is significant, indicating that the variances of the two samples are significantly different. The next portion to note is the results of the Independent T-Test.
Independent Samples Test: t-test for Equality of Means

                                                                                         95% Confidence Interval
                                                                 Mean       Std. Error   of the Difference
apology                        t       df        Sig. (2-tailed)  Difference  Difference   Lower    Upper
Equal variances assumed        2.220   247       .027             .63651      .28673       .07175   1.20126
Equal variances not assumed    2.284   242.594   .023             .63651      .27870       .08753   1.18548
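Both rows of this table can be reproduced from the Group Statistics alone. The Python sketch below is only an illustration (the function names are assumptions, and SPSS does this internally): it computes the pooled-variance t for the "Equal variances assumed" row and the Welch t with its adjusted degrees of freedom for the "Equal variances not assumed" row, then picks a row using the Levene significance value of .026.

```python
from math import sqrt


def pooled_t(m1, s1, n1, m2, s2, n2):
    """t and df when the two variances are assumed equal (top row)."""
    df = n1 + n2 - 2
    pooled_var = ((n1 - 1) * s1 ** 2 + (n2 - 1) * s2 ** 2) / df
    std_error = sqrt(pooled_var * (1 / n1 + 1 / n2))
    return (m1 - m2) / std_error, df


def welch_t(m1, s1, n1, m2, s2, n2):
    """t and df when the variances are not assumed equal (bottom row)."""
    v1, v2 = s1 ** 2 / n1, s2 ** 2 / n2
    std_error = sqrt(v1 + v2)
    df = (v1 + v2) ** 2 / (v1 ** 2 / (n1 - 1) + v2 ** 2 / (n2 - 1))
    return (m1 - m2) / std_error, df


# Means, standard deviations and N for apology, from the Group Statistics table.
male = (7.5810, 1.99413, 105)
female = (6.9444, 2.39398, 144)

top_row = pooled_t(*male, *female)
bottom_row = welch_t(*male, *female)

levene_sig = 0.026  # from the Levene's Test output
chosen = bottom_row if levene_sig < 0.05 else top_row
print(f"top: t = {top_row[0]:.3f}, df = {top_row[1]}")
print(f"bottom: t = {bottom_row[0]:.3f}, df = {bottom_row[1]:.3f}")
print(f"row to report: t = {chosen[0]:.3f}")
```

Small differences in the last decimal are expected because the means and standard deviations in the handout are already rounded.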
Read the BOTTOM line when the Levene test indicates that the variances of the two samples are significantly different. Read the TOP line when the Levene test indicates that the variances of the two samples are approximately equal. In this case, we read the BOTTOM line. There is a significant difference between the two groups (the significance level is less than 0.05). This indicates that men and women differ in how they see the possibility of apology being used as a service recovery effort.

How to Present the Findings

T-Test findings can be presented as a sentence or a Table. If in a sentence, I would say:
How men and women see the possibility of apology being used as a service recovery effort is different (t = 2.284, p = 0.023).
Or, it could also be presented in Table form as follows:
Table 1: Independent Sample Test for Apology

Variable   t       df        Sig. (2-tailed)
Apology    2.284   242.594   .023

Assignment
1. Open the file Example Loyalty. Now try by yourself to run an Independent Sample T-Test from the above dataset for all the Service Recovery variables, against gender.
2. Prepare a Table to depict your findings.

Paired Samples T-Test
The Paired Samples T-Test compares the means of two variables. This T-Test measures the difference between the two variables for each case, and then tests to see if the average difference is significantly different from zero. Let's say you want to see if there is any difference between the variables apology and assist as methods of service recovery. Click on Analyze, Compare Means, Paired Samples T Test. The dialogue box will appear.
Click on apology and assist. You will notice that when you click the variables, they will appear in the Current Selections. You can only choose two variables at a time.
Then transfer them to the Paired Variables box.
The Options button is similar to the previously discussed button. Click OK and the Output will appear. See Output Paired Samples T-Test.
The first output depicts the Means, which in this case indicates that apology is seen as a more probable response for service recovery.

Paired Samples Statistics

Pair 1     Mean     N     Std. Deviation   Std. Error Mean
apology    7.2129   249   2.25199          .14271
assist     6.4699   249   2.16462          .13718
The second output depicts the correlation between the two variables. Apparently there is a high correlation between apology and assistance.

Paired Samples Correlations

Pair 1              N     Correlation   Sig.
apology & assist    249   .601          .000

The last part that we need to see is the difference. In this case there is a clear difference. If the significance value is less than .05, there is a significant difference. If the significance value is greater than .05, there is no significant difference.
Paired Samples Test

Pair 1              t       df    Sig. (2-tailed)
apology - assist    5.942   248   .000
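The paired t value ties together the three output tables above: the two means, the two standard deviations, and the correlation between the variables. A minimal Python sketch of that link follows, assuming only the rounded figures printed in this handout; because the correlation is rounded to .601, the result agrees with the SPSS value of 5.942 only to about two decimals. The function name is hypothetical.

```python
from math import sqrt


def paired_t_from_summary(m1, s1, m2, s2, r, n):
    """Approximate paired t and df from means, SDs, correlation and N."""
    # Variance of the case-by-case differences between the two variables.
    var_diff = s1 ** 2 + s2 ** 2 - 2 * r * s1 * s2
    std_error = sqrt(var_diff / n)
    return (m1 - m2) / std_error, n - 1


# apology and assist figures from the Paired Samples output.
t_value, df = paired_t_from_summary(7.2129, 2.25199, 6.4699, 2.16462, 0.601, 249)
print(f"t = {t_value:.2f}, df = {df}")
```

The formula also shows why a high correlation between the two variables makes the test more sensitive: it shrinks the variance of the differences.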
How to Present the Findings

T-Test findings can be presented as a sentence or a Table. If in a sentence, I would say:
Apology and Assistance correlate well (Correlation = 0.601, p = 0.000), yet the paired samples t-test indicates that there is a significant difference between the two variables (t = 5.942, p = 0.000).
Or, it could also be presented in Table form as follows:

Table 1: Paired Sample Test for Apology-Assistance

Variable              t       df    Sig. (2-tailed)
Apology-Assistance    5.942   248   .000

Assignment
1. Open the file Example Loyalty. Now try by yourself to run a Paired Sample T-Test from the above dataset for all the Service Recovery variables.
2. Prepare a Table to depict your findings.
How to Analyze: Correlation

Correlation is used when you want to know how two variables are associated with each other and how strong that association is. Correlation can also tell you the direction of the association. Pearson R Correlation is used when the data that you are using is normally distributed. When the data that you are using is not normally distributed, then you use Spearman Rho. SPSS offers three types of correlations:
1. Bivariate
2. Partial
3. Distance
The normally used correlation is Bivariate, which will be discussed here. Open the file Example Loyalty. Let's say you want to know the correlation between the apology and assist variables. These are two variables that are used in service recovery. Select Analyze, Correlations, Bivariate. A dialogue box will appear.
Remember, if your data is normally distributed then use Pearson R Correlation, and if it is not normally distributed, then use Spearman Rho. Both choices are listed under Correlation Coefficients in the dialogue box. You can also choose whether to use a Two-tailed or One-tailed significance test.
Select apology and assist and transfer them to the Variables box. The Options button here allows you to choose if you require added statistics. Normally I don't use it as the statistics would have been calculated earlier.
Click OK and the output will appear. See Output Correlation.
The output provides us with the correlation coefficient, significance and number of cases (N). The correlation coefficient is shown as a number between +1 and -1. The correlation is stronger the nearer the coefficient gets to either +1 or -1. The correlation coefficient also provides the direction of the relationship, either positive (as one increases, so does the other) or negative (as one increases, the other decreases). In this case, the correlation coefficient is 0.601, which is quite acceptable and positive. So in this case, as the probability of apology increases, there is also an increase in the probability of assistance.

How to Present the Findings

Correlation findings can be presented as a sentence or a Table. If in a sentence, I would say:
Apology and Assistance correlate well (Correlation = 0.601, p = 0.000).
Or, it could also be presented in Table form as follows:

Table 1: Correlations

Variable   Apology
Assist     .601(*)

* Correlation is significant at the 0.01 level (2-tailed).
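To make the meaning of the coefficient concrete, the sketch below computes a Pearson correlation by hand in Python. The two lists are hypothetical scores invented for this illustration, not the Example Loyalty data, and pearson_r is simply a name chosen here.

```python
from math import sqrt


def pearson_r(x, y):
    """Pearson correlation: shared variation divided by total variation."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


# Hypothetical scores for two service recovery items, for illustration only.
apology = [7, 9, 4, 8, 10, 6, 5, 8]
assist = [6, 8, 5, 6, 9, 7, 4, 7]
r = pearson_r(apology, assist)
print(f"r = {r:.3f}")
```

A value near +1 means the two sets of scores rise together, a value near -1 means one falls as the other rises, and a value near 0 means there is little linear association.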
Assignment
1. Open the file Example Loyalty. Now try by yourself to run Correlations from the above dataset for all the Service Recovery variables.
2. Prepare a Table to depict your findings.
How to Analyze: One-Way ANOVA

The One-Way ANOVA compares the means of two or more groups based on one independent variable. Open the file Example Loyalty. Let's say you want to run a One-Way Anova of the variable apology by age level. In simple terms, you want to know whether there is any difference in how the various age groups look at the variable apology. Click on Analyze, Compare Means, One-Way Anova. The dialogue box will appear.
Click on apology and move it to the Dependent List box. Click on Age and move it to the Factor box.
There are three buttons at the bottom, Contrast, Post Hoc, and Options. You will need to look at Options and Post Hoc. Click on Options and click on the boxes for Descriptives and Homogeneity of Variance. Click continue.
Click on Post Hoc and when it opens, it will show you various post hoc tests. Normally I would use either Tukey or Bonferroni. If there are equal numbers of cases in each group, choose Tukey. If there are not equal numbers of cases in each group, choose Bonferroni. You can also click on more than one, but this is just a post hoc test so there is no need to do so. In this case I choose Bonferroni. You will also note that the significance level is maintained at 95% level of confidence or shown here as .05. Click continue.
Click OK and the output should appear.
Now let's look back at the One-Way Anova analysis that we have run. The output should appear as follows. Refer to Output One Way Anova.
We can see the Means by age group for apology. The means look similar, with those in the age bracket of 25-34 scoring highest.
Descriptives

apology
                                                       95% Confidence Interval for Mean
         N     Mean     Std. Deviation   Std. Error    Lower Bound   Upper Bound   Minimum   Maximum
15-24    173   7.1214   2.32335          .17664        6.7727        7.4701          .00     10.00
25-34     63   7.5079   2.09356          .26376        6.9807        8.0352         2.00     10.00
35-44     13   7.0000   2.04124          .56614        5.7665        8.2335         4.00     10.00
Total    249   7.2129   2.25199          .14271        6.9318        7.4939          .00     10.00
Then comes the Levene test, as discussed earlier. If the Levene test is significant (the value in Sig. is less than 0.05), this indicates that the variances of the samples are significantly different. If the Levene test is not significant (the value in Sig. is more than 0.05), this indicates that the variances of the samples are approximately equal.

Test of Homogeneity of Variances

apology
Levene Statistic   df1   df2   Sig.
1.280              2     246   .280

We note that the variances are approximately equal. Then we can see the Anova findings. The findings here indicate that the significance value is 0.478, which is more than 0.05. This indicates that there is no significant difference between the groups.
ANOVA

apology
                 Sum of Squares   df    Mean Square   F      Sig.
Between Groups          7.522       2   3.761         .740   .478
Within Groups        1250.197     246   5.082
Total                1257.719     248
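The Mean Square and F columns of the ANOVA table follow directly from the Sum of Squares and df columns. The Python sketch below repeats that arithmetic with the figures from this output; the function name anova_f is an assumption, and the significance value still has to come from SPSS.

```python
def anova_f(ss_between, df_between, ss_within, df_within):
    """Return (mean square between, mean square within, F)."""
    ms_between = ss_between / df_between
    ms_within = ss_within / df_within
    return ms_between, ms_within, ms_between / ms_within


# Sums of squares and degrees of freedom from the ANOVA table for apology by age.
ms_between, ms_within, f_value = anova_f(7.522, 2, 1250.197, 246)
print(f"MS between = {ms_between:.3f}, MS within = {ms_within:.3f}, F = {f_value:.3f}")
```

An F value below 1, as here, means the variation between the age groups is smaller than the variation within them, which is consistent with the non-significant result.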
Lastly the Bonferroni test is shown. SPSS will highlight with an asterisk (*) any significant differences. In this case there are none.

Multiple Comparisons

Dependent Variable: apology
Bonferroni
                                                                   95% Confidence Interval
(I) Age   (J) Age   Mean Difference (I-J)   Std. Error   Sig.      Lower Bound   Upper Bound
15-24     25-34     -.38655                 .33173        .735     -1.1862        .4131
          35-44      .12139                 .64831       1.000     -1.4413       1.6841
25-34     15-24      .38655                 .33173        .735      -.4131       1.1862
          35-44      .50794                 .68673       1.000     -1.1474       2.1633
35-44     15-24     -.12139                 .64831       1.000     -1.6841       1.4413
          25-34     -.50794                 .68673       1.000     -2.1633       1.1474
How to Present the Findings

One-Way Anova findings can be presented as a sentence or a Table. If in a sentence, I would say:
There was no significant difference in how the age groups saw apology (F = 0.740, p = 0.478).
Or, it could also be presented in Table form as follows:

Table 1: One Way Anova by Age Scale

Variable   F      Sig.
Apology    .740   .478

Assignment
1. Open the file Example Loyalty. Now try by yourself to run One-Way Anova from the above dataset for all the Service Recovery variables by Age scale.
2. Prepare a Table to depict your findings.
How to Analyze: Manova

MANOVA, or Multivariate Analysis of Variance, is simply an Anova that runs on several dependent variables. MANOVA is used to assess whether an overall difference exists between groups, and whether there are differences among the combinations of factors specified by the researcher. Normally, only after a Manova is run does the researcher carry out Univariate GLM analysis. As an example, in our One-way Anova, we ran the analysis only on age and apology. What if we wanted to test all the various methods available for service recovery and add on education level as well? Running a MANOVA can do this. Open the file Example Loyalty. Select Analyze, click on General Linear Model, Multivariate. The dialogue box will appear.
Click on all the service recovery variables and transfer them to Dependent Variables. Transfer age and education level to Fixed Factors.
The buttons on the right side are for further consideration by the researcher. For me, I would leave the Model, Contrast and Save buttons alone. As for Plots, it depends on whether the researcher requires graphs. If you do, select Plots and the following box will appear.
Select the plots that you require, let's say by age, edu, and their interaction. This can be done by clicking on age and placing it on the Horizontal Axis. Then click on Add.
To have both age by edu, click age and place it on the Horizontal Axis and click on edu and place it on Separate Lines. Click Add. Click on Continue. As for Post Hoc, when you click on it you will be provided with a variety of options.
Select and transfer the relevant factors to the Post Hoc Tests for: box. Choose and click on the relevant post hoc test that you wish to carry out. In this case, Bonferroni.
For the Options button, when you click on it, the following will display.
If you wish to display the Means, select all and transfer them to the Display Means for: box. I would normally click on Descriptive statistics, Spread vs. level plots, and Residual plots. Then click on Continue. You will return to the main Multivariate dialogue box. Click OK. The output will appear. Refer to Output Manova.
You will find displayed initially the Between Subject Factors and the Descriptive Statistics. The next box will depict the Multivariate Test, which is of greatest importance to you.
Multivariate Tests(c)

Effect                            Value   F           Hypothesis df   Error df   Sig.
Intercept   Pillai's Trace         .752   36.541(a)    18.000          217.000   .000
            Wilks' Lambda          .248   36.541(a)    18.000          217.000   .000
            Hotelling's Trace     3.031   36.541(a)    18.000          217.000   .000
            Roy's Largest Root    3.031   36.541(a)    18.000          217.000   .000
age         Pillai's Trace         .194    1.300       36.000          436.000   .120
            Wilks' Lambda          .814    1.309(a)    36.000          434.000   .114
            Hotelling's Trace      .220    1.318       36.000          432.000   .108
            Roy's Largest Root     .163    1.978(b)    18.000          218.000   .012
edu         Pillai's Trace         .400     .880      108.000         1332.000   .802
            Wilks' Lambda          .658     .877      108.000         1250.817   .807
            Hotelling's Trace      .439     .875      108.000         1292.000   .812
            Roy's Largest Root     .148    1.819(b)    18.000          222.000   .024
age * edu   Pillai's Trace         .346     .754      108.000         1332.000   .970
            Wilks' Lambda          .695     .758      108.000         1250.817   .967
            Hotelling's Trace      .383     .763      108.000         1292.000   .964
            Roy's Largest Root     .159    1.960(b)    18.000          222.000   .013

a. Exact statistic
b. The statistic is an upper bound on F that yields a lower bound on the significance level.
c. Design: Intercept+age+edu+age * edu
What I would normally look at is Pillai's Trace, as it has been noted as more robust and appropriate. Nevertheless, many also use Wilks' Lambda. Note whether it is significant or not for the overall interaction and for each variable. In this case, none of the Pillai's Trace values is significant, indicating that there are no differences in how the service recovery variables are seen by age, education, and their interaction. Then you may look at the Tests of Between-Subjects Effects, which will indicate significance at individual levels. Running a Univariate analysis can corroborate this. This is followed by a table of the means and then by the post hoc test. In this case, you may note the significant variables highlighted by SPSS in the Bonferroni test. This is finally followed by the various plots that depict the interactions.

How to Present the Findings

MANOVA findings are usually presented as a sentence.
A MANOVA was carried out to determine if there was any interaction effect between age and education level with the various service recovery methods. Pillai's Trace for the variable Age (P = 0.194, F = 1.300, Sig = 0.120), Education Level (P = 0.400, F = 0.880, Sig = 0.802) and their interaction (P = 0.346, F = 0.754, Sig = 0.970) was not significant for the service recovery variables.
It could also be presented in a Table.

Table 1: Multivariate Tests for Service Recovery Variables by Age and Education Level

Effect                                        Value   F          Sig.
Age                     Pillai's Trace        .194    1.300      .120
                        Wilks' Lambda         .814    1.309(a)   .114
                        Hotelling's Trace     .220    1.318      .108
                        Roy's Largest Root    .163    1.978(b)   .012
Education Level         Pillai's Trace        .400     .880      .802
                        Wilks' Lambda         .658     .877      .807
                        Hotelling's Trace     .439     .875      .812
                        Roy's Largest Root    .148    1.819(b)   .024
Age * Education Level   Pillai's Trace        .346     .754      .970
                        Wilks' Lambda         .695     .758      .967
                        Hotelling's Trace     .383     .763      .964
                        Roy's Largest Root    .159    1.960(b)   .013

a Exact statistic
b The statistic is an upper bound on F that yields a lower bound on the significance level.
c Design: Intercept+age+edu+age * edu
Assignment
1. Open the file Example Loyalty. Now try by yourself to run a MANOVA from the above dataset for all the Loyalty variables by Age, Education, and Gender.
2. Prepare a Table to depict your findings.
How to Analyze: Regression

The general linear model can be used to analyze designs with categorical predictors (e.g., name, rank) and continuous predictors (e.g., interval scales, ratio scales). The purpose of running a simple regression is to test hypotheses about whether the predictor variables are related to the criterion variable. A simple regression equation is normally written as:

Y = b0 + b1X

Basically, there are three major types of regression models, which are known as standard regression, stepwise regression, and hierarchical regression.

Standard Regression
Simple linear regression is used to predict the impact of an independent variable on the values of a continuous or interval-scaled dependent variable. It shows the strength of the predictor variables, which helps in drawing better conclusions about the criterion variable. Open the file Example Regression.sav. Let's say you want to know the relationship between the predictors (affect, loyalty, respect, contribute) and soft tactic (management influence methods). In other words, you want to find out which of the independent variables is the best predictor of the use of soft tactic. Click on Analyze, Regression, Linear. The dialogue box will appear as follows.
Click on contribute, respect, affect and loyalty and move them to the Independent(s) box. Click on soft_T and move it to the Dependent box. The dialogue box will then appear as follows.
Then click on the Statistics button, and you will come to the Linear Regression Statistics box. Select Casewise diagnostics in the Residuals box. The purpose of using casewise diagnostics is to identify observations that fall outside the range of 3 standard deviations; these are considered outliers and should be excluded from further analyses. Click Continue.
If you plan to display the model in a scatterplot, standard practice is to place the independent variable on the X-axis and the dependent variable on the Y-axis. Click on Plots, which is located at the bottom of the Linear Regression box, and you will open the following box. Click on ZPRED on the left-hand side and move it to the X-axis, then click on ZRESID and move it to the Y-axis. Click on Normal probability plot in the Standardized Residual Plots box and click Continue.
You will return to the main Linear Regression box. Click on OK and you will obtain your Output. Refer to output std regressions.spo.
The first Table that you will see is Variables Entered/Removed(b), which shows the variables that have been used in this design and the method used.

Variables Entered/Removed(b)

Model   Variables Entered                          Variables Removed   Method
1       affect, respect, loyalty, contribute(a)    .                   Enter

a All requested variables entered.
b Dependent Variable: soft_T
The next Table shows the Model Summary, which indicates R. The important thing to note here is R Square, which measures the percentage of explanatory power of the independents used. Therefore, in this example, it is evident that the variables explain 10% of the variance in soft tactic.
Model Summary(b)

Model   R         R Square   Adjusted R Square   Std. Error of the Estimate
1       .323(a)   .104       .081                1.04953

a Predictors: (Constant), contribute, respect, loyalty, affect
b Dependent Variable: soft_T
The next table shows that the model is significant (p = .002), with an F value of 4.462.
ANOVA(b)

Model 1       Sum of Squares   df    Mean Square   F       Sig.
Regression          19.660       4   4.915         4.462   .002(a)
Residual           168.530     153   1.102
Total              188.190     157

a Predictors: (Constant), contribute, respect, loyalty, affect
b Dependent Variable: soft_T
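The Model Summary and ANOVA tables are two views of the same sums of squares. As a hedged illustration (Python, not SPSS syntax, with a function name invented for this example), the sketch below recovers R Square, Adjusted R Square, the Std. Error of the Estimate and F from the regression and residual sums of squares and their degrees of freedom.

```python
from math import sqrt


def regression_summary(ss_regression, df_regression, ss_residual, df_residual):
    """Return R Square, Adjusted R Square, Std. Error of the Estimate and F."""
    ss_total = ss_regression + ss_residual
    df_total = df_regression + df_residual
    ms_regression = ss_regression / df_regression
    ms_residual = ss_residual / df_residual
    r_square = ss_regression / ss_total
    adjusted = 1 - ms_residual / (ss_total / df_total)
    return {
        "r_square": r_square,
        "adjusted_r_square": adjusted,
        "std_error_estimate": sqrt(ms_residual),
        "f": ms_regression / ms_residual,
    }


# Figures from the ANOVA(b) table for soft_T.
summary = regression_summary(19.660, 4, 168.530, 153)
for name, value in summary.items():
    print(f"{name}: {value:.3f}")
```

Adjusted R Square is lower than R Square because it penalizes the model for the number of predictors relative to the number of cases.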
The unstandardized coefficient of an independent variable is shown in the B column; it measures the strength of the relationship between the predictor and the criterion variable. Because the variables can be measured on different scales, unstandardized coefficients cannot be compared directly with one another; the standardized coefficients (Beta) are used for that purpose. For example, we cannot compare the value for gender with the value for soft tactic.
Coefficients(a)

              Unstandardized Coefficients   Standardized Coefficients
Model 1       B        Std. Error           Beta                        t        Sig.
(Constant)    2.366    .569                                             4.158    .000
affect         .338    .180                  .256                       1.879    .062
loyalty        .123    .131                  .111                        .939    .349
respect        .292    .128                  .247                       2.288    .023
contribute    -.444    .171                 -.353                      -2.595    .010

a Dependent Variable: soft_T
The table above shows that respect and contribute each have a significant relationship with soft tactic at p < .05. The following is the Normal P-P Plot of Regression Standardized Residual. The rationale for showing the graph is to confirm the normality and linearity of the data.
[Figure: Normal P-P Plot of Regression Standardized Residual. Dependent Variable: soft_T. X-axis: Observed Cum Prob; Y-axis: Expected Cum Prob.]
How to Present the Findings

Standard Regression findings can be presented as a sentence. If in a sentence, I would say:
There was a significant relationship between respect and soft tactic (B = .292, p < .05).

Assignment
1. Open the file Example Regression.sav. Now try by yourself to run standard regression from the above dataset for all the independent variables (affect, loyalty, respect, and contribute) on rational_T.
How to Analyze: Exploratory Factor Analysis

Exploratory Factor Analysis (EFA) is done when one wants to know if there are any simple patterns among the variables studied. This is especially true when there are a lot of observed variables, from a minimum of 3 to as many as possible. Open the file Example Loyalty. Let's say we want to see if the data that we obtained on loyalty really reflects the theory that places it into three factors, that is Behavior, Attitude, and Cognition. Click on Analyze, Data Reduction, Factor. The following dialogue box will appear.
Select and transfer all the relevant loyalty items into the Variables box.
You will notice a few buttons at the bottom of the screen: Descriptives, Extraction, Rotation, Scores, and Options. Let's look at them now. Click on Descriptives. I would normally then click on KMO and Bartlett's test of sphericity. You may also click on Anti-image. Click on Continue.
Click on Extraction. I would normally click on Scree Plot and leave the other settings as they are. Do note that SPSS offers many extraction methods for factor analysis, but the usual one is Principal Components. Factor analysis also allows you either to extract as many factors as the Eigenvalues support or to pre-determine the number of factors that you require. In this case, let's leave it at Eigenvalues over 1. As for Iterations, the set limit here is 25. You can enter a higher number if you want to. Click on Continue.
Click on Rotation. I normally use Varimax. Nevertheless, as you may notice there are many other methods that can be used. Click on Continue.
Ignore Scores. Click on Options. For the Coefficient Display Format, I would normally click on both boxes, and I choose to suppress absolute values less than 0.4999. Nevertheless, there are those who argue for a lower cut-off, suppressing only values less than 0.3999. Click on Continue.
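To see what the two Coefficient Display Format boxes do to the output, here is a small Python sketch. It only mimics the idea (group items by the component they load on most, order them by size, and blank out small loadings); it is not SPSS's own routine. The function name format_loadings, the item names, and the loadings are all made up for illustration.

```python
def format_loadings(loadings, cutoff=0.4999):
    """Sort items by their strongest component and blank loadings below the cut-off."""

    def sort_key(entry):
        _, values = entry
        # Index of the component on which this item loads most strongly.
        strongest = max(range(len(values)), key=lambda i: abs(values[i]))
        # Group by that component, largest loading first within the group.
        return (strongest, -abs(values[strongest]))

    rows = []
    for name, values in sorted(loadings.items(), key=sort_key):
        shown = [v if abs(v) >= cutoff else None for v in values]
        rows.append((name, shown))
    return rows


# Made-up loadings for four hypothetical items on two components.
example = {
    "item_a": (0.31, 0.77),
    "item_b": (0.82, 0.22),
    "item_c": (0.64, 0.45),
    "item_d": (0.18, 0.85),
}

for name, shown in format_loadings(example):
    cells = ["     " if v is None else f"{v:5.3f}" for v in shown]
    print(name, *cells)
```

With a cut-off of 0.4999 each item in this example shows a single loading, which makes the later Rotated Component Matrix much easier to read.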
Click on OK. The output file should appear. See Output factor analysis.
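The first thing that you will note is the KMO and Bartlett's Test. There are arguments as to what level of KMO is acceptable. For me, the higher the better of course, but the absolute minimum is 0.60. Bartlett's Test of Sphericity tests whether the correlation matrix is an identity matrix, that is, whether the variables are uncorrelated. A significant result indicates that the variables are correlated enough for factor analysis.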
KMO and Bartlett's Test

Kaiser-Meyer-Olkin Measure of Sampling Adequacy.             .908
Bartlett's Test of Sphericity     Approx. Chi-Square     1812.110
                                  df                           55
                                  Sig.                       .000
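For those who wonder where the df of 55 comes from: Bartlett's Test of Sphericity has p(p - 1)/2 degrees of freedom for p variables, and with 11 loyalty items that gives 55. The Python sketch below shows the usual textbook form of the statistic, computed from a correlation matrix and the number of cases. It is only an illustration: the function names are my own, the small correlation matrix and the case count of 100 are made up, and the significance value would still need a chi-square table or SPSS itself.

```python
import math


def determinant(matrix):
    """Determinant by Gaussian elimination with partial pivoting."""
    a = [list(row) for row in matrix]
    n = len(a)
    det = 1.0
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(a[r][col]))
        if a[pivot][col] == 0:
            return 0.0
        if pivot != col:
            a[col], a[pivot] = a[pivot], a[col]
            det = -det
        det *= a[col][col]
        for row in range(col + 1, n):
            factor = a[row][col] / a[col][col]
            for k in range(col, n):
                a[row][k] -= factor * a[col][k]
    return det


def bartlett_df(n_variables):
    """Degrees of freedom: number of distinct correlations among the variables."""
    return n_variables * (n_variables - 1) // 2


def bartlett_sphericity(corr, n_cases):
    """Return (chi-square, df) for Bartlett's test on a correlation matrix."""
    p = len(corr)
    chi_square = -(n_cases - 1 - (2 * p + 5) / 6) * math.log(determinant(corr))
    return chi_square, bartlett_df(p)


print("df for 11 variables:", bartlett_df(11))

# Made-up 3 x 3 correlation matrix and case count, for illustration only.
example_corr = [
    [1.0, 0.6, 0.5],
    [0.6, 1.0, 0.4],
    [0.5, 0.4, 1.0],
]
chi_square, df = bartlett_sphericity(example_corr, n_cases=100)
print(f"chi-square = {chi_square:.3f}, df = {df}")
```

When the variables are completely uncorrelated the determinant is 1, its logarithm is 0, and the chi-square is 0; the more the variables correlate, the larger the statistic becomes.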
The next thing you will notice is the Anti-Image. Look at the Correlation box. Look for the Measure of Sampling Adequacy. Check that all elements on the diagonal of the matrix are greater than 0.5, which indicates that the sample is adequate. The next box is on Communalities. Then look at the Total Variance Explained box. This will tell you how many Components or Factors there are, and how many you should keep. Normally one will keep any component that has an Eigenvalue above 1. Sometimes there may be a lot of components with an Eigenvalue over 1, so keep those which together contribute 70% to 80% of the variance. The other way is by examining the Scree Plot, which will be discussed later. In this case, there are only 2 components with an Eigenvalue above 1.
Total Variance Explained

             Initial Eigenvalues
Component    Total    % of Variance    Cumulative %
1            6.406    58.240            58.240
2            1.155    10.501            68.741
3             .735     6.679            75.420
4             .485     4.410            79.829
5             .454     4.128            83.958
6             .426     3.874            87.832
7             .346     3.141            90.973
8             .324     2.942            93.915
9             .297     2.704            96.620
10            .192     1.748            98.368
11            .180     1.632           100.000

Extraction Method: Principal Component Analysis.
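The columns of the Total Variance Explained table can be rebuilt from the Eigenvalues alone. With 11 standardized variables the total variance is 11, so each component's share is its Eigenvalue divided by 11. The Python sketch below is only an illustration with function names of my own choosing; the Eigenvalues are copied from the table, and the results can differ from SPSS in the last decimal because the table figures are already rounded.

```python
# Eigenvalues copied from the Total Variance Explained table.
eigenvalues = [6.406, 1.155, 0.735, 0.485, 0.454, 0.426,
               0.346, 0.324, 0.297, 0.192, 0.180]


def variance_explained(values):
    """Return rows of (component, eigenvalue, % of variance, cumulative %)."""
    total_variance = len(values)  # one unit of variance per standardized variable
    rows = []
    cumulative = 0.0
    for number, value in enumerate(values, start=1):
        percent = 100.0 * value / total_variance
        cumulative += percent
        rows.append((number, value, percent, cumulative))
    return rows


def components_to_keep(values, minimum=1.0):
    """Apply the Eigenvalue-over-1 rule and return the component numbers kept."""
    return [number for number, value in enumerate(values, start=1) if value > minimum]


for number, value, percent, cumulative in variance_explained(eigenvalues):
    print(f"{number:>2}  {value:6.3f}  {percent:7.3f}  {cumulative:8.3f}")

print("Components with Eigenvalue over 1:", components_to_keep(eigenvalues))
```

The same rows also let you apply the 70% to 80% guideline: read down the cumulative column and stop at the first component that reaches the level you have chosen.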
Let's look at the Scree Plot. The rule is to keep all factors before the elbow. In this case, the first two components are acceptable.
[Figure: Scree Plot. Horizontal axis: Component Number (1 to 11); vertical axis: Eigenvalue.]
Then look at the Rotated Component Matrix and try to name the components based on the variables shown.
Rotated Component Matrix(a)

Component 1   Component 2   Item
.834                        This restaurant is the first choice in your mind when you consider having dinner outside.
.806                        Assume that you have only three choices when you are in need of having dinner, this restaurant must be one of them.
.784                        You have regularly dined at this restaurant for a long period of time.
.709                        You will keep dining at this restaurant, regardless of everything being changed somewhat.
.662                        You have strong preference on this restaurant.
.640                        You will continue to dine at this restaurant even if the price or service charge is increased somewhat.
              .847          You have recommended other people to patronize this restaurant.
              .828          There is a high probability that you will dine at this restaurant again.
              .772          You will say positive thing to other people about the service provided by this restaurant.
              .764          You will give positive feedback to this restaurant.
              .639          You will try the new food or drinks that are recommended by this restaurant.

Extraction Method: Principal Component Analysis. Rotation Method: Varimax with Kaiser Normalization.
a Rotation converged in 3 iterations.
Note: You will need to do a reliability test for your findings. This can be done by clicking on Analyze, Scale, Reliability Analysis. The dialogue box will appear.
Select all the variables in the first component of your factor analysis and transfer them to the box named Items. Make sure the Model selected is Alpha. Click on OK. The following output will appear. See Output Reliability.
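For those curious about what the Alpha model computes, Cronbach's Alpha compares the sum of the individual item variances with the variance of the total score. The Python sketch below shows the standard formula; it is an illustration only, the function name cronbach_alpha is my own, and the scores are made up rather than taken from Example Loyalty.

```python
from statistics import variance


def cronbach_alpha(item_scores):
    """Cronbach's Alpha for a list of items, each a list of respondents' scores."""
    k = len(item_scores)
    # Sum of the variances of the individual items.
    item_variance_sum = sum(variance(scores) for scores in item_scores)
    # Variance of each respondent's total score across all items.
    totals = [sum(answers) for answers in zip(*item_scores)]
    return (k / (k - 1)) * (1 - item_variance_sum / variance(totals))


# Made-up answers from five respondents to three hypothetical items.
example_items = [
    [4, 5, 3, 2, 4],
    [4, 4, 3, 2, 5],
    [5, 5, 2, 3, 4],
]
print(f"Cronbach's Alpha = {cronbach_alpha(example_items):.3f}")
```

The closer the items move together across respondents, the closer the figure gets to 1.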
The first box will detail the case summaries. The second box will detail the Cronbach's Alpha figure for the number of variables that you have keyed in. In this case, the reliability is quite high at 0.897.
How to Present the Findings
Factor Analysis is usually presented in a Table, with some further explanations. It normally also includes the Reliability of the various components.
A factor analysis was carried out on the various variables that describe loyalty. The initial Kaiser-Meyer-Olkin (KMO) measure was 0.908, which indicates that the analysis is meritorious. Bartlett's test is significant (Chi Square = 1812.11, p < 0.001). Two components were extracted with a cumulative variance of 68.74%. Please refer to Table 1.
Table 1: Rotated Component Matrix for Loyalty

Attitude and
Cognition      Behavior      Variable
.834                         This restaurant is the first choice in your mind when you consider having dinner outside.
.806                         Assume that you have only three choices when you are in need of having dinner, this restaurant must be one of them.
.784                         You have regularly dined at this restaurant for a long period of time.
.709                         You will keep dining at this restaurant, regardless of everything being changed somewhat.
.662                         You have strong preference on this restaurant.
.640                         You will continue to dine at this restaurant even if the price or service charge is increased somewhat.
               .847          You have recommended other people to patronize this restaurant.
               .828          There is a high probability that you will dine at this restaurant again.
               .772          You will say positive thing to other people about the service provided by this restaurant.
               .764          You will give positive feedback to this restaurant.
               .639          You will try the new food or drinks that are recommended by this restaurant.
6.406          1.155         Eigenvalue
58.240         10.501        % of Variance
58.240         68.741        Cumulative % of Variance
0.897          0.890         Reliability

Extraction Method: Principal Component Analysis. Rotation Method: Varimax with Kaiser Normalization.
a Rotation converged in 3 iterations.

Assignment
1. Open the file Example Loyalty. Now try by yourself to run a Factor Analysis from the above dataset for all the Loyalty variables but to limit the components to 3.
2. Prepare a Table to depict your findings.
How to Analyze: Confirmatory Factor Analysis
Confirmatory Factor Analysis (CFA) is normally done through the use of another computer program called AMOS. It runs on SPSS data. You can open the file in the same way as in SPSS, except that you will need the appropriate software. AMOS is basically software for drawing models rather than for keying in data. This means that you must draw the relevant model on the canvas provided, link it to the data that you have (normally from SPSS), and then run the analysis. Once you have opened the file, it should appear as follows.
On the left-hand side are the shortcut buttons for the various tasks that you have to do. In the middle is the analysis output, and on the right side is your canvas. Let's say we want to take the findings from our earlier Exploratory Factor Analysis (EFA) and run a CFA. The first thing to do is to have the EFA data on hand and transfer it to AMOS. Go to Select Data Files and click on it. The following dialogue box will appear.
Click on File Name and the following box will appear. Select the database that you wish to use. In this case, select Example Loyalty.
The following should be displayed. Click OK.
The data is now recorded and saved for later use. Then go back to the earlier EFA discussion and look at the findings. It is displayed here.
Table 1: Rotated Component Matrix for Loyalty

Attitude and
Cognition      Behavior      Variable
.834                         This restaurant is the first choice in your mind when you consider having dinner outside.
.806                         Assume that you have only three choices when you are in need of having dinner, this restaurant must be one of them.
.784                         You have regularly dined at this restaurant for a long period of time.
.709                         You will keep dining at this restaurant, regardless of everything being changed somewhat.
.662                         You have strong preference on this restaurant.
.640                         You will continue to dine at this restaurant even if the price or service charge is increased somewhat.
               .847          You have recommended other people to patronize this restaurant.
               .828          There is a high probability that you will dine at this restaurant again.
               .772          You will say positive thing to other people about the service provided by this restaurant.
               .764          You will give positive feedback to this restaurant.
               .639          You will try the new food or drinks that are recommended by this restaurant.
6.406          1.155         Eigenvalue
58.240         10.501        % of Variance
58.240         68.741        Cumulative % of Variance
0.897          0.890         Reliability

Extraction Method: Principal Component Analysis. Rotation Method: Varimax

From here we can see that there are two components. The first has 6 variables and the second has 5. This means that we must draw on our canvas a model that reflects this situation. Select Draw a Latent Variable by clicking on it once. Move your cursor to the canvas. Your cursor should appear as the Draw a Latent Variable symbol. Click once. The following should appear.
Click again. The following should appear. You should remember that the first component had 6 variables and the second component had 5 variables. Continue drawing.
You should get this.
Then continue in the same way for the next component.
I would normally save the file at this stage. Click on File, Save As. The dialogue box will appear as below. Save the document in an appropriate folder (create a new folder for it).
Now, let's make this drawing look good. I will click on Select all objects. You will notice that the drawings have changed color to blue.
Click on Rotate the indicators... . Click once on the top drawing. This will happen.
So click on all of them until the drawing appears as such.
Then click on Resize the path diagram... . Everything will come into place in the middle of your canvas.
Click on Deselect all objects. The drawing will now return to black.

