Statistics and Data Presentation

NAME: ___Joel Eisenschmdit
Short Answer

1. What is a standardized mortality ratio (SMR) and when should it be utilized.
SMR is a ratio between the observed number of deaths in an study population and the number of death
would be expected, based on age and sex.
2.Differentiate between the use of the terms prevalence and incidence when researching disease.
Prevalence refers to the number of existing cases of disease. Incidence refers to number of new cases
of disease
Completion
3. The ___MEDIAN____ denotes the middlemost value when the values are arranged in numerical order.
4. _______MODE_______ is a measure of how often data are classified in the same manner.
5. The _____RANGE is the difference between the highest and lowest values.
6. The _______VARIANCE_________ demonstrates how values are spread around the mean and is the square

root of the variance.
Short Answer

7. What type of statistical data is shown below?
ANSWER: Interval

8. What was the relative frequency for the data in the 10,000-14,999 interval?

ANSWER: 16.5

9. What type of data is shown below?

ANSWER:
Nominal

10. Which fiscal agent provided more covered care for females than for others?

ANSWER: Medicare
11. In which column is the ratio specified in the table below?

ANSWER: A

12. What is the mean LOS for patients having community-acquired viral pneumonia?

ANSWER: 11.8

13. What is the median LOS for patients having nosocomial viral pneumonia?

ANSWER: 14.6

14. What type of data is shown below?

ANSWER: Nominal

15. Which hospital appears to be located in the suburbs?

ANSWER:Hostpial D
16. Which hospital drew the fewest patients from the city?

ANSWER: Hosptial C
17. Which hospital had the fewest admissions for September?

ANSWER:Hospital C 28(19%)

18. What type of data is shown below?

ANSWER: Ordinal

19. It appears from the information that the leadership perceptions of the majority of students improved

as a result of their clinical internship in management. Do you agree or disagree? Explain.
ANSWER:
Yes I agree. There is 0 very weaks compared to five and 3 very strong compared to 17 (49%) after
the internship.

20. What would be the numerator for calculating the incidence rate of cardiovascular disease in the data

shown below?
ANSWER:1000

PROJECT ASSIGNMENT: DATA IMPORT INTO SPSS OR PSPP
Your second task this week is to get experience loading/importing, summarizing, and performing some simple
statistical comparison using a small data sample from the Cleveland Heart Clinic. This is worth 10 points.
Step 1: Download the comma-delimited text file HeartData.txt from Blackboard. This contains a set of data
on various health measures for a group of men and women see in the clinic for chest pain.
Step 2: Using SPSS or PSPP or a suitable alternative, import the file, using the first row of text as the
column/variable names
Step 3: Perform a descriptive summary statistic function on ALL subjects for the AGE, Systolic Blood
Pressure, and Cholesterol variables and REPORT THE FOLLOWING SUMMARY STATISTICS FOR
EACH VARIABLE:
Total number
Mean
Minimum
Maximum
Range
Standard Deviation
Variance
Skewness
Step 4: Perform a t test for two independent samples to determine if Males and Females have different
cholesterol levels and REPORT THE T STATISTIC AND P VALUE HERE:

Step 5: Calculate a Pearson Correlation coefficient to examine the possible relationship between AGE and
Cholesterol, and REPORT THE R VALUE AND P VALUE HERE:

See the video tutorials and web sites for SPSS and/or PSPP for more information on performing these types of
data import and analysis functions.

EXTRA CREDIT OPPORTUNITY:
Repeat steps 1-3 for any variable of your choosing, using a different dataset that you obtain from another
biomedical/health database, such as the ones you or your classmates found and blogged about last week.
You need to submit two paragraphs. The first should describe the data source and variables, including a web
site link, and the second paragraph should describe and detail the summary statistics.
Some databases make their data available already in SPSS format, while others provide it in comma- or tabdelimited text format or Microsoft Access format.
Up to 5 bonus points will be added to your Assignments grade.

AGE

Descriptive Statistics

Age
Valid N
(listwise)

N
Range Minimum Maximum Mean
Statistic Statistic Statistic Statistic Statistic
303
48
29
77
54.37
303

Std.
Deviation
Variance
Skewness
Statistic
Statistic Statistic Std. Erro
9.082
82.485
-.202
.14

Systolic blood pressure

Descriptive Statistics

Systolic
Valid N
(listwise)

N
Range Minimum Maximum Mean
Statistic Statistic Statistic Statistic Statistic
303
106
94
200 131.62
303

Std.
Deviation
Variance
Skewness
Statistic
Statistic Statistic Std. Erro
17.538 307.586
.714
.14

Cholesterol

Descriptive Statistics

BP
Valid N
(listwise)

N
Range Minimum Maximum Mean
Statistic Statistic Statistic Statistic Statistic
303
438
126
564 246.26
303

Std.
Deviation
Statistic
51.831

Variance
Skewness
Statistic Statistic Std. Er
2686.427
1.143
.1

Step 4
I could not get the cholesterol test to give me back and results so I used Blood Pressure instead.

Group Statistics

BP

Sex
male
female

N
207
96

Mean
239.29
261.30

Std.
Deviation
42.782
65.089

Independent Samples Test
Levene's Test for Equality of Variances

t-test for Equality of Means

F
Sig.
t
df
Sig. (2-tailed)
95% Confidence Interval of the Difference
BP Equal variances assumed
Equal variances not assumed

12.297 .001

Std. Error
Mean
2.974
6.643

Mean Difference

Lower Upper
-3.503 301
.001
-22.012 6.284
-3.024 134.395.003
-22.012 7.278

Step 5.

Correlations
Age

Std. Error Difference

Cholesterol
Age
Resting
1
.214**
.000
24910.337 30376.693

Pearson Correlation
Sig. (2-tailed)
Sum of Squares and
Cross-products
Covariance
82.485
100.585
N
303
303
**
CholesterolResti Pearson Correlation
.214
1
ng
Sig. (2-tailed)
.000
Sum of Squares and 30376.693 811300.878
Cross-products
Covariance
100.585
2686.427
N
303
303
**. Correlation is significant at the 0.01 level (2-tailed).

-34.378 -9.646
-36.407 -7.617