Professional Documents
Culture Documents
Answers
1. Code book
- Codebook provides diverse information, including the type of variable, range,
frequent values, amount of missing. When we use code book in stata, it can describes
very detail content of data sets. I will use Framingham data to make a code book
here.
- I will use command in stata chat to use “codebook sysbp or “ codebook hdlc3” and
etc” to looking for about the missing values and total sample size from each variables
that I want.
- For example the variable sex3 in a stata windows were told to us about the missing
values calculated 1.171 data and the total sample size were calculated 4.434. and also
the others variables for instance total cholesterol variable showed to us about the total
missing values that is 1.385 data were missed from 4.434 total data sample size .
2. Describe
- The describe command given to us information about how the variable is stored in
Stata and to describe a data set about the number observation that we have, size data,
total variables, value labels from each variables and give us information about the
whole type variables that we have.
- I will use command in stata command chat box with “describe”
- There is 74 Variables in a Framingham data and the whole number observation from
this data were calculated 4,434.
Table 2. Contain data information about observation, size and total variables in
a whole framingham data
No Describe Contain Data
1 Number 4,434
Observation
2 Size 900,102
3 Total Variables 74
4 Variables types Randid, death, angina, hospmi, stroke, cvd,
hyperten,timeap, timemi,age1,diabetes1, sex1,bmi1,
hearrate,timechd,
timehyp,cursmoke2,cigpday2,hdlc1,ldlc1, and so on
until 74 variables.
3. Summarize
- Summarize in a stata is the basic descriptive statistics command in Stata
is summarize, which calculates means, standard deviations, and ranges.
- I was write in a chat box stata command to show some values about mean and standar
deviation , minimum and maximum in each variables with “Summarize”
Incident Event
1 Angina 264 264
2 Stroke 133 150
3 Incident Hospitalized 213 95
139,25 ± 21,1 140,92 ± 24,13
Systolic Blood
4 Pressure 1
5 Body Mass Index 26.22 ± 3.49 25.65 ± 4.45
6 Total cholesterol 26.22 ±3.49 25.65 ± 4.45
7 HDL Cholesterol 43.7 ± 13.2 53.6 ± 15.90
(mg/dl)
8 LDL Cholesterol 170.54 ± 44.65 180.94 ± 47.99
(mg/dl)
9 Current Smoker 539 582
Interpretation :
We know about our data that we had in a table 4 while in this Framingham data
examination 3 we have a total sample size around 3.263 people and we divided into two
group that are 1.387 men and 1.876 woman. And I want to analyses and describe a little
bit from each variable.
1. Angina The incident from the population who suffered with angina disease were
included in a group man population that is 264 people but 1.123 people don’t have an
incident angina pectoris meanwhile in a woman group population was 264 people too
who suffered with angina and the other hand 1.612 people not have an incident
angina pectoris.
2. Stroke The incident of stroke in this study in a man population was 133 people but
1.254 people don’t have an incident stroke and then the incident of stroke in a group
woman were calculated 150 people vice versa with 1.726 woman don’t have an
incident stroke disease.
3. Incidence hospitalized Were identified the incidence hospitalized in this study
from the people who suffered with cardiovascular disease that is 213 people in a men
group were hospitalized due to cardiovascular disease but 1.174 people don’t need
hospitalized and then 95 people in a woman group, meanwhile 1.781 people don’t
need too hospitalized among female group.
4. Current smoker The incident from the population who have a risk behaviour that
were doing current smoker in this study include in a group man population that is
539 people were smoker but 848 people non smoker. Meanwhile in a woman group
population was 582 people too who have a risk behaviour to get a cardiovascular
disease from were doing smoker habits but 1.294 non smoker .
5. Body Mass Index the mean value and standard deviation from the variable body
mass index were calculated in a man population is 26.22 ± 3.49 and in a woman
population is 25.65 ± 4.45. the value of standard deviation in a 2 group population
was smaller rather than the mean value its signified that the variable of body mass
index tend to have a homogen data. The mean or average body mass index in a man
population is 26.2 and then the average body mass index in woman population is
25.65.
6. Systolic blood pressure the value mean and standard deviation from the variable
systolic blood pressure were calculated in a man population that is 139.25 ± 21.1 and
in a woman population that is 140,92 ± 24.13. The value of standard deviation in a 2
group population tend to were smaller rather than the mean value its signified that the
variable of systolic blood pressure tend to have a homogen data. Other than that, the
mean or average systolic blood pressure in a man population around 139.25 and then
the average value systolic blood pressure in woman population is 140.92.
7. HDL Cholesterol the value mean and standard deviation from the variable HDL
cholesterol were calculated in a man population that is 43.7 ± 13.2 and in a woman
population that is 53.6 ± 15.90. The value of standard deviation in a 2 group
population tend to were smaller rather than the mean value its signified that the
variable of HDL cholesterol tend to have a homogen data. Other than that, the mean
or average systolic blood pressure in a man population around 43.7 and then the
average value HDL cholesterol in woman population is 53.6.
8. LDL Cholesterol the value mean and standard deviation from the variable LDL
cholesterol were calculated in a man population that is 170.54 ± 44.65 and in a
woman population that is 180.94 ± 47.99. The value of standard deviation in a 2
group population tend to were smaller rather than the mean value its signified that the
variable of LDL cholesterol tend to have a homogen data. Other than that, the mean
or average systolic blood pressure in a man population around 170.54 and then the
average value LDL cholesterol in woman population is 47.99.
.02
.015
Density
.01.005
0
Male Female
300
200
Frequency
100
0
d. Histogram for variable systolic blood pressure 3 (sysbp3) for each sex category
Male Female
200
150
Frequency
100
50
0
mean: 236.713
std. dev: 44.4495
mean: 60.6482
std. dev: 8.29677
. summarize sex3 totchol3 age3 sysbp3 diabp3 cursmoke3 cigpday3 bmi3 diabetes3
5. Tabulate incident event (Angina, Stroke, incident hospitalized and current smoker) based
on Sex3.
Incident Angina
Sex, exam Pectoris
3 No Yes Total
Incident Stroke
Sex, exam Fatal/non-fatal
3 No Yes Total
Incident Hospitalized
Sex, exam MI
3 No Yes Total
6. Tabulate Body Mass Index, Systolic blood pressure, Total cholesterol, LDL Cholesterol,
HDL Cholesterol based on Sex3.
. tabulate sex3, sum( bmi3)