Professional Documents
Culture Documents
THE CONCEPT OF
WEIGHTING
Many Slides courtesy of ICF Macro Intl.
Samson Olusina Bamiwuye (VISITING SCHOLAR , DPS , WITS UNIVERSITY)
November, 2013
ISSUES IN DATA USAGE (courtesy of Kofi
Awusabo-Asare)
Lies
Dammed lies
Statistics
10, 40, 100, 500, 2000 Which measure of central
tendency will you use if
Above is the salary of you were:
An employer ?
five workers in an
A Trade Union official?
establishment
Give reasons for your
choice of method
Calculate Application
(courtesy of Kofi Awusabo-Asare)
provide an overview of DHS Data
gain practical understanding of the correct
analysis of data.
Explore other MEASURE DHS Resources
6
Most slides were obtained from 2010 DHS Fellow
Workshop in Calverton, USA and 2012 DHS
Fellow Workshop in Uganda.
Three slides from 2012 Workshop on Analysis of
factors
These data are useful in identifying higher-risk and vulnerable
18
The plan for data processing and analysis
must be made after careful consideration of
the objectives of the study as well as of the
tools developed to meet the objectives.
The procedures for the analysis of data
19
When making a plan for data processing and
analysis the following issues should be
considered:
Sorting data,
Performing quality-control checks,
Data processing, and
Data analysis.
20
When the plan for data analysis is being
developed the data, of course, is not yet available.
However, in order to visualise how the data can be
organised and summarised it is useful at this
stage to construct DUMMY TABLES.
A DUMMY TABLE contains all elements of a real
21
Age Frequency Percentage
15-19
20-24
25+
Total 100.0
22
RESIDENCE Currently Not currently Total
using any using any
method of contraceptive
contraceptive
N (%)
N (%)
Urban
Semi-Urban
Rural
Total
Chi-square= ****; df= ********; p< *****
23
NEVER DO ANALYSIS when:
You have not read DHS Documentation Guides,
values
ENSURE:
You properly registered for data access on
http://www.measuredhs.com/data/Access-In
structions.cfm
DHS recode manual = DHS analysis bible
Download at:
http://
www.measuredhs.com/publications/publicati
on-dhsg4-dhsquestionnaires-and-manuals.c
fm
DHS Final Reports provide a wealth of
descriptive statistics about the most commonly
used indicators; they also provide sample sizes
(denominators) for calculating them
You always need to make sure that your sample
3. Correct weights?
4. Correct recoding – handling of special values
5. Correct variables? – check the recode manual
6. Correct tabulation? (e.g. row vs. column
percent)
Read the table heading!
Check your denominators
Men’s weight
weighting
Don’t forget to
Unit of analysis Variable
divide by
Households hv005
1,000,000!
Women or
children v005
Domestic Violence d005
In Stata:
Men mv005
gen wgt=v005/1000000
HIV test results hiv05
• Make sure you can match your results to the Final
Report tables
– A huge advantage of DHS data is that you can almost
always check your work against the FR tabs
– generate hivwgt=hiv05/1000000
– tab hiv03 v025 [iw=hivwgt]
• iw is iweight, or “importance” weight
Using SVY,
the
relationship is
no longer
significant at
the 95% level
• For analyses where you do need significance
testing or confidence intervals
• We need to tell Stata we’re using survey data so
that Stata takes the sample design into account
when calculating standard errors
• General format:
– svyset [pw=weight], psu(cluster)
strata(strata)
o pweight is a sampling weight
• To tabulate with a confidence interval:
– svy: tab var1 var2, ci
• Use help svy for lots of additional information
and explanation!
• To look at the standard error of education
levels among women:
– svy: tab v106, se
• To look at the confidence interval of education
levels among women by urban/rural:
– svy: tab v106 v025, col ci
• To run a Pearson’s chi-squared test
(approximation) to see if levels of education
among women are statistically significantly
different by urban/rural:
– svy: tab v106 v025, col pearson
The ‘Rule of Thumb’: Use the weight from the
smaller sample.
Stata provides two ways to analyze survey data
such as the DHS data.
The survey Commands
The preferred way is to use the family of
Health Terms
Module 3: Indicators and the DHS
Module 4: Steps in Conducting a DHS Survey
Module 5: Understanding DHS Tables and
Figures
The DHS STATcompiler www.STAcompiler.com
The DHS STATmapper www.STATmapper.com
HIV/AIDS Survey Indicators Database
www.measuredhs.com/hivdata
HIVmapper: www.HIVmapper.com
Facilitators Guide:
Sign up today for email alerts to receive