Professional Documents
Culture Documents
1. Assume that we collect all 10 SNPs and the minor allele frequency (MAF) of SNPs 1 to
5 is 0.3 and MAF of SNPs 6 to 10 is 0.15. Assume that the relative risk of one of them is
2.0 (we do not know which one). Assume that we are collecting 100 case and 100 control
individuals. With = 0.05, what is the power of this association study?
2. Use the greedy algorithm to find a minimum set of tag SNPs with a r 0.7. Please show
your work by drawing graphs before and after you choose each tag SNP.
3. Use the tags you find in (2). Assume that the relative risk of one of tag SNPs in the
greedy solution is 2.0 (we do not know which one). Assume that we are collecting 100
case and 100 control individuals. With = 0.05, what is the power of this association
study?
4. The greedy solution for finding the minimum set of tag SNPs is not the optimal solution.
What is the optimal solution? Please show your work by drawing graphs before and after
you choose each tag SNP.
5. Use the tags you find in (4). Assume that the relative risk of one of tag SNPs in the
optimal solution is 2.0 (we do not know which one). Assume that we are collecting 100
case and 100 control individuals. With = 0.05, what is the power of this association
study?
SNP B
A
a
A
a
a
A
A
A
a
A
2. Assume the causal SNP is B, but we collect SNP A. Assume that true case probability
and and true control probability are 0.4 and 0.5 respectively at SNP B. If we collect 500
case and 500 control individuals and have a significance threshold of 0.05, what is the
power at SNP A? (Note : Use the correlation that you get from above question)