Professional Documents
Culture Documents
Week 14
Final Exam
Prev
Home
o
40 questions
Quiz • 30 min
Final Exam
Submit your assignment
Due DateJan 25, 7:59 AM CET
Attempts3 every 8 hours
Receive grade
To Pass80 % or higher
Grade
—
Final Exam
Graded Quiz • 30 min
Final Exam
Total points 40
1.
Question 1
你想查询一个已知的蛋白质的三维结构是否已经被解析出来了,应该去访问的
数据库是
To which of the following databases should you refer in order to find out whether a
known protein has already had its 3D structure resolved?
1 point
OMIM
HGMD
dbGAP
PDB
2.
Question 2
Which of the following qualities of sequencing denotes the lowest sequencing error
rate?(single character recorded in phred33)
1 point
15
30
3.
Question 3
BAM 格式中不包括的信息有哪些
读断序列
读段比对程序的名字
读段的名字
读段的比对结果
4.
Question 4
高通量测序技术的序列回帖算法思想最类似以下哪种?
To which of the following algorithms is the reads mapping algorithm applied in high-
throughput sequencing technique most similar with respect to their basic ideas?
1 point
Smith-Waterman 局部比对
Kruskal 最小生成树算法
BLAST 索引和数据库搜索
5.
Question 5
下列哪一种测序仪不是高通量测序仪?
1 point
6.
Question 6
以下不属于生物信息学研究内容的是
基因组数据挖掘
基因组序列比对技术
构建系统发育树
动作和手势识别比对技术
7.
Question 7
下列关于替换矩阵的说法哪些是正确的
Which of the following statements are correct with respect to substitution matrix? (2
correct options)
1 point
一种替换在自然界中越容易发生,则这种替换在打分矩阵中对应的数值越小
The easier it is for a particular substitution to happen in the real world, the smaller
score this substitution has in the scoring matrix
The substitution matrix is always a matrix that is symmetric with respect to its main
diagonal
改变替换矩阵不会影响序列比对结果
现在人们已经找到了序列比对时最好的打分矩阵
Now people have found the best scoring matrix for sequence alignment
替换矩阵的值由且仅由经验公式决定
替换矩阵的值反应了碱基间的相似程度
The values in substitution matrix denote the similarities between bases
8-10
8.
Question 8
1 point
Needleman-Wunsch alone does not fit for next-generation sequencing data analysis,
while Smith-Waterman alone is suitable for that
Smith-Waterman alone does not fit for next-generation sequencing data analysis,
while Needleman-Wunsch alone is suitable for that
单独使用 Smith-Waterman 算法和 Needleman-Wunsch 算法均不适合用于高通量
测序数据分析
Needleman-Wunsch can have only one optimal solution, while Smith-Waterman can
have multiple optimal solutions
Needleman-Wunsch finds the locally optimal result, while Smith-Waterman find the
globally optimal result
Smith-Waterman 算法更适用于寻找两个蛋白序列之间相似的功能域
Smith-Waterman algorithm is more suitable for finding similar function domains from
two protein sequences
9.
Question 9
大规模进行数据比对时不采用动态规划算法的最主要原因
What is the main reason that the dynamic programming algorithm is NOT used for
large-scale alignments?
1 point
消耗内存大
结果不稳定
结果不准确
算法不可靠
可重复性差
编程难度大
Difficult to program
运算速度慢
It runs too slow
10.
Question 10
BLAST 有关说法中正确的有哪些
1 point
BLAST 屏蔽低复杂度区域的步骤没有作用,可以省略
The step of masking low-complexity regions in BLAST is useless and can be skipped
BLAST 适合对高通量数据进行拼接
BLAST 一定能找到最优解
BLAST 是目前最快的序列比对算法
BLAST 运行较比动态规划算法速度慢
1-3-4-6 1-3-4
11.
Question 11
When doing sequence alignment to the same sequence, how many times is the
theoretical computational overload of tblastx as big as that of blastn?
1 point
1/6
1/4
1/3
12
2
1/2
1/5
1/36
36
1/24
24
12.
Question 12
1 point
输入序列数量
序列的名称
种子字长
屏蔽或不屏蔽低复杂度区域
1-2-4 2-3-5
13.
Question 13
针对下图的说明中错误的是
Which of the following statements is NOT correct with respect to the figure below?
1 point
Given such an HMM, we can observe three values at each state: a, b, and c
There are in total 10 different state paths that starts from 1, ends at 3, and can generate
the token sequence "abccc"
14.
Question 14
各转移概率和生成概率如下表,则存在问题的一组是
The transition probabilities and emission probabilities are given below. Then which of
the following statements is NOT correct?
1 point
生成概率的 c 行
生成概率的 n 行
转移概率的 c 行
转移概率的 n 行
转录本分析中测定转录本表达水平的“金标准”(Gold Standard)是
What is the gold standard for quantifying the expression level of transcripts in
transcript analysis?
1 point
RNA-seq
表达序列标签
实时荧光定量 PCR
基因芯片 microarray
固相捕获
16.
Question 16
Which of the following statements are wrong with respect to the Split reads strategy
used in reads mapping in RNA-Seq?
1 point
该方法可以将所有读断定位到基因组上
该方法不能发现新的外显子
该方法能够发现新的剪切体
This method is always used together with the "join exon" method
该方法运行速度较慢
1-2-3 2-4-5
17.
Question 17
As shown in the figure below, the Transcript 1 has its expression level being 20 and
Transcript 2 has its being 30. Then what are the expression levels of Exon 1 and 2,
respectively?
1 point
40, 40
600, 30
40, 30
40, 50
20, 30
50, 30
30, 50
10, 50
18.
Question 18
已知 RNA-Seq 测序数据回帖后在某个基因区间的情况如下图所示(请仔细观察
图片,不同尝试图片可能会变)
Assume that the RNA-Seq reads are mapped back to part of a gene as shown
below(please check the picture carefully, the picture may change in different trial)
则该基因至少有几种转录本?
Then what is the minimum number of transcripts this gene could have?
1 point
19.
Question 19
在上一题中,该基因最多有多少个转录本?(假设所有转录本均已被测到)
In the previous question, what is the maximum number of transcripts this gene could
have? Assume that all the transcripts of this gene have been sequenced
1 point
3
6
20.
Question 20
下面关于长非编码 RNA(lncRNA)的说法,正确的是哪些
Which of the following statements are correct with respect to long noncoding RNAs
(lncRNAs)?
1 point
lncRNA 都没有功能
lncRNA 上没有外显子和读码框
3-4-5 3-4-5-6
21.
Question 21
Which of the following statements is NOT correct with respect to the identification of
noncoding RNAs?
1 point
选择合适的特征组合可以提高鉴定的准确率
可以鉴定出所有的非编码 RNA
可以利用序列的二级结构信息来鉴定非编码 RNA
The higher the LOG-ODD score is, the more reliable the ORF result would be
可以利用序列碱基保守性信息鉴定非编码 RNA
2-3-4 2-4-6
22.
Question 22
Assume that the probability that an error occurs in a trial is 0.2, and all trials are
independent of each other. Then what is the probability that, in three trials, there are at
least two of them that have an error occur?
1 point
0.040
0.096
0.148
0.006
0.104
0.084
23.
Question 23
We use Bonferroni Correction to set an upper bound of 0.05 for the value of the
probability that the Type I error occurs in a trial where 50000 genes are compared.
Then all the p-values of significant genes should be smaller than ____
1 point
1.0e-6
0.01
0.005
0.05
0.1
24.
Question 24
Which of the following classes of GO does the "vitamin transporter acitivity" belong
to?
1 point
Biological Regulation
Biological Component
Molecular Process
Cellular Function
Molecular Function
Biological Function
Molecular Regulation
Biological Process
Cellular Component
Cellular Process
25.
Question 25
1 point
It catalyzes the reaction where Serine and Glycine are transformed into each other
It catalyzes the reaction where Threonine and Glycine are transformed into each other
It catalyzes the reaction where Serine and Pyruvate are transformed into each other
26.
Question 26
Assume we get the gene list below in an analysis(in Entrez Gene ID format)
498
506
509
513
514
515
516
517
518
521
522
539
4508
4509
9551
10476
10632
27109
Then what is the most enriched KEGG pathway given by KOBAS (with all
parameters set to default)?
KOBAS: http://kobas.cbi.pku.edu.cn/
1 point
Dravet syndrome
Carnitine shuttle
Metabolic pathways
Huntington's disease
Option text
Alzheimer's disease
Oxidative phosphrylation
Beta oxidation
27.
Question 27
For the gene list given in the previous question, what is the most enriched GO term
given by KOBAS (with all parameters set to default)?
1 point
chemosynthesis
organelle envelope
cellular respiration
oxidative phosphorylation
photophosphorylation
hydrogen transport
28.
Question 28
蛋白质结构域方面的信息可以从下列哪个中查到?
From which one can one find information about protein motifs?
1 point
PolyPhen-2
SIFT
InterPro
SOAP
BLAT
MEGA
IntAct
DAMBE
KOBAS
29.
Question 29
1 point
物种分类层级关系
蛋白质结构
Protein structure
基因注释信息
Gene annotation
蛋白质序列
Protein sequence
基因组序列
Genome sequence
生命科学相关图书
NCBI 网站的培训视频和教学指导
基因型-表型 关联数据
药物设计和靶点信息
生命科学和医学相关文献和相关资源链接
30.
Question 30
UCSC 提供了下列哪些有用的工具?
1 point
BLAST
BatchPrimer3
MEME Suite
ClinVar
MedGen
ClustalW2
SIFT
PolyPhen-2
In-Silico PCR
Genome Browser
Blat
31.
Question 31
GO 的拓扑结构是?
1 point
双向星型结构
bi-directional star
双环图
dual-ring graph
层次树
Hierarchical Tree
有向无环图
无向有环图
Undirected Tree
总线结构
daisy-chain
32.
Question 32
世界上第一个被发现的新基因是
1 point
Jingwei 基因
Jingwei gene
Hun 基因
Hun gene
BC200 基因
BC200 gene
BSC4 基因
BSC4 gene
POXP2 基因
POXP2 gene
FGF4 基因
FGF4 gene
Tre2 基因
Tre2 gene
Sphinx 基因
Sphinx gene
“猴王” 基因
XIST 基因
XIST gene
33.
Question 33
下图所示的新基因起源机制是哪一种?
What is the mechanism of new gene origination described by the figure below?
1 point
基因水平转移
Lateral gene transfer
逆转录转座
Retrotransposition
基因重复
Gene duplication
外显子/结构域重排
exon/domain shuffling
可移动元件
mobile element
从头起源
De novo origination
34.
Question 34
给定图中的物种系统发生关系和基因在各物种中是否存在,依据最简约原则如
下哪一个推断是正确的?
Assume that we know the phylogeny and the existence of some genes as shown
below. Then which of the following statements is correct if we apply Occam's razor?
1 point
MNOP is a new gene originated after the divergence of Species 5 and the ancester of
Species 1, 2, 3, and 4
EFGH 是一个在所有物种中都有的新基因
35.
Question 35
如下哪个生物信息学方法可以用来寻找新基因?
Which of the following bioinformatics methods can be used to find new genes?
1 point
SOAP
Blast
KOBAS
SIFT
BWA
36.
Question 36
如下哪个计算方法不能对一个之前未知的从头起源基因提供有用的信息?
Which of the following methods cannot provide useful information for a de novo new
gene about which we knew nothing before?
1 point
蛋白理化性质(如 pI 值)预测
Prediction of the physical and chemical properties of proteins, such as the pI value
基于已知功能基因的同源注释
37.
Question 37
下列关于直系同源基因和旁系同源基因说法正确的是
Which of the following statements is correct with respect to orthologs and paralogs?
1 point
直系同源基因是由物种分化产生的
旁系同源基因是由物种分化产生的
旁系同源基因是由基因复制产生的
直系同源基因是由基因复制产生的
38.
Question 38
如下哪些技术可以用来提供转录组数据
1 point
RNA-seq
Mass spectrometry
SNP chip
cDNA microarray
39.
Question 39
Which of the following species has orthologous DNA sequences for the human gene
SRGAP2C?
1 point
家猪
小家鼠
Mus musculus
临夏鸵鸟
Struthio linxiaensis
索氏桃花水母
Craspedacusta sowerby
黑腹果蝇
Drosophila melanogaster
大肠杆菌
Escherichia coli
酿酒酵母
Saccharomyces cerevisiae
黑猩猩
Pan troglodytes
北极熊
Ursus maritimus
斑马鱼
Brachydanio rerio
40.
Question 40
我们今天知道的基因组上含有基因最多的物种是
To the best of our knowledge, which of the following species has the most abundant
genes?
1 point
拟南芥
Arabidopsis thaliana
小家鼠
Mus musculus
酿酒酵母
Saccharomyces cerevisiae
北极熊
Ursus maritimus
大肠杆菌
Escherichia coli
黑腹果蝇
Drosophila melanogaster
大豆
Glycine max
智人
Homo sapiens
番茄
Solanum lycopersicum
2.
Question 2
Which of the following qualities of sequencing denotes the lowest sequencing error
rate?(single character recorded in phred33)
1 point
40
3.
Question 3
BAM 格式中不包括的信息有哪些
1 point
读段序列
读段比对的染色体名字
读段的结构信息
读段的质量
4.
Question 4
高通量测序技术的序列回帖算法思想最类似以下哪种?
To which of the following algorithms is the reads mapping algorithm applied in high-
throughput sequencing technique most similar with respect to their basic ideas?
1 point
Smith-Waterman 局部比对
广度优先搜索
Kruskal 最小生成树算法
Kruskal algorithm for Minimum Spanning Tree
BLAST 索引和数据库搜索
5.
Question 5
下列哪一种测序仪不是高通量测序仪?
1 point
罗氏 454 焦磷酸测序仪
6.
Question 6
以下不属于生物信息学研究内容的是
Which of the following statements does NOT belong to bioinformatics research? (2
correct options)
1 point
氨基酸序列比对技术
序列数据库搜索
转录组序列比对技术
表型预测方法
测序仪的水平稳定控制
基因组数据挖掘
基因组序列比对技术
代谢分析图模型
构建系统发育树
1-4
7.
Question 7
下列关于替换矩阵的说法哪些是正确的
Which of the following statements are correct with respect to substitution matrix?
1 point
现在人们已经找到了序列比对时最好的打分矩阵
Now people have found the best scoring matrix for sequence alignment
替换矩阵的值由且仅由经验公式决定
改变替换矩阵不会影响序列比对结果
Changing substitution matrix won't influence the result of a sequence alignment