Professional Documents
Culture Documents
LMI-2: Genetic
Variations
Traci Tosh, MSIS
2016
Objectives
Identify the basic genetic resources that are available for searching diseases
and disorders; such as Gene, ClinVar, dbSNP, and OMIM
Navigate NCBIs variation databases and tools to find specific types of
information related to variation data, such as clinical significance
Explain how genetic variations and mutations impact health
Small Variations
Small variations tend to be variations in a single base pair.
Small
AAGGATG
|||||||
TTCCTAC
G to T
C to A
AAGTATG
|||||||
TTCATAC
Source:
Matten, Wayne. A Librarians Guide to NCBI. National Center for Biotechnology Information. April 2013.
Types of Variations
Insertions & Deletions
Reference sequence
AAGGATGC
Insertion
AAGGCATGC
Deletion
AAG--TGC
Source:
Matten, Wayne. A Librarians Guide to NCBI. National Center for Biotechnology Information. April 2013.
GGC
Gly
GGT
Gly
Change to
a stop
codon
TCG
Ser
GGAAAGGTTAA
GlyLysVal
TCG
Ser
TAG
Ter
GGAAGGTTAA
GlyArgLeu
TGG
Trp
Source:
Matten, Wayne. A Librarians Guide to NCBI. National Center for Biotechnology Information. April 2013.
Genetic databases and tools are freely offered through the National Center for
Biotechnology Information (NCBI) website ( http://www.ncbi.nlm.nih.gov/). There are
three ways to search for genetic databases and tools, one, through the drop down box
next to the search box, two, from one of the subjects on the left side of the page, or
three, type your search into the search box, leave the drop down at All Databases and
click Search.
After an All Databases search, this is the screen that will display. It is called the
Entrez screen and is a quick way to access all NCBI databases: http
://www.ncbi.nlm.nih.gov/gquery/
VariationDatabases
dbSNP
dbVar
Medical Genetics
ClinVar
MedGen
Genetic Testing Registry
OMIM (omim.org)
dbGaP
Above is a list of the variation resources that are available through NCBI. We
are going to focus on the Variation resources in blue, as well as the Gene
database.
The results are organized by name and gene ID, description, chromosome location,
other known gene symbols, and the Online Mendelian Inheritance in Man (OMIM)
database number listed as MIM. OMIM will be mentioned later in this tutorial. If too
many results display, use the filters located on the left side to narrow down the search.
This should be familiar as PubMed has the same screen design.
Clicking on the link to the Gene database from the previous search on Li Fraumeni brings
up a list of gene results. Select the first listing for TP53 and human. Make sure to look
through the list of results to verify the record you are choosing is not an animal record.
The ability to filter by Organism is located on the right side of the screen.
Discontinued records display above the listed results.
In the selected TP53 gene record the top portion, or Summary, of the record textually
describes the gene; the official symbol and name, the gene type, the lineage, other
symbols this gene is known as, and a brief summary regarding TP53. Below the
Summary section is a section on Genomic Context with a diagram visually showing
which chromosome this gene is found on and the location.
On the right side of the screen is the Table of contents with quick links to sections
within this gene record for easier searching of the record. Located below the table of
contents are the Related Information links to related gene content in other databases,
primarily NCBI (these links will be displayed on the next slide). Remember, there is
also the ability to go directly to another resource from the Entrez search screen as well.
Scrolling down the page is the Sequence Viewer. This viewer is a visual
representation of where the gene is located on Chromosome 17 and where
the variations are located.
Related
Information
linkstoother
databasesthat
referencethe
TP53gene
Each of the
colored boxes
found under
ClinVar Short
Variations or
Cited Variants,
dbSNP has a
meaning (see
next slide) and
if hovered
over, will open
a pop-up
window with
information on
the variation
and links to
other NCBI
resources
such as
PubMed.
Source:
4.3. SNP Bins For Clinical Associations. Image. Graphical View Legend. National Center for Biotechnology Information,
Gene database. August 2016.
Continuing to scroll down the Gene page is where the section on Associated conditions for TP53 can
be found. Clicking on the condition link, for example Li-Fraumeni syndrome 1, will display the
Genetic Testing Registry page. Clicking on the Compare labs will display an alphabetical list of labs
with the different tests and services each lab runs. There are also links to other databases such as
MedGen and OMIM which will be discussed later in this module.
The Genetic Testing Registry (GTR) record of the disease provides a summary of the disease, available tests, associated
genes, and related conditions. It is a registry for voluntary submission of genetic test information by providers.
Notice the links on the right side of the screen for Reviews, Clinical resources, Practice guidelines, Molecular resources,
and Consumer resources.
Scrolling down the Genetic Testing Registry (GTR) record is where a breakdown of the
conditions related to Li Fraumeni along with linked icons to other resources. To look at the
full GeneReviews record for Li Fraumeni, click on the green G next to Li-Fraumeni 1.
GeneReviews are part of the NCBI Bookshelf. On the right side of the screen is a section
titled, In this GeneReview. These are quick links to sections of the currently displayed
GeneReview. Use the back arrow to return to the Gene database.
OMIM
OMIM stands for Online Mendelian Inheritance in Man. It is a database comprised of human
genes and genetic phenotypes. One of the pluses to OMIM are the external links it has in
each record to other NCBI and non-NCBI databases. To learn more about the information
located within each OMIM record, click on the FAQ link.
Linking to OMIM from the Gene database brings up a list of results on Li-Fraumeni
and TP53. Notice that the OMIM results list is similar in looks to a PubMed results
list. Scroll down the page until you find the listing for Tumor Protein p53 and click
the title.
The Tumor Protein p53 record in OMIM will display. OMIM is more textual in design
than the Gene database. To the left of the record is the table of contents; notice that
the OMIM record encompasses a wide variety of information on the Tumor Protein p53
gene record. To view a particular section of the record, click on the link in the table of
contents.
Selecting the Molecular Genetics section from the table of contents displays a listing of
diseases associated with the Tumor Protein 53. Within each disease section are
highlighted links to other resources as well as linked references to articles.
Currently the record displayed is the gene record. To link to the phenotype record use
the MIM Number link to the in the Gene-Phenotype Relationship table next to the
appropriate phenotype. The table in the phenotype record will then display as the
Phenotype-Gene Relationship table.
Located to the right of the record are the external links to other sites containing
information on the disease/gene. For more on information on the External
Links, select the External Links tab at the top of the screen. Click on the link for
Variation to access links to specific variation resources. Select ClinVar.
ClinVar
ClinVar is a database containing records based on genetic variations and their affiliation with medical
disorders. ClinVar is intended to meet the needs of the medical genetics community.
Disclaimer from ClinVar: ClinVar is based on direct submission, and not all variants known to cause or be
associated with medical disorders have been submitted to ClinVar or mapped to genomic coordinates.
You should not interpret the absence of medical information for a variant in a ClinVar file as a lack of
medical or functional effect for that variation
Selecting the ClinVar database link in OMIM brings up a list of all the TP53 variations in
ClinVar. There are quite a few listed so we will need to use filters to narrow down our
search. Similar to PubMed and the Gene database, additional filters are available on the
left side of the screen.
Filtering to Pathogenic narrows down our search to only display the genes with
pathogenic variations along with their corresponding conditions and locations.
Click on one of the results to view an individual record.
Choose the detailed pathogenic record for the TP53 gene with a coding DNA of c.637C>T and protein
changes of amino acid Arg213 to amino acid Ter. The ClinVar record lists the information on the
genetic variation, protein changes, clinical significance, and links to condition information.
Continue scrolling down the page to the tabs for Clinical Assertions (who the submitter is, clinical
significance, links to citations), Summary Evidence (laboratories, families, allele origin), and
Supporting Observations (phenotypes, description of genetic variation).
Click on the dbSNP link in the ClinVar record to bring up the result in the
dbSNP database.
dbSNP
dbSNP is a database containing a catalog of short variations in nucleotide
sequences from a wide range of organisms.
Selecting the dbSNP record in ClinVar retrieves one result. Click on the
result to retrieve the full dbSNP record.
At the top of the dbSNP record is where you will find the type of organism
(human) that the variation is associated with, the alleles that are affected (C/T),
and the clinical significance (with pathogenic allele).
A) The Integrated Maps section provides a summary of genome mapping information for the variation
by clicking on any value in the Chromosome Position (Chr Pos) column or the Contig Position
(Contig Pos) column.
B) The SNP GenView section is being replaced with the Variation Viewer which will be discussed
later in this tutorial.
C) The RefSeqGene Mapping table summarizes variation mapping information and protein changes.
A
B
C
The SNP GeneView page mentioned earlier displays a list of short genetic variations of the gene and
the functional amino acid changes they cause. However, it only reports human variation on GRCh38,
the new assembly for the human genome. A new Variation Viewer is available to view the gene TP53
variations in GRCh37p13 (previous human genome assembly) or GRCh38, and will replace SNP
GeneView later this year. The Variation Viewer is the genomic browser used to search and view
genomic variations listed in dbSNP and ClinVar databases
To search the new Variation Viewer, click the VarView button at the top of the dbSNP page and choose
which assembly in which you wish to view the variations.
The Variation Viewer is similar to PubMed in how the screen is structured - main search
topic is on the right and the limits/filters are on the left. The Sequence Viewer is the main
display and is a visual representation of where the gene is located on the Chromosome as
well as where the variations are located. Located under the Sequence Viewer is a table
with links to the variations, their locations, type of variant, the gene its associated with, the
clinical significance, and any publications associated with the variation.
To learn more about the Variation Viewer, click on the You Tube link located
at the top right of the screen.
Variation Portal
Up to this point I have shown you how to access the variation databases and portals
linking from one resource to another. A shortcut is through the Variation Portal which
contains many, but not all, of NCBIs variation resources all in one easy to reference
location.
http://www.ncbi.nlm.nih.gov/variation/
Formative Assessment:
Answer
2
3
4
Question
Click on the corresponding button to see the answer!!
1.
2.
Can you filter your results in the Gene database? True or False?
3.
You must run a new search each time you wish to view other genetic databases and tools on
your topic. True or False?
4.