Professional Documents
Culture Documents
Students
Paul Denny - UniProt Content Team
In this guide:
• Search for proteins
• How to get the most from a basic search
• Functional data in a protein entry
• Explore specific functions, locations and structural data
• Protein sequences and sequence features
• Accessing protein sequences
• Amino acid modifications
• Proteomes
• What proteomes are and how to access them
• Mutations and disease annotations
• Proteins implicated in disease
UniProt
A comprehensive, high-quality and freely accessible
resource of protein sequence and functional information:
• Primary sequences including sequences of isoforms
• Physiological protein function including subcellular location, pathways,
reactions, interactions and involvement in disease
• Structural information including topology and access to 3D structures
• Data analysis tools such as BLAST and multiple sequence alignment
European Bioinformatics Institute (EMBL-EBI), Protein Information Resource (PIR), SIB Swiss Institute of Bioinformatics (SIB),
Hinxton, Cambridge, UK Washington DC and Delaware, USA Geneva, Switzerland
www.uniprot.org
Download
Multiple file types, download formats
Customize
and download sources – from website Customizable view, columns, download
to data services. options.
Visualize Explore
Visualizations to help interpret data, Protein entries with comments,
e.g. ProtVista, interaction viewer, features, data provenance. Proteome
subcellular location, 3D structure and sequence cluster collections.
viewer.
Analyze
A workbench of tools like BLAST, Align, Peptide search, ID
mapping interwoven through user flow.
Data Sources
Analyse/re- Use/organise
Compare/integrate
analyse
Homepage uniprot.org
Tools UniProt release
2022_03
Resources
Search bar uniprot.org
Select Advanced
database search
Search bar:
Gene names
Protein names Search using a List
Diseases
of accessions or IDs
Example 1: Using UniProt to study components of
biological processes and pathways
ATG16
ATG12
ATG12 ATG12
ATG16
ATG5 ATG5
ATG5
Select
database
Restrict search
to specific field
Search
‘AND’, ‘OR’,
‘NOT’ term
Add or Remove
field
Retrieve/ID mapping Tool
Find multiple
proteins at
once
Retrieve/ID mapping Tool
Use the retrieve
tool to search
for multiple
entries at a time
List gene
names Set search
parameters
Recent
mapping
Previous
mapping(s)
Access your
tool results
page
Results page
Filter
results
Reviewed Vs Unreviewed
Protein sequence and Direct sequence
function data is submissions and
obtained from scientific sequence data from
publications sequence repositories
O94817 O94817
Q9H1Y0 Q676U5
Q9H1Y0
Q9H1Y0
to gene names
Search for entries containing a
specific string of amino acids
Enter peptide
Specify organism
sequence
4 protein entries contain this specific string of amino acids
UniProtKB
View Entries
abstract publication
maps to
Accessing cross references
Restrict BLAST
results by
taxonomy
Tools - BLAST
Filters
Align multiple
sequences
(fasta format)
Tools - Align
Different
result
outputs
Display user-
requested
protein features
tracks
Tools - Align
Display protein
features tracks
Example 4: Accessing proteomes
Search
specific
diseases
Access
disease
data
Access null
mutant
phenotype
data
Access
phenotype
data due to
RNAi and
morpholino
Mutations that disrupt one or multiple amino acids
Provide
mutant
name
Indicate
amino acids
affected
Summary:
• Perform a search
• Access functional data and sequences
• Analyse sequences
• Explore proteomes
• Obtain disease and mutagenesis data
Also:
• UniProt releases every 8 weeks and is freely available
• Data is provided in a range of downloadable formats
• text, XML, XML/RDF, FASTA, GFF, tab-delimited
uniprot.org
We need your help!
Please help us improve the UniProt website by providing valuable feedback.
Further help and
documentation
• For further help, contact us or go to
the Help centre.
Key staff: Cecilia Arighi (Curation), Lionel Breuza (Curation), Elisabeth Coudert (Curation), Hongzhan Huang (Development),
Damien Lieberherr (Curation), Michele Magrane (Curation), Maria Martin (Development), Peter McGarvey (Content), Darren
Natale (Content), Sandra Orchard (Content), Ivo Pedruzzi (Curation), Sylvain Poux (Curation), Manuela Pruess (Coordination),
Shriya Raj (Coordination), Nicole Redaschi (Development), Karen Ross (Content)
Content / Curation: Lucila Aimo, Ghislaine Argoud-Puy, Andrea Auchincloss, Kristian Axelsen, Emmanuel Boutet, Emily
Bowler-Barnett, Hema Bye-A-Jee, Cristina Casals-Casas, Paul Denny, Anne Estreicher, Maria Livia Famiglietti, Marc
Feuermann, John S. Garavelli, Arnaud Gos, Nadine Gruaz, Chantal Hulo, Nevila Hyka-Nouspikel, Florence Jungo, Kati Laiho,
Philippe Le Mercier, Antonia Lock, Yvonne Lussi, Patrick Masson, Anne Morgat, Sandrine Pilbout, Lucille Pourcel, Pedro
Raposo, Catherine Rivoire, Karen Ross, Christian Sigrist, Elena Speretta, Shyamala Sundaram, Nidhi Tyagi, C. R. Vinayaka,
Qinghua Wang, Kate Warner, Lai-Su Yeh, Rossana Zaru
Development: Shadab Ahmed, Leslie Arminski, Parit Bansal, Delphine Baratin, Teresa Batista Neto, Jerven Bolleman,
Chuming Chen, Yongxing Chen, Beatrice Cuche, Edouard De Castro, Leonardo de Costa Gonzales, ThankGod Ebenezer, Jun
Fan, Elisabeth Gasteiger, Sebastien Gehant, Abdulrahman Hussein, Alexandr Ignatchenko, Giuseppe Insana, Rizwan Ishtiaq,
Vishal Joshi, Dushyanth Jyothi, Arnaud Kerhornou, Aurelian Luciani, Marija Lugaric, Jie Luo, Monica Pozzato, Daniel Rice,
James Stephenson, Edward Turner, Preethi Vasudev, Yuqi Wang, Hermann Zellner, Jian Zhang
European Bioinformatics Institute (EMBL-EBI), Protein Information Resource (PIR), SIB Swiss Institute of Bioinformatics (SIB),
Hinxton, Cambridge, UK Washington DC and Delaware, USA Geneva, Switzerland
www.uniprot.org
Funding
National Eye Institute (NEI), National Human Genome Research Institute (NHGRI),
National Heart, Lung, and Blood Institute (NHLBI), National Institute on Aging (NIA),
National Institute of Allergy and Infectious Diseases (NIAID), National Institute of
Diabetes and Digestive and Kidney Diseases (NIDDK), National Institute of General
Medical Sciences (NIGMS), National Cancer Institute (NCI) and National Institute of
Mental Health (NIMH) of the National Institutes of Health
SERI