
BLAST - an essential guide to the basic local alignment search tool

Book · January 2003




Book reviews

An Essential Guide to the Basic Local Alignment Search Tool: BLAST
Ian Korf, Mark Yandell and Joseph Bedell
O’Reilly & Associates, Sebastopol, CA; ISBN 0 596 00299 8; 311 pp.; US$39.95 (pbk); July 2003

This book contains five parts: a one-chapter introduction, three chapters of theory, five on practice, three on ‘Industrial Strength BLAST’, reference manuals for NCBI-BLAST and WU-BLAST, plus five appendices. The introduction walks a novice user through a preliminary search via the NCBI BLAST website and explains how to interpret the output. The theory provides some background on molecular biology, the purpose and mechanism of sequence alignment methods, and the evaluation of alignments and significance measures resulting from database searches. The practical section provides an in-depth discussion of the BLAST suite of programs, a detailed explanation of the BLAST output file format, a cursory description of alignment statistics, and tips, hints and recipes for building protocols and pipelines out of the BLAST suite of programs. The final content section, ‘Industrial Strength BLAST’, provides an in-depth explanation of how to install BLAST, curate databases and perform effective and efficient searches by applying software and hardware optimisations. The remaining sections (reference and appendices) provide reference and code materials to facilitate advanced use of BLAST.

The current generation of computational biologists enjoys the enviable position of being in such demand that they cannot fulfil the biological community’s need for their services. Thus, effective training of new scientists, with the skills to leverage the sophisticated tools that have been developed for performing fast yet sensitive sequence database searches, has become imperative for computational biology to remain a growing, and useful, science. Although there are several books that provide a good introduction to the science behind bioinformatics-oriented computational biology (‘Biological Sequence Analysis’ by Durbin, Eddy, Mitchison and Krogh being the seminal piece), and a recent collection of scripting language-oriented practical applications texts (there are two O’Reilly books covering Perl programming for bioinformatics, plus my personal favourite, ‘Genomic Programming in Perl’ by Rex Dwyer), no single text provided an integrated view of the science behind alignment searching and the most popular program for doing that, BLAST. This book aims for that, and I believe it succeeds well enough that I recommend it as essential reading material for advanced undergraduates taking an upper division bioinformatics course, and for graduate students just starting in a computationally oriented biological field of study. It may even be appropriate for advanced students and professors who need to ‘bone up’ on their BLAST skill set.

The initial introductory chapter walks the reader through a straightforward and elementary search on the NCBI BLAST website. A HOX gene from coelacanth is searched against the nr database and the resulting hits are inspected. The book does a fine job of introducing the novice user to the task of interpreting a BLAST report. Immediately following this introductory example, the book has a soft introduction to the central dogma of molecular biology, the atomic structure of

© HENRY STEWART PUBLICATIONS 1467-5463. BRIEFINGS IN BIOINFORMATICS. VOL 5. NO 1. 93–96. MARCH 2004



nucleic acids and proteins, and genetic evolution. This clearly didactic sort of introduction is necessary to prepare a novice reader to make intelligent decisions when they attempt to discern biological information from their BLAST search output.

The following two chapters introduce sequence alignment and the theory behind the statistical measures used to evaluate sequence similarity, respectively. The alignment chapter gives clear, worked-out examples of both Needleman–Wunsch and Smith–Waterman, something I find lacking in some other bioinformatics texts (proper treatment of edge conditions is often completely missing). A working example Perl program that demonstrates dynamic programming can be used as a reference by students who wish to understand the mechanics of sequence alignment methods and is a welcome addition. The chapter on alignment statistics introduces the relevant concepts, without delving too deeply into some of the more obscure but important issues behind the theory.

The five practical chapters perform a very important service. Perhaps other readers who have independently learned to set up and use BLAST locally will appreciate the value of the information presented in these chapters, because it is sadly lacking (or hidden) in the standard BLAST documentation. The first chapter walks the reader through the basic BLAST programs, how the BLAST algorithm achieves its speed-up (fortunately, a short introduction to time and memory complexity earlier in the book has prepared the reader to appreciate the algorithm), and where to go to learn more details. The next few chapters perform the tedious chore of completely explaining the standard BLAST output file format, how to interpret BLAST statistics, and crucial tips for improving the speed and sensitivity of BLAST searches. The tips are all worthwhile, although only some may be applicable to any particular laboratory or researcher, depending on their interests. The book even provides a short section on ‘How to lie with BLAST statistics’, which is of questionable value, although it does demonstrate the importance of database size in E-value computations. The remaining textual content section contains three chapters. The first contains all the information required to download and install BLAST and sequence databases locally. The next provides critical advice on how to maintain BLAST databases: as the authors say, ‘one of the most neglected yet important aspects of using BLAST’.

The remaining sections, BLAST Reference, and the Appendices, have the feel of ‘needing to fill the book out a little’. The reference contains little more than the manual pages for the NCBI-BLAST and WU-BLAST programs, while the appendices contain information that probably belonged directly in the content of the book: description of the various NCBI-BLAST sequence alignment output formats, tables of values computed on the similarity matrices used for nucleotide and protein scoring schemes, and source code for a couple of Perl utility scripts.

In all I heartily recommend this book be read by anybody who does not know BLAST but has a need to use it. It provides a firm introduction to both the basic molecular biology and computer science underlying BLAST, and does so in a didactic and no-nonsense style. The ‘meat’ of this book is in its clear explanations of knowledge that previously had to be learnt from other BLAST practitioners or gleaned from extensive searching of old mailing lists and obscure documentation pages.

David Konerding
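
A note on the remark above about database size and E-values. The sketch below is not taken from the book; it assumes the simplified Karlin–Altschul relation E = m · n · 2^(−S′), where m is the query length, n the total length of the database and S′ the normalised bit score, and it ignores the effective-length corrections that real BLAST implementations apply. The function name and all numbers are hypothetical and chosen only for illustration.

```python
def expect_value(bit_score: float, query_len: int, db_len: int) -> float:
    """Simplified E-value, E = m * n * 2**(-S').

    Assumption: no edge-effect (effective length) correction is applied,
    so this is an approximation of what a BLAST report would show.
    """
    return query_len * db_len * 2.0 ** (-bit_score)


# Hypothetical inputs, for illustration only.
query_len = 300      # residues in the query (m)
bit_score = 50.0     # normalised bit score of a single hit (S')

# The same hit, with the same score, searched against databases of
# increasing total length (n): the E-value grows in direct proportion.
for db_len in (10**6, 10**8, 10**10):
    e = expect_value(bit_score, query_len, db_len)
    print(f"db_len={db_len:>14,}  E={e:.3g}")
```

Under this assumption an identical alignment looks less significant against a larger database, so E-values quoted without the database size are hard to compare across searches.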
