Professional Documents
Culture Documents
Textbook Computer Vision Theory Algorithms Practicalities Fifth Edition Davies Ebook All Chapter PDF
Textbook Computer Vision Theory Algorithms Practicalities Fifth Edition Davies Ebook All Chapter PDF
https://textbookfull.com/product/biota-grow-2c-gather-2c-cook-
loucas/
https://textbookfull.com/product/numerical-algorithms-methods-
for-computer-vision-machine-learning-and-graphics-1st-edition-
justin-solomon/
https://textbookfull.com/product/advances-in-computer-vision-
proceedings-of-the-2019-computer-vision-conference-cvc-
volume-1-kohei-arai/
https://textbookfull.com/product/computer-vision-metrics-
textbook-edition-scott-krig-auth/
Fundamentals of Computer Vision 1st Edition Wesley E.
Snyder
https://textbookfull.com/product/fundamentals-of-computer-
vision-1st-edition-wesley-e-snyder/
https://textbookfull.com/product/robotics-vision-and-control-
fundamental-algorithms-in-matlab-peter-corke/
https://textbookfull.com/product/probability-theory-and-examples-
fifth-edition-durrett/
https://textbookfull.com/product/computer-vision-technology-for-
food-quality-evaluation-second-edition-sun/
https://textbookfull.com/product/computer-vision-based-
agriculture-engineering-1st-edition-han-zhongzhi-author/
Computer Vision
This page intentionally left blank
Computer Vision
Principles, Algorithms,
Applications, Learning
Fifth Edition
E.R. Davies
Royal Holloway, University of London, United Kingdom
Academic Press is an imprint of Elsevier
125 London Wall, London EC2Y 5AS, United Kingdom
525 B Street, Suite 1800, San Diego, CA 92101-4495, United States
50 Hampshire Street, 5th Floor, Cambridge, MA 02139, United States
The Boulevard, Langford Lane, Kidlington, Oxford OX5 1GB, United Kingdom
No part of this publication may be reproduced or transmitted in any form or by any means, electronic or mechanical,
including photocopying, recording, or any information storage and retrieval system, without permission in writing
from the publisher. Details on how to seek permission, further information about the Publisher’s permissions policies
and our arrangements with organizations such as the Copyright Clearance Center and the Copyright Licensing
Agency, can be found at our website: www.elsevier.com/permissions.
This book and the individual contributions contained in it are protected under copyright by the Publisher (other than
as may be noted herein).
Notices
Knowledge and best practice in this field are constantly changing. As new research and experience broaden our
understanding, changes in research methods, professional practices, or medical treatment may become necessary.
Practitioners and researchers must always rely on their own experience and knowledge in evaluating and using any
information, methods, compounds, or experiments described herein. In using such information or methods they
should be mindful of their own safety and the safety of others, including parties for whom they have a professional
responsibility.
To the fullest extent of the law, neither the Publisher nor the authors, contributors, or editors, assume any liability
for any injury and/or damage to persons or property as a matter of products liability, negligence or otherwise, or
from any use or operation of any methods, products, instructions, or ideas contained in the material herein.
ISBN: 978-0-12-809284-2
vii
viii Contents
xxi
This page intentionally left blank
Foreword
It is an honor to write a foreword for Roy Davies’ new edition of Computer and
Machine Vision, now entitled Computer Vision: Principles, Algorithms,
Applications, Learning. This is one of the major books in Computer Vision and
not just for its longevity, having now reached its Fifth Edition. It is actually a
splendid achievement to reach this status and it reflects not only on the tenacity
and commitment of its author, but also on the achievements of the book itself.
Computer Vision has shown awesome progress in its short history. This is part
due to technology: computers are much faster and memory is now much cheaper
than they were in the early days when Roy started his research. There have been
many achievements and many developments. All of this can affect the evolution
of a textbook. There have been excellent textbooks in the past, which were neither
continued nor maintained. That has been avoided here as the textbook has contin-
ued to mature with the field and its many developments.
We can look forward to a future where automated computer vision systems
will make our lives easier while enriching them too. There are already many
applications of Computer Vision in the food industry and robotic cars that will be
with us very soon. Then there are continuing advancements in medical image
analysis, where Computer Vision techniques can be used to aid in diagnosis and
therapy by automated means. Even accessing a mobile phone is considerably
more convenient when using a fingerprint and access by face recognition con-
tinues to improve. These have all come about due to advancements in computers,
Computer Vision, and applied artificial intelligence.
Adherents of Computer Vision will know it to be an exciting field indeed. It
manages to cover many aspects of technology from human vision to machine
learning requiring electronic hardware, computer implementations, and a lot of
computer software. Roy continues to cover these in excellent detail.
I remember the First Edition when it was first published in 1990 with its
unique and pragmatic blend of theory, implementation, and algorithms. I am
pleased to see that the Fifth Edition maintains this unique approach, much appre-
ciated by students in previous editions who wanted an accessible introduction to
Computer Vision. It has certainly increased in size with age, and that is often the
way with books. It is most certainly the way with Computer Vision since many of
its researchers continue to improve, refine, and develop new techniques.
A major change here is the inclusion of Deep Learning. Indeed, this has been
a major change in the field of Computer Vision and Pattern Recognition. One
implication of the increase in computing power and the reduction of memory cost
is that techniques can become considerably more complex, and that complexity
lends itself to application in the analysis of “big data.” One cannot ignore the per-
formance of deep learning and convolutional neural networks: one only has to
peruse the program of top international conferences to perceive their revolution-
ary effect on research direction. Naturally, it is early days but it is good to have
xxiii
xxiv Foreword
Mark S. Nixon
University of Southampton, Southampton, United Kingdom
July 2017
Preface to the Fifth Edition
The first edition of this book came out in 1990, and was welcomed by many
researchers and practitioners. However, in the subsequent two decades the subject
moved on at a rapidly accelerating rate, and many topics that hardly deserved a
mention in the first edition had to be solidly incorporated into subsequent edi-
tions. For example, it seemed particularly important to bring in significant
amounts of new material on feature detection, mathematical morphology, texture
analysis, inspection, artificial neural networks, 3D vision, invariance, motion anal-
ysis, object tracking, and robust statistics. And in the fourth edition, cognizance
had to be taken of the widening range of applications of the subject: in particular,
two chapters had to be added on surveillance and in-vehicle vision systems. Since
then, the subject has not stood still. In fact, the past four or five years have seen
the onset of an explosive growth in research on deep neural networks, and the
practical achievements resulting from this have been little short of staggering. It
soon became abundantly clear that the fifth edition would have to reflect this radi-
cal departure—both in fundamental explanation and in practical coverage. Indeed,
it necessitated a new part in the book—Part 3, Machine Learning and Deep
Learning Networks—a heading which affirms that the new content reflects not
only “Deep Learning” (a huge enhancement over the older “Artificial Neural
Networks”) but also an approach to pattern recognition that is based on rigorous
probabilistic methodology.
All this is not achieved without presentation problems: for probabilistic
methodology can only be managed properly within a rather severe mathematical
environment. Too little maths, and the subject could be so watered down as to be
virtually content-free: too much maths, and many readers might not be able to
follow the explanations. Clearly, one should not protect readers from the (mathe-
matical) reality of the situation. Hence, Chapter 14 had to be written in such a
way as to demonstrate in full what type of methodology is involved, while provid-
ing paths that would take readers past some of the mathematical complexities—at
least, on first encounter. Once past the relatively taxing Chapter 14, Chapters 15
and 21 take the reader through two accounts consisting largely of case studies,
the former through a crucial development period (20122015) for deep learning
networks, and the latter through a similar period (20132016) during which deep
learning was targeted strongly at face detection and recognition, enabling remark-
able advances to be made. It should not go unnoticed that these additions have so
influenced the content of the book that the title had to be modified to reflect
them. Interestingly, the organization of the book was further modified by
collecting three applications chapters into the new Part 5, Putting Computer
Vision to Work.
It is worth remarking that, at this point in time, computer vision has attained a
level of maturity that has made it substantially more rigorous, reliable, generic,
and—in the light of the improved hardware facilities now available for its
xxv
xxvi Preface to the Fifth Edition
Roy Davies
Royal Holloway, University of London, United Kingdom
Preface to the First Edition
Over the past 30 years or so, machine vision has evolved into a mature subject
embracing many topics and applications: these range from automatic (robot)
assembly to automatic vehicle guidance, from automatic interpretation of docu-
ments to verification of signatures, and from analysis of remotely sensed images
to checking of fingerprints and human blood cells; currently, automated visual
inspection is undergoing very substantial growth, necessary improvements in
quality, safety, and cost-effectiveness being the stimulating factors. With so much
ongoing activity, it has become a difficult business for the professional to keep up
with the subject and with relevant methodologies: in particular, it is difficult for
them to distinguish accidental developments from genuine advances. It is the pur-
pose of this book to provide background in this area.
The book was shaped over a period of 1012 years, through material I have
given on undergraduate and postgraduate courses at London University, and con-
tributions to various industrial courses and seminars. At the same time, my own
investigations coupled with experience gained while supervising PhD and post-
doctoral researchers helped to form the state of mind and knowledge that is now
set out here. Certainly it is true to say that if I had had this book 8, 6, 4, or even
2 years ago, it would have been of inestimable value to myself for solving practi-
cal problems in machine vision. It is therefore my hope that it will now be of use
to others in the same way. Of course, it has tended to follow an emphasis that is
my own—and in particular one view of one path towards solving automated
visual inspection and other problems associated with the application of vision in
industry. At the same time, although there is a specialism here, great care has
been taken to bring out general principles—including many applying throughout
the field of image analysis. The reader will note the universality of topics such as
noise suppression, edge detection, principles of illumination, feature recognition,
Bayes’ theory, and (nowadays) Hough transforms. However, the generalities lie
deeper than this. The book has aimed to make some general observations and
messages about the limitations, constraints, and tradeoffs to which vision algo-
rithms are subject. Thus there are themes about the effects of noise, occlusion,
distortion, and the need for built-in forms of robustness (as distinct from less suc-
cessful ad hoc varieties and those added on as an afterthought); there are also
themes about accuracy, systematic design, and the matching of algorithms and
architectures. Finally, there are the problems of setting up lighting schemes which
must be addressed in complete systems, yet which receive scant attention in most
books on image processing and analysis. These remarks will indicate that the text
is intended to be read at various levels—a factor that should make it of more last-
ing value than might initially be supposed from a quick perusal of the contents.
xxvii
xxviii Preface to the First Edition
Language: Spanish