You are on page 1of 14

Home Search Collections Journals About Contact us My IOPscience

Seismic data interpretation using the Hough transform and principal component analysis

This article has been downloaded from IOPscience. Please scroll down to see the full text article.

2011 J. Geophys. Eng. 8 61

(http://iopscience.iop.org/1742-2140/8/1/008)

View the table of contents for this issue, or go to the journal homepage for more

Download details:
IP Address: 192.100.180.20
The article was downloaded on 09/12/2010 at 22:31

Please note that terms and conditions apply.


IOP PUBLISHING JOURNAL OF GEOPHYSICS AND ENGINEERING
J. Geophys. Eng. 8 (2011) 61–73 doi:10.1088/1742-2132/8/1/008

Seismic data interpretation using the


Hough transform and principal
component analysis
M G Orozco-del-Castillo1 , C Ortiz-Alemán1 , R Martin2 ,
R Ávila-Carrera1 and A Rodrı́guez-Castellanos1
1
Instituto Mexicano del Petróleo, Eje Central Lázaro Cárdenas 152, México, DF 07730, México
2
Université de Pau et des Pays de l’Adour, CNRS & INRIA Magique-3D, Laboratoire de Modélisation
et d’Imagerie en Géosciences UMR 5212, Avenue de l’Université, 64013 Pau Cedex, France
E-mail: rcarrer@imp.mx, jcortiz@imp.mx, mgabriel@imp.mx, arcastel@imp.mx and
roland.martin@univ-pau.fr

Received 19 February 2010


Accepted for publication 9 November 2010
Published 9 December 2010
Online at stacks.iop.org/JGE/8/61

Abstract
In this work two novel image processing techniques are applied to detect and delineate
complex salt bodies from seismic exploration profiles: Hough transform and principal
component analysis (PCA). It is well recognized by the geophysical community that the lack
of resolution and poor structural identification in seismic data recorded at sub-salt plays
represent severe technical and economical problems. Under such circumstances, seismic
interpretation based only on the human-eye is inaccurate. Additionally, petroleum field
development decisions and production planning depend on good-quality seismic images that
generally are not feasible in salt tectonics areas. In spite of this, morphological erosion, region
growing and, especially, a generalization of the Hough transform (closely related to the Radon
transform) are applied to build parabolic shapes that are useful in the idealization and
recognition of salt domes from 2D seismic profiles. In a similar way, PCA is also used to
identify shapes associated with complex salt bodies in seismic profiles extracted from 3D
seismic data. To show the validity of the new set of seismic results, comparisons between both
image processing techniques are exhibited. It is remarkable that the main contribution of this
work is oriented in providing the seismic interpreters with new semi-automatic computational
tools. The novel image processing approaches presented here may be helpful in the
identification of diapirs and other complex geological features from seismic images.
Conceivably, in the near future, a new branch of seismic attributes could be recognized by
geoscientists and engineers based on the encouraging results reported here.

Keywords: diapirs, Hough transform, image processing, feature extraction, salt bodies, seismic
exploration, profiles, pattern recognition

1. Introduction in seismic volumes (3D) without the full assistance of an


interpreter. Several advances in DIP have been made in
The analysis of seismic data is important for understanding the the computer science research area and in its application
subsurface of the earth, but the process is usually not a simple to other areas, particularly medical images (Udupa 1999)
task. Because of the growing volume and resolution of seismic and detection, recognition and tracking of human features
data, digital image processing (DIP) is becoming an important (e.g. faces, heads, arms) (Turk and Pentland 1991). These
component of this process, aiming to detect geological features advances have not been fully applied to geophysical science,

1742-2132/11/010061+13$33.00 © 2011 Nanjing Geophysical Research Institute Printed in the UK 61


M G Orozco-del-Castillo et al

and we believe that automated and semi-automated seismic facies in data using a back-propagation neural network (Yang
interpretations are feasible and very useful. and Huang 1991).
Traditionally seismic analysis has been done by human- Another line of investigation corresponds to fault surfaces.
eyed empirical interpretation of just some of the processed Where it is not required to extract the actual fault surfaces,
2D slices of the whole 3D seismic volume. This usually some methods have been employed to enhance a fault structure
implies loss of information. Due to the increasing volume (Weickert 1999, Bakker et al 1999, Fehmers and Höcker 2003).
and resolution of seismic data, along with the increasing Despite the fact that filtering methods have proven useful to the
computational power, direct processing and semi-automatic interpreters, there is still a significant human experience-based
interpretation of 3D seismic data are becoming more practical interpretation to be done after its application. Hence, there
(Jeong et al 2006). Seismic interpretation can be broadly has been a trend to automatically or semi-automatically detect
subdivided into two components (Cohen and Coifman 2002): faults from seismic cubes in recent years. For example, Cohen
structural, which investigates the nature and geometry of the et al (2006) proposed a method for detecting and extracting
subsurface structures, and stratigraphic, which investigates the fault surfaces by creating and processing a volume of estimates
subsurface stratigraphy. A first step in seismic interpretation from seismic data which represents the likelihood that a given
usually consists of image segmentation, and it relies heavily on point lays on a fault surface. Jeong et al (2006) developed
the human visualization of sophisticated and complex images. a volumetric, seismic fault detection system aimed for an
Seismic interpretation also involves feature discrimination and interactive nonlinear 3D processing. This system was also
visualization, both of which are fundamental to exploratory combined with a graphics processing unit, showing the benefits
data analysis in many other areas of science. over a CPU implementation.
An early and significant contribution was the coherence Oil and gas prospecting has found a major challenge
cube, proposed by Bahorich and Farmer (1995). Their work in regions with complex geological settings, like areas with
has served other research areas related to the coherence salt tectonics. Broad areas exist in the world where seismic
concept, like the robust coherence estimation algorithm based depth imaging is a difficult task due to the progressive lack
on multiple traces with locally adapted similarity (semblance) of resolution beneath the presence of salt bodies. As oil
exploration targets may be located close or below salt bodies,
measures (Marfurt et al 1998). Another variant of the
in the underlying geologic structure, there is a growing interest
coherence cube based on eigenanalysis of the covariance
in computational tools that can help seismic interpreters
matrix was proposed by Gersztenkorn and Marfurt (1999).
to estimate geometry, position and depth distribution of
A practical survey of several variants of the coherence cube
diapirs from seismic profiles and volume data. Under
algorithm can be found in Chopra (2002). Some other
favourable circumstances, traditional seismic processing and
approaches include a more efficient discontinuity measure
interpretation can provide an appropriate location of the top of
computation method using a normalized trace of a small
salt bodies. Nevertheless, estimation of the base of salt domes
correlation matrix (Cohen and Coifman 2002). Applications
(and geometry distribution of salt at depth in general) is often a
with high-order statistics and supertrace techniques for more
difficult task. In the application of standard seismic processing
accurate coherence estimation are presented by Lu et al (2005). techniques it is common to find complex wave diffraction
Some efforts have been made to automatically detect and patterns giving rise to a significant lack of illumination near
classify geological features, such as seismic facies. Seismic and below salt bodies.
facies are groups of seismic reflections whose parameters (such A traditional and successful approach to seismic data
as amplitude, continuity, reflection geometry and frequency) analysis has been the Radon transform (Moon et al 1986,
differ from those of adjacent groups (West et al 2002). Foster and Mosher 1992, Trad et al 2003). A very similar
Seismic facies analysis involves two key steps: (a) their approach in the field of DIP has been the Hough transform
classification and (b) their interpretation to produce a geologic (Hough 1962) which, just like the Radon transform, is a
and depositional interpretation. West et al (2002) presented mapping from image space to a parameter space. The Hough
an application of textural analysis to 3D seismic volumes, transform has been applied for error analysis (Aguado et al
combining image textural analysis with a neural network 2000, Jacquemin and Mallet 2005, Montana 1992, Niblack
classification to quantitatively map seismic facies in 3D data. and Petkovic 1988, Rosenhahn et al 2001, Shapiro 1975,
A similar approach based on competitive neural networks for 1978a, 1978b), reduction of the computational complexity
the classification and identification of reservoir facies from (Kiryati and Bruckstein 1992), extensions to other shapes
seismic data was presented by Saggaf et al (2003). Some (Aguado et al 1995, Ballard 1981, Van Ginkel 2002), choice
other automatic techniques for classification of seismic facies of the appropriate parameterization (Duda and Hart 1972, Van
include identification of the boundaries of rapidly varying sand Veen and Groen 1981, Westin and Knutsson 1992), tracking
facies using a Bayesian linear decision function (Matlock et al and pose estimation (Princen et al 1994), etc. The Hough
1985); determination of the sand/shale ratio of various zones transform has also been successfully applied to seismology,
in the reservoir using discriminant factor analysis (Mathieu using a variant of this methodology based on a cascade of
and Rice 1969); identification of seismic facies using both two Hough transforms and a specific backward transformation
principal component analysis (PCA) and discriminant factor to automatically extract faults from a 3D seismic cube
analysis (Dumay and Fournier 1988); segmentation of a (Jacquemin and Mallet 2005).
seismic section based on its texture through a knowledge-based The detection of geologic patterns on seismic profiles
expert system (Simaan 1991) and detection of anomalous can be thought of as a complex problem, i.e. a problem

62
Seismic data interpretation using Hough transform and PCA

that is best suited for a human than a machine, at least satisfy yi = hxi + k for different values of h and k. However,
in the traditional use of computing power. Interpreters of writing this equation as k = yi − hxi and considering the hk
seismic data are usually able to detect patterns successfully plane (also known as the parameter space), the equation for a
despite changes in the images due to noise and variations unique line for a determined pair (xi , yi ) is obtained. Besides,
from one body to another and even from one seismic profile a second point (xj , yj ) also has a line in the parameter space
to another. A main problem for the interpretation of seismic associated with it, and this one intersects the line associated
profiles is being able to automate or semi-automate the abilities with (xi , yi ) in (h , k  ), where h is the slope and k is the y-
of an interpreter using a computational method. Much of intercept of the line that contains (xi , yi ) and (xj , yj ) in the xy
the work in automated pattern recognition ignores the issue plane. In fact, all of the points contained in this line have lines
of which aspects of the pattern stimulus are important for in the parameter space that pass through (h , k  ).
identification. For example, one of the most important goals of The main attraction for the use of the Hough transform
seismic stratigraphy is to recognize and analyse seismic facies comes from the subdivision of the parameter space in what
regarding the geologic environment. Two main problems are denominated accumulator cells, where (hmax , hmin ) and
become evident: to determine which seismic parameters are (kmax , kmin ) are the expected range values for the slope and
relevant for characterizing the facies and to be sure that there is the y-intercept, respectively. The cells with coordinates
a link between the seismic parameters and the geologic facies (i, j), with an accumulator value A(i, j), correspond to the
we are investigating (Dumay and Fournier 1988). Seismic square associated with the coordinates in the parameter space
pattern recognition and detection of geological structures are (hi , kj ). Initially, these cells are set to zero. Then, for each
also very high level operations for which the classification point (xp , yp ) of the image plane, the parameter h is fixed to
according to detailed geometry or specific information may each and every one of the allowed values of subdivision for
be very difficult, inefficient, and either way, probably useless. the x axis, and k is obtained from solving the corresponding
When trying to detect seismic patterns, the interpreter does equation k = yk − hxk . After this procedure, a value of M
not focus his or her attention on the geometric or quantitative in A(i, j) corresponds to M points of the xy plane located in
characteristics of them, but rather on their global features, the line y = hi x + kj . The precision of the collinearity of
which may or may not be directly related to our intuitive notion these points is determined by the number of subdivisions of
of features. In other words, the interpreter is able to detect the the hk plane. When subdividing the x axis in P increments,
principal components of the patterns, but he is probably not for each point (xp , yp ), the P values of k correspond to the P
able to tell which those components are. Interpretation of possible values of h. With n image points, this method implies
seismic data is a very high level task. By these means, seismic nP operations. Therefore, this procedure is linear in n, and nP
pattern recognition is very similar to other pattern recognition operations are easily calculated using a standard computer.
tasks, like speech or face recognition. Face recognition is also The Hough transform was originally defined to detect
a very high level task, where humans perform considerably straight lines in black and white images. As it is trivial to
better than computers, especially for quantitative and statistical generalize the Hough transform to other shapes and grey-
methodologies which attempt to detect individual features, and value images, we describe it in its extended form. We
define a face model by the position, size and relationships set up an N-dimensional accumulator array; each dimension
among these features. corresponding to one of the parameters of the shape looked for.
In this work we study the feasibility of applying two Each element of this array contains the number of votes for the
distinct pattern recognition approaches as auxiliary tools for presence of a shape with the parameters corresponding to that
seismic methods to improve detection of salt bodies and element. Of course, if a shape with certain fixed parameters
determination of their complex geometry: a Hough transform is present in the image, all of the pixels that are part of it will
mathematical morphology approach and a PCA approach. ‘vote’ for it, yielding a large peak in the accumulator array. For
These two methodologies, while not entirely automatic, clearly this particular example, we generalize the Hough transform
have the potential to reduce the workload of the seismic to detect parabolas, which in comparison with lines consist of
interpreter by asking him to define how the salt bodies look, three different parameters. The use of three parameters instead
by either establishing some initial parameters or manually of two implies a dimensional increase in the parameter space
selecting some training images, instead of looking for the salt and consequently in the accumulator array, now being 3D and
bodies manually throughout the whole seismic cube. no longer 2D as in the case of line detection.

2. The Hough transform mathematical morphology 2.2. Mathematical morphology


approach
The field of mathematical morphology provides a wide range
of operators to image processing, all based around a few
2.1. Overview of the Hough transform methodology
simple mathematical concepts from set theory. The operators
Hough (1962) originally proposed his methodology, currently are particularly useful for the analysis of binary images and
referred to as Hough transform, to detect straight lines with the common uses include edge detection, noise removal, image
intention of finding bubble tracks. Consider a point (xi , yi ) enhancement and image segmentation.
and the explicit general equation of a line, yi = hxi + k. An The two most basic operations in mathematical
infinite number of lines pass through (xi , yi ), and all of them morphology are erosion and dilation. Both of these operators

63
M G Orozco-del-Castillo et al

(a)

(b)

Figure 1. Original seismic profile (a) and the binarized segment of the profile used in this section, showing three distinct salt bodies (b).

take two pieces of data as inputs: an image to be eroded or 2.3. Detection of salt bodies using the Hough transform
dilated, and a structuring element (also known as a kernel).
The two pieces of input data are each treated as representing In the segment of the seismic profile shown in figure 1, several
salt bodies with parabolic-like shapes can be observed. The
sets of coordinates in a way that is slightly different for binary
aim of this work was to develop a way to extract them from the
and greyscale images.
rest of the profile by featuring them as parabolas with defined
For a binary image, white pixels are normally
parameters. The proposed methodology consists of digitally
used to represent foreground regions, while black pixels
processing the original seismic profile using mathematical
denote background (note that in some implementations this
morphology, such that the final product is appropriate to
convention is reversed, as is the case in this work for visual
analysis through the Hough transform.
purposes). Then the set of coordinates corresponding to
that image is simply the set of two-dimensional Euclidean
coordinates of all the foreground pixels in the image, with a 2.3.1. DIP of the original seismic profile. The values in
coordinate’s origin normally taken in one of the corners so that this profile (figure 1(b)) were initially modified so it could
all coordinates have positive elements, i.e. each element of a be seen as a greyscale image of 8 bits (0–255), and later
set belongs to N 2 . binarized (thresholded to have values of 0 and 1) because the
Let A and B be two sets of N 2 , with components techniques used further on for segmentation and recognition
a = (a1 , a2 ) and b = (b1 , b2 ), respectively. The translation of are designed for binary images. For visual purposes, the black
A through x = (x1 , x2 ), represented as (A)x is defined as pixels correspond to the value 1, while the white pixels to 0.
The standard generalized Hough transform can theoretically
(A)x = {c|c = a + x, a ∈ A}. (1) be applied to this image, but the quantity of information in
the image would considerably affect the time and the results
Let A denote a binary image and B denote a structuring element of the analysis, mostly because a great part of the information
(usually a square mask of values equal to 1; in this paper a does not represent the objects we are trying to extract. It was
3 × 3 mask was used). Then the erosion of A by B is given by therefore needed to apply an erosion operation that would
A ⊕ B = {x|(B)x ⊆ A}. (2) diminish the quantity of non-useful information, i.e. that
information not representing the salt bodies. When taking
So, the erosion of A by B is the set of all the points x so into account that the non-useful areas are not so large in terms
that B, translated through x, is contained in A. This means of size (quantity of pixels), then the erosion operation results
that, only when B is completely contained inside A, the pixel are even more appropriate. In other words, the segments
values are retained, else they get deleted, i.e. they get eroded. of useful information are formed by large accumulations of
Although equation (2) is not the only definition for erosion, it pixels, while the others are mostly formed by narrow lines or
is normally the most adequate in practical implementations of small accumulations. Accounting for this, one should erode
morphology. the image until the non-useful information has disappeared;

64
Seismic data interpretation using Hough transform and PCA

(a)

(b)

(c)

Figure 2. (a) The binarized seismic profile image after two binary erosion operations, (b) the result of the region growing on it, and (c) the
final results of the region growing applied in (b) with the centroids calculated from (a).

we experimentally determined that nine erosion operations of the remaining black areas after nine erosion operations
are needed to achieve this (figure 2(a)). The inconvenience is (figure 2(a)).
that, by its nature, the erosion operation also erases some of Applying region growing with these seeds to the original
the relevant information. To avoid this issue, we decided to image recovers both useful and, due to the connectivity of the
implement and apply a region growing algorithm to recover the original regions, non-useful information. To solve this issue,
lost information due to the erosion operations. Region growing we need to break the connectivity of the useful areas from the
non-useful ones in the original image. As we did before, this
is a commonly used procedure that groups pixels or sub-
can be achieved through erosion. At this stage, the erosion
regions to greater ones. The simplicity of this method resides
operation is not intended to remove unwanted areas in the
in the pixel aggregation that starts with a set of generating image, but rather to break the connectivity of the useful from
points (seeds) from which the regions ‘grow’ by adding to each the non-useful areas, so that the region growing algorithm can
of these points the neighbour points that have similar properties be successfully applied. We experimentally determined that
(e.g. grey level, texture, colour) (Gonzalez and Woods 1992). two erosion operations are sufficient to achieve this effect, as
Since we are dealing with binary images, the region growing shown in figure 2(b). It is in this image in which we applied
algorithm we used is the simplest of its possible variations; it region growing with the centroids calculated from figure 2(a),
simply looks for all of the black pixels that are connected and the final result is shown in figure 2(c). Notice how a
to the seed. The seeds were calculated as the centroids large portion of the non-useful information from the original

65
M G Orozco-del-Castillo et al

Table 1. Parameters of the three parabolas found using the Hough


transform.
a h k

1 0.0025 170 130


2 0.0025 534 174
3 0.0047 1260 287

It is now necessary to vary the parameters a and h, from


amin to amax and hmin to hmax , respectively. Since the point
(h, k) represents the vertex of the parabola in the image, it is
obvious that hmin = 1 and hmax is the number of columns.
The values of amin and amax that control the aperture of the
parabola were determined experimentally (according to the
possible apertures of the pseudo-parabolas representing the
salt bodies) as amin = 0.002 and amax = 0.03, and the
intermediate values were distributed logarithmically, unlike h
and k which were distributed linearly. With these values, and
the possible values for k (from 1 to the number of rows in the
image), a three-dimensional parameter space is created with
accumulator cells where all possible parabolas from the black
Figure 3. Workflow of the Hough transform approach. pixels in the image shown in figure 2(c) are stored. After the
transformation process, a cube with dimensions corresponding
image has been eliminated, leaving the three areas of the image to the rows, columns and the number of intervals for a was
corresponding to the three salt bodies that we intend to extract. obtained. This cube contains in each cell the amount of
There remains a group of pixels in the lower part of the image ‘votes’ that the parabola received, with parameters given by
which do not correspond to a salt body. However, the curve the location of the cell in the space. To extract those cells
recognition is not affected by means of the Hough transform with the greater amount of votes without extracting nearby
nature. The process is illustrated in figure 3. cells that represent different parabolas, but the same salt body,
a min–max clustering was applied. The detection of the
‘peaks’ in this 3D space is not a very complicated task since
2.3.2. The Hough transform analysis for the detection of the the values are clearly above those in cells not representing
parabolic-like top of salt bodies. We originally described parabolas; however, it could be possible that the extraction of
the Hough transform methodology in its original and most the parabolas from the parameter space is more complicated,
commonly used form, applied for the detection of lines (whose so some other kind of search could be needed. Montana (1992)
equation is given by yi = hxi + k). To achieve the recognition proposed a genetic search of a generalized Hough transform
of parabolas, the modification of the usual form of the Hough space for the detection and tracking of a class of sonar signals,
transform is needed. The general equation of a parabola is which yielded very narrow peaks not too far above the random
given by background variations. He concluded that the genetic search
1 required far fewer evaluations to outperform an exhaustive
y−k = (x − h)2 , (3) search algorithm.
4m
The result of the clustering yielded the three parabolas
where the vertex is located at (h, k), focused at (h, k + m) and
with the greatest number of votes. The parameters of these
directrix y = k − m, with m being the distance from the vertex
parabolas are shown in table 1. These parabolas were overlaid
to the focus. For simplicity purposes, it is possible to rewrite
in the original images as shown in figure 4.
equation (3) as
The application of this methodology to one seismic
y − k = a(x − h)2 , (4) profile yields the parameters for three different parabolas that
resemble the top of their corresponding salt body. Since the
where a = 1/4m.
salt bodies show similar characteristics across the different
It is obvious that the parameter space (or Hough space)
profiles of the seismic cube, the successive application of the
that represents parabolas is a three-dimensional space formed aforementioned procedure would result in a set of parameters
by the parameters (a, h, k), instead of the two-dimensional for each salt body.
space representing lines, formed by the parameters (h, k). This
implies that, to accomplish the spatial transformation, we need
to clear one of the variables and change the values of the other 3. The PCA approach
two from a minimum to a maximum value for each one. For
We consider that the automatic detection of seismic patterns
example, clearing k from equation (4) we obtain
should emulate how a human interpreter actually does it,
k = y − a(x − h)2 . (5) i.e. a qualitative approach instead of a quantitative one. By

66
Seismic data interpretation using Hough transform and PCA

these means, seismic pattern detection is very similar to other body images, which we call ‘salt-body-space’. Each vector is
high-level pattern detection tasks like face detection. One of of length N 2 and is a linear combination of the original salt
the most successful approaches to automatic qualitative face bodies’ images. Because these vectors are the eigenvectors of
detection was proposed by Turk and Pentland (1991). Using the covariance matrix corresponding to the original salt bodies’
a PCA approach, their near-real-time computer system was images, and because they are salt-body-like in appearance, we
able to locate and track a subject’s head, and then recognize refer to them as eigenbodies.
the person by comparing characteristics of the face to those of Let the training set of salt bodies’ images be
known individuals. This is why we believe that an information B1 , B2 , B3 , . . . , BM . The average body of the set is defined by
theory approach like PCA of coding and decoding seismic
1 
M
patterns, in this particular case salt bodies’ images, may give an = Bn . (6)
insight into the information content of them, emphasizing their M n=1
significant local and global features, rather than quantitative Each body differs from the average by the vector
ones.
i = Bi − . (7)
Mathematically, we wish to find the principal components
of the distribution of the salt bodies, or the eigenvectors This set of very large vectors is then subject to principal
of the covariance matrix of a training set of salt bodies’ component analysis. We need to calculate the eigenvalues
images, treating an image as a point (or vector) in a very and eigenvectors of the covariance matrix
high dimensional space. The eigenvectors should then be
1 
M
ordered, with each one accounting for a different amount of C= n Tn = AAT , (8)
the variation among the salt bodies’ images according to their M n=1
respective eigenvalues. These can be thought of as a set of where matrix A = [1 2 ... M ]. Matrix C, however, is N 2
features that together characterize the variation between salt × N 2 , and determining the N 2 eigenvectors and eigenvalues
bodies’ images. Each individual body in the training set (minus is an intractable task for typical image sizes. The problem is
the average salt body) can be represented exactly in terms of a solved by Turk and Pentland by calculating the eigenvalues
linear combination of the eigenvectors. Each body can also be and eigenvectors of the M × M matrix given by L = AT A,
approximated using only the ‘best’ eigenvectors, i.e. those with instead of C = AAT , where the first (most significant)
the largest associated eigenvalues, which therefore account for eigenvectors and eigenvalues, are indeed the same. This
the most variance within the set of salt body images. The alternative methodology enables the near-real-time calculation
best M eigenvectors span an M-dimensional subspace of all of the eigenvectors and eigenvalues. These vectors v determine
possible images (salt-body-space). The construction of this linear combinations of the M training set images to form the
PCA approach system consists of the following. eigenbodies ul ,
(1) Acquiring an initial set of salt bodies’ images (the training 
M
set). ul = vlk k l = 1, . . . , M. (9)
k=1
(2) Calculating the eigenvectors from the covariance matrix
The eigenbody images calculated from the eigenvectors of L
of the training set, and keeping only the M images that
span a basis set with which to describe salt bodies’ images. In
correspond to the M highest eigenvalues. These M images
this framework, identification becomes a pattern recognition
define a subspace of the salt-body-space.
task. The eigenbodies span an M -dimensional subspace of
Once the construction has been achieved, the following steps the original N 2 image space. The M significant eigenvectors
are then used to detect new salt bodies. of the L matrix are chosen as those with the largest associated
eigenvalues.
(1) Tracking of a seismic profile capturing input images.
A new salt body image B is transformed into its
(2) Projecting the input image onto the salt-body-space.
eigenbodies’ components (projected into ‘salt-body-space’)
(3) Determining if the image is a seismic body by measuring
by a simple operation,
the distance of the original image (as a vector) to the
salt-body-space. ωk = uTk (B − ), (10)

for k = 1, . . . , M . The weights form a vector
3.1. Theoretical background [ω1 , ω2 , . . . , ωM  ] that describes the contribution of each
eigenbody in representing the input salt body image, treating
Let a salt body image I (x, y) be a two-dimensional N × N the eigenbodies as a basis set for salt body images. This
array of intensity values. An image may also be considered as process is equivalent to projecting the original salt body image
a vector of dimension N 2 . A set of images maps to a collection onto the low-dimensional salt-body-space. The distance ε
of points in this space. Images presenting the same pattern, in between the new input image and the salt-body-space is
our case salt bodies, will not be randomly distributed in this simply the squared distance (Euclidean norm) between the
space since they have a similar overall configuration, and can mean-adjusted input image and the salt-body-space f =
be described by a lower dimensional subspace. The main idea M 
of the principal component analysis is to find the vectors that i=1 ωi ui :

best account for the distribution of salt body images within the ε = (B − ) − f . (11)
entire image space. These vectors define the subspace of salt

67
M G Orozco-del-Castillo et al

Figure 4. Original image with overlaid resulting parabolas from table 1.

Figure 5. Set of the 15 initial training images obtained manually from several profiles of the seismic cube after applying the Gaussian
function in equation (12).

3.2. Application to seismic profiles where x and y correspond to the values of the rows and columns
of the image respectively, ranging between 1 and M (230), and
The purpose is to detect the salt bodies present in a seismic σ is the standard deviation or aperture of the Gaussian, which
cube. A profile of this cube is shown in figure 1(a). A was experimentally determined as 2.5. Greater values for
closer look at the seismic profile makes three salt bodies quite σ would tend to leave the image intact, while smaller ones
evident (as shown in figure 1(b)). The detection of salt bodies would drastically obscure the entire salt bodies, instead of just
is sensitive to the background, i.e. the system is not able to the corners of the images which do not account to the bodies.
determine the limit of a salt body, so it is also taken into Therefore, the multiplied image would correspond to
consideration by the detection and recognition processes. To −(i 2 +j 2 )
deal with this problem without having to solve other difficult Iij e σ 2
Gij = , (13)
image-processing problems such as the segmentation of the 2π σ 2
salt body (which would also mean having to detect the body to where i and j correspond to the values of the rows and columns
segment it, and having it segmented for its detection), we respectively of the image, and I is the original image. We
performed a pointwise product of the input images (both assembled a set of 15 training images obtained manually from
the training images and the new input images) by a two- several profiles of the seismic cube. Figure 5 shows these after
dimensional Gaussian window centred on the salt body, thus applying the Gaussian function to every one of them. While
diminishing the background and accentuating the middle of these images were obtained manually by selecting areas of
the body. The two-dimensional Gaussian is given by different profiles showing salt bodies, the results from the
−(x 2 +y 2 )
Hough transform methodology could be used as inputs for this
e σ2 procedure to improve the automation of the detection of the
h(x, y) = (12)
2π σ 2 salt bodies throughout the seismic cube.

68
Seismic data interpretation using Hough transform and PCA

Figure 6. The 15 ‘best’ eigenbodies (ordered), corresponding to the 15 eigenvectors with the largest eigenvalues associated with them, as
calculated in equation (9).

(a) (b)

(c) (d)

Figure 7. Four different images and their projection on the salt-body-space. The left images on (a) and (c) show two images corresponding
to salt bodies, and the left images on (b) and (d) correspond to other images in the same seismic cube. Notice how in (a) and (c) the
right-hand-side images are similar, while those on the left-hand-side, in (b) and (d), considerably differ.

The first step is to calculate the mean image of the the projected image differs considerably from the original,
set of images, what we can call the average salt body. as shown in figure 7. The images in figures 7(a) and (c)
Once we have the mean image, according to the procedure correspond to salt bodies and their projection on the salt-body-
described before, we are able to obtain the 15 (due to the space, meanwhile the others in figures 7(b) and (d) correspond
number of training images) greatest eigenvalues and their to the areas of the seismic profile that do not contain these
associated (most representative) eigenvectors of the covariance bodies, and their respective projections.
matrix given by equation (9). These vectors determine linear
Since not all of the eigenvectors, and consequently the
combinations of the 15 training set salt bodies’ images to form
the eigenbodies shown in figure 6. Notice how the images in eigenbodies, have the same contribution in representing the
the first row represent the general form of the bodies, while input body image, it is advisable to take into consideration
those in the last row represent their details. only M of them because of processing-time issues, where M
These eigenbodies are used to project an image into the is less or equal to the number M of training images. The effect
salt-body-space by the operation described in equation (10). of M to an input image is shown in figure 8. The first image
According to how an image is projected into this space, we can is the original image; the following four are projections of this
know if the image corresponds to an element of the space, i.e. image on the salt-body-space using values of 1, 5, 10 and 15,
represents a salt body. When a salt body image is projected respectively for M . It can be seen that the projections are
into this space, the result is a very similar image, but if not adequate and fairly similar in all cases; therefore, the choice

69
M G Orozco-del-Castillo et al

Figure 8. Different projections of the same image. The image on the left shows the original image, while the four images on the right
correspond to projections of the original image onto the salt-body-space using values of 1, 5, 10 and 15 for M , i.e. using 1, 5, 10 and 15
eigenvectors, respectively.

(a)

(b)

(c)

Figure 9. Results of the image tracking. The seismic profile on which the methodology was applied is shown in (a). The results of image
tracking are shown in (b) and (c), for values of 2 and 15 respectively for M . Notice how in (c) the bright spots corresponding to salt bodies
are more evident and there appear fewer bright areas than in (b).

of M is not critical in terms of performance, but is important for the training of the system (figure 9(a)), but similar enough
in terms of processing time. for the salt bodies to be evident in it. The results are shown in
figures 9(b) and (c); the first image shows the seismic profile,
while the other two show the results using values of 2 and
3.3. Image tracking and parameter determination
15 for M , respectively. Bright areas correspond to areas
The procedure explained so far allows recognizing whether an where the appearance of salt bodies is more probable, i.e.
image is a salt body or not, but in order to semi-automate areas where the value of ε as calculated in equation (13) are
the recognition in seismic profiles/cubes it is needed to lower. The image using M = 2 shows a greater number of
‘show’ the system every possible image in them. There is bright areas overall than the one for M = 15 but, despite
an obvious tradeoff between precision and time, but, not being these differences, they both show three very distinct bright
the main goal of this work, we decided to track a complete spots, each one corresponding to the salt bodies present in the
seismic profile pixel by pixel, capturing all of the possible seismic profile.
images present in it and measuring the distance (according to The final results for M = 3 and a pixel-by-pixel tracking of
equation (11)) of its projection onto the salt-body-space the seismic profile are shown in figure 10(a). As stated before,
(equation (10)) to the salt-body-space itself. We do this in the bright areas correspond to areas where there is a greater
a horizontally rotated seismic profile, distinct from those used possibility of finding a salt body. Despite several of these

70
Seismic data interpretation using Hough transform and PCA

(a)

(b)

Figure 10. Image tracking results as an image and as a 2D function. The image in (a) corresponds to the grey-level image (0–255) obtained
by a pixel-by-pixel tracking using M = 3. The image in (b) shows the same results, but as a 2D function. Brighter spots in (a) correspond to
greater values in (b).

areas, there clearly appear three bright spots corresponding to algorithms for the location of peaks and valleys. Other
the three salt bodies in the seismic profile. The results are more approaches for peak detection correspond to data mining, such
obvious if this image is plotted as a two-dimensional function, as hierarchical clustering, partitioning relocation clustering,
as shown in figure 10(b). In this new image, the three bright and density-based partitioning (Berkhin 2002). The choice
spots, corresponding to the three salt bodies, appear as peaks of which peak detection technique should be used in the
in a relatively smooth image. The determination of how strong automation of the salt body detection process throughout the
or how great the peaks should be to represent a salt body and whole seismic cube will depend on the characteristics of the
how to find them in a dataset is a pattern recognition technique data obtained from the application of the PCA methodology,
that will depend on the task at hand. In this case, a simple and is out of the scope of this work.
thresholding of the image successfully isolates the location of
the peaks from the rest of the image; for other cases, where 3.4. Extraction of salt bodies
the contrast of the peaks corresponding to the location of salt For an appropriate interpretation of seismic profiles, it is
bodies to the rest of the image could not be as obvious, an helpful to display the recognized patterns in three dimensions.
image segmentation or clustering technique should be used. In the previous sections we introduced a novel approach to
This also holds true for the future automation of the salt body salt dome identification by using the Hough transform and
detection process in a seismic cube. This task of peak detection PCA. By using these approaches we are able to identify the
is very similar to the detection of the cell with the greater approximate location of salt bodies inside a seismic profile.
amount of votes in the Hough parameter space, which was This location is used to infer coordinates of points inside the
addressed earlier on. Peak detection in data is a relevant matter salt bodies that will be used as seeds for 3D region growing in a
of research, where several techniques have been proposed and seismic cube. Once the salt bodies are identified in the seismic
proved useful. Some approaches use shape information of cube and in order to obtain always closed forms for region
the histogram of the data to achieve multilevel thresholding growing in 3D, we used conventional seismic interpretation
(Prewitt and Mendelsohn 1966, Weszka et al 1974, Otsu 1979). in several profiles. Results from the application of region
Kanungo et al (2006) proposed a segmentation technique growing to the extraction of the estimated salt bodies are
based on both thresholding the data and applying genetic depicted in figure 11.

71
M G Orozco-del-Castillo et al

paper. Special thanks are given to K Marfurt for his fruitful


discussions on feature extraction and pattern recognition
methods applied to oil exploration. This contribution was
supported by project IMP/D.000475, D.00468, SENER-
Conacyt 128376. We also thank the French CNRS for the
financial support by grant 94154.

References
Aguado A S, Montiel M E and Nixon M S 1995 Ellipse detection
via gradient direction in the Hough transform Proc. 5th Int.
Conf. on Image Processing and its Applications (Edinburgh,
UK) pp 375–8
Aguado A S, Montiel M E and Nixon M S 2000 Bias error analysis
of the generalized Hough transform J. Math. Imaging
Vis. 12 25–42
Bahorich M and Farmer S 1995 3D seismic discontinuity for faults
and stratigraphics features: the coherence cube Leading
Edge 14 1053–8
Bakker P, Van Vliet L J and Verbeek P W 1999 Edge preserving
Figure 11. Three-dimensional geometry of the recognized salt orientation adaptive filtering Proc. IEEE Comp. Soc. Conf. on
bodies using region growing on the seismic volume. Computer Vision and Pattern Recognition (CVPR’99) vol 1
pp 1535–40
Ballard D H 1981 Generalizing the Hough transform to detect
4. Conclusions arbitrary shapes Pattern Recognit. 13 111–22
Berkhin P 2002 Survey of clustering data mining techniques
The Hough transform was successfully applied to seismic Technical Report (Accrue Software)
profiles to detect parabolic shapes, which were associated with Chopra S 2002 Coherence cube and beyond First Break
the presence of salt bodies. We believe that this methodology 20 27–33
Cohen I and Coifman R R 2002 Local discontinuity measures for
could be useful for the detection of other geologic structures, 3D seismic data Geophysics 67 1933–45
including other salt bodies which do not present parabolic Cohen I, Coult N and Vassiliou A 2006 Detection and extraction of
shapes. The Hough transform could be generalized to detect fault surfaces in 3D seismic data Geophysics 71 21–7
more complicated geometric structures like hyperbolas or Duda R O and Hart P E 1972 Use of the Hough transformation to
high-order polynomials; the biggest disadvantage would be detect lines and curves in pictures Commun. ACM 15 11–5
Dumay J and Fournier F 1988 Multivariate statistical analyses
that the parameter space resulting from the application of the applied to seismic facies recognition Geophysics 53 1151–9
generalized Hough transform would be represented by a higher Fehmers G and Höcker C 2003 Fast structural interpretation with
dimensional space. This can be prohibitive when generated structure-oriented filtering Geophysics 68 1286–93
and/or processed by a conventional PC. We also applied PCA Foster J and Mosher C 1992 Suppression of multiple reflections
for the semi-automatic recognition of salt bodies in the same using the Radon transform Geophysics 57 386–95
Gersztenkorn A and Marfurt K J 1999 Eigenstructure-based
seismic profile, where only the manual selection of images coherence computations as an aid to 3D structural and
containing them is needed. The method appears to be capable stratigraphic mapping Geophysics 64 1468–79
of extracting conspicuous geological features from the data. Gonzalez R C and Woods R E 1992 Digital Image Processing
A PCA system working directly with 3D data should improve (Reading, MA: Addison Wesley)
the accuracy of the detections, although it would also increase Hough P V C 1962 Method and means for recognising complex
patterns US Patent 3969654
significantly the time required for the training of the system Jacquemin P and Mallet J L 2005 Automatic faults extraction using
and the detection process itself. The detection process of both double Hough transform SEG Technical Program Expanded
the Hough transform and PCA methodologies yielded similar Abstracts vol 24 pp 755–8
results, but with the first one the top of salt body (vertex of Jeong W, Whitaker R and Dobin M 2006 Interactive 3D seismic
the parabola) is found, while the bright points obtained from fault detection on the Graphics Hardware Proc. IEEE VGTC
5th Int. Workshop on Volume Graphics (VG 06) (Eurographics
the PCA methodology correspond to the centre of the salt Association, Boston, MA) pp 111–8
bodies. As we find these results encouraging, we believe that Kanungo P, Nanda P K and Samal U C 2006 Image segmentation
more complex patterns of geological units could be recognized using thresholding and genetic algorithm Proc. Conf. on Soft
and extracted by using other generalizations of the Hough Computing Technique for Engineering Applications, SCT 2006,
transform and PCA processing. (Rourkela, India, 24–26 March 2006)
Kiryati N and Bruckstein A M 1992 What’s in a set of points? IEEE
Trans. Pattern Anal. Mach. Intell. 14 496–500
Acknowledgments Lu W, Li Y, Zhang S and Xiao H 2005 Higher-order-statistics and
supertrace-based coherence-estimation algorithm
Geophysics 70 13–8
We would like to acknowledge Jim Spurlin for his critical Marfurt K J, Kirlin R, Farmer S L and Bahorich M S 1998 3D
reading and suggestions to refine this work. We also thank seismic attributes using a semblance-based coherence
Sandra Pineda for her detailed revision of the grammar in this algorithm Geophysics 63 1150–65

72
Seismic data interpretation using Hough transform and PCA

Mathieu P G and Rice G W 1969 Multivariate analysis used in the Shapiro S D 1978a Feature space transforms for curve detection
detection of stratigraphic anomalies from seismic data Pattern Recognit. 10 129–43
Geophysics 34 507–15 Shapiro S D 1978b Properties of transforms for the detection of
Matlock R J, McGowen R S and Asimakopoulos G 1985 Can curves in noisy pictures Comput. Graph. Image
seismic stratigraphy problems be solved using automated Process. 8 219–36
pattern analysis and recognition? 55th Ann. Int. Meeting Soc. Simaan M A 1991 A knowledge-based computer system for
Expl. Geophys. (Expanded abstracts session S17.7) segmentation of seismic sections based on texture 61st Ann. Int.
Montana D 1992 Genetic search of a generalized Hough transform Meeting Soc. Expl. Geophys. pp 289–92 (Expanded abstracts)
space Proc. SPIE Conf. on Adaptive and Learning Systems Trad D, Ulrych T and Sacchi M 2003 Latest views of the sparse
vol 1706 pp 14–21 Radon transform Geophysics 68 386–99
Moon W, Carswell A, Tang R and Dilliston C 1986 Radon Turk M and Pentland A 1991 Eigenfaces for recognition J. Cogn.
transform wave field separation for vertical seismic profiling Neurosci. 3 71–86
data Geophysics 51 940–7 Udupa J K 1999 Three-dimensional visualization and analysis
Niblack W and Petkovic D 1988 On improving the accuracy of the methodologies: a current perspective Radiographics 19
Hough transform: theory, simulations, and experiments Proc. 783–806
IEEE Computer Society Conf. CVPR pp 574–9 Van Ginkel M 2002 Image analysis using orientation space based on
Otsu N 1979 A threshold selection method from gray-level steerable filters PhD Thesis Delft University of Technology,
histograms IEEE Trans. Syst., Man., Cybern. 9 62–6 Delft, The Netherlands
Princen J, Illingworth J and Kittler J 1994 Hypothesis testing: a Van Veen T M and Groen F C A 1981 Discretization error in the
framework for analyzing and optimizing Hough transform Hough transform Pattern Recognit. 14 137–45
performance IEEE Trans. Pattern Anal. Mach. Weickert J 1999 Coherence-enhancing diffusion filtering Int. J.
Intell. 16 329–41 Comput. Vis. 31 111–27
Prewitt J M S and Mendelsohn M L 1966 The analysis of cell West B, May S, Eastwood E and Rossen C 2002 Interactive seismic
images Ann. N. Y. Acad. Sci. 128 1035–1053 facies classification using textural attributes and neural
Rosenhahn B, Krüger N, Rabsch T and Sommer G 2001 Tracking networks Leading Edge 21 1042–49
with a novel pose estimation algorithm Proc. Int. Workshop on Westin C F and Knutsson H 1992 The Möbius strip
Robot Vision pp 9–18 parameterization for line extraction Proc. ECCV-92, Computer
Saggaf M, Toksöz N and Marhoon M 2003 Seismic facies Vision vol 588 pp 33–8 (Lecture Notes in Computer Science)
classification and identification by competitive neural networks Weszka J S, Nagel R N and Rosenfeld A 1974 A threshold selection
Geophysics 68 1984 technique IEEE Trans. Comput. C-23 1322–6
Shapiro S D 1975 Transformations for the computer detection of Yang F M and Huang K Y 1991 Multi-layer perception for the
curves in noisy pictures Comput. Graph. Image detection of seismic anomalies 61st Ann. Int. Meeting Soc.
Process. 4 328–38 Expl. Geophys. pp 309–12 (Expanded abstracts)

73

You might also like