The Hough Transform - Finding Simple Curves: The Hough Transform

Finding Simple Curves: The Hough Transform

8.2 The Hough Transform

The method from Paul Hough—originally published as a US Patent [111] and often referred to as the “Hough transform” (HT)—is a general approach to localizing any shape that can be deﬁned para- metrically within a distribution of points [64, 117]. For example, many geometrical shapes, such as lines, circles, and ellipses, can be readily described using simple equations with only a few parameters.

Since simple geometric forms often occur as part of man-made objects, they are especially useful features for analysis of these types of images (Fig. 8.2).

The Hough transform is perhaps most often used for detecting straight line segments in edge maps. A line segment in 2D can be described with two real-valued parameters using the classic slope- intercept form

y=k·x+d, (8.1)

Fig. 8.2 Simple geometrical forms such as sections of lines, circles, and ellipses are often found in man-made objects.

162

8.2The Hough Transform

x y

p₁= (x₁, y₁)

p₂= (x₂, y₂)

y₁=k·x₁+d y₂=k·x₂+d

Fig. 8.3

Two points,p₁andp₂, lie on the same line wheny₁ = kx₁+dandy₂=kx₂+dfor a particular pair of parametersk andd.

wherekis the slope anddthe intercept—that is, the height at which the line would intercept the y axis (Fig. 8.3). A line segment that passes through two given edge pointsp₁= (x₁, y₁) andp₂= (x₂, y₂) must satisfy the conditions

y₁=k·x₁+d and y₂=k·x₂+d, (8.2) fork, d∈R. The goal is to find values ofkanddsuch that as many edge points as possible lie on the line they describe; in other words, the line that fits the most edge points. But how can you determine the number of edge points that lie on a given line segment? One possibility is to exhaustively “draw” every possible line segment into the image while counting the number of points that lie exactly on each of these. Even though the discrete nature of pixel images (with only a finite number of different lines) makes this approach possible in theory, generating such a large number of lines is infeasible in practice.

8.2.1 Parameter Space

The Hough transform approaches the problem from another direction. It examines all the possible line segments that run through a single given point in the image. Every lineL_j =kj, d_jthat runs through a pointp₀= (x₀, y₀) must satisfy the condition

L_j:y₀=k_jx₀+d_j (8.3) for suitable values k_j, d_j. Equation 8.3 is underdetermined and the possible solutions fork_j, d_j correspond to an inﬁnite set of lines passing through the given pointp₀ (Fig. 8.4). Note that for a givenk_j, the solution ford_j in Eqn. (8.3) is

d_j=−x₀·k_j+y₀, (8.4) which is another equation for a line, where nowk_j, d_jare thevariables andx₀, y₀ are the constantparametersof the equation. The solution set {(k_j, d_j)} of Eqn. (8.4) describes the parameters of all possible linesL_j passing through the image pointp₀= (x₀, y₀).

For an arbitrary image point p_i = (x_i, y_i), Eqn. (8.4) describes the line

M_i:d=−x_i·k+y_i (8.5) with the parameters −x_i, y_i in the so-called parameter or Hough space, spanned by the coordinates k, d. The relationship between

163

8Finding Simple Curves: The Hough Transform

Fig. 8.4 A set of lines passing through an image point. For all possible linesL_jpassing through the pointp₀ = (x₀, y₀), the equationy₀ = k_jx₀+d_j holds for appropriate values of the parametersk_j, d_j.

x y

p₀

L₁

L₂

L₃ L₄

(x, y)imagespace and (k, d)parameter space can be summarized as follows:

Image Space(x, y) Parameter Space (k, d) Point p_i= (xi, yi) ←→ Mi:d=−xi·k+yi Line Line Lj: y=kj·x+dj ←→ q_j= (kj, dj) Point Each image pointp_i and its associated line bundle correspond to exactly one line M_i in parameter space. Therefore we are interested in those places in the parameter space where lines intersect. The example inFig. 8.5illustrates how the linesM₁ andM₂ intersect at the position q₁₂ = (k₁₂, d₁₂) in the parameter space, which means (k₁₂, d₁₂) are the parameters of the line in the image space that runs through both image pointsp₁andp₂. The more linesM_ithat intersect at a single point in the parameter space, the more image space points lie on the corresponding line in the image! In general, we can state:

IfNlines intersect at position (k, d) inparameter space, then N image points lie on the corresponding liney =kx+d in image space.

Fig. 8.5 Relationship between image space and parameter space.

The parameter values for all possible lines passing through the image pointp_i= (x_i, y_i) in image space (a) lie on a single lineM_iin parameter space (b). This means that each pointq_j = (k_j, d_j) in parameter space corresponds to a single lineL_jin image space. The intersection of the two linesM₁,M₂at the point q₁₂= (k₁₂, d₁₂) in parameter space indicates that a lineL₁₂ through the two pointsk₁₂and d₁₂exists in the image space.

x y

k d

p₁= (x₁, y₁)

p₂= (x₂, y₂)

M₁:d=−x₁·k+y₁ M₂:d=−x₂·k+y₂

q₁₂= (k₁₂, d₁₂) L₁₂

(a)x/yImage space (b)k/dParameter space

8.2.2 Accumulator Map

Finding the dominant lines in the image can now be reformulated as ﬁnding all the locations in parameter space where a signiﬁcant number of lines intersect. This is basically the goal of the HT. In order 164

8.2The Hough Transform

y d

(a) Image space (b) Accumulator map

Fig. 8.6

The accumulator map is a discrete representation of the parameter space (k, d). For each image point found (a), a discrete line in the parameter space (b) is drawn. This oper- ation is performedadditively so that the values of the array through which the line passes are incremented by 1. The value at each cell of the accumulator array is the number of parameter space lines that intersect it (in this case 2).

to compute the HT, we must ﬁrst decide on a discrete representation of the continuous parameter space by selecting an appropriate step size for thek and d axes. Once we have selected step sizes for the coordinates, we can represent the space naturally using a 2D array.

Since the array will be used to keep track of the number of times parameter space lines intersect, it is called an “accumulator” array.

Each parameter space line is painted into the accumulator array and the cells through which it passes are incremented, so that ultimately each cell accumulates the total number of lines that intersect at that cell (Fig. 8.6).

8.2.3 A Better Line Representation

The line representation in Eqn. (8.1) is not used in practice because for vertical lines the slope is inﬁnite, that is,k=∞. A more practi- cal representation is the so-calledHessian normal form (HNF)¹ for representing lines,

x·cos(θ) +y·sin(θ) =r, (8.6) which does not exhibit such singularities and also provides a natural linear quantization for its parameters, the angleθ and the radius r (Fig. 8.7).

With the HNF representation, the parameter space is deﬁned by the coordinates θ, r, and a point p = (x, y) in image space corresponds to the relation

r(θ) =x·cos(θ) +y·sin(θ), (8.7) for angles in the range 0≤ θ < π (see Fig. 8.8). Thus, for a given image point p, the associated radius r is simply a function of the angleθ. If we use the center of the image (of sizeM×N),

x_r=

!x_r y_r

= 1 2·

!M N

, (8.8)

1 The Hessian normal form is a normalized version of the general (“alge- braic”) line equationAx+By+C= 0, withA = cos(θ), B = sin(θ), andC=−r(see, e.g., [35, p. 194]).

165

8Finding Simple Curves: The Hough Transform

Fig. 8.7 Representation of lines in 2D.

In the commonk, drepresen- tation (a), vertical lines pose a problem becausek = ∞. The Hessian normal form (b) avoids this problem by representing a line by its angleθ

and distancerfrom the origin. x x

y y

(x, y) (x, y)

k=∞ d= ?

y=kx+d x·cos(θ) +y·sin(θ) =r

(a) (b)

as the reference point for thex/yimage coordinates, then it is possible to limit the range of the radius to half the diagonal of the image, that is,

−r_max≤r(θ)≤r_max, with r_max= ¹₂

M²+N². (8.9) We can see that the functionr(θ) in Eqn. (8.7) is the sum of a cosine and a sine function onθ, each being weighted by thexandycoordi- nates of the image point (assumed to be constant for the moment).

The result is again a sinusoidal function whose magnitude and phase depend only on the weights (coeﬃcients) x, y. Thus, with the Hes- sian parameterization θ/r, an image point (x, y) does not create a straight line in the accumulator map A(i, j) but a unique sinusoidal curve, as shown inFig. 8.8. Again, each image point adds a curve to the accumulator and each resulting cluster point corresponds to to a dominant line in the image with a proportional number of points on it.²

Fig. 8.8 Image space and parameter space using the HNF representation. The image (a) of size M×Ncontains four straight linesL_a, . . . , L_d. Each point on an image line creates a sinusoidal curve in theθ/rpa- rameter space (b) and the corresponding line parameters are indicated by the clearly visible cluster points in the accumulator map. The reference point x_rfor thex/ycoordinates lies at the center of the image. The line anglesθ_iare in the range [0, π) and the associated radii r_iare in [−r_max, r_max] (the lengthr_maxis half of the image diagonal). For example, the the angleθ_aof lineL_ais approximatelyπ/3, with the (positive) radiusr_a≈0.4r_max. Note that, with this parameterization, lineL_chas the angleθ_c≈2π/3 and theneg- ativeradiusr_c ≈ −0.4r_max.

Image Space (x/y) Parameter Space (θ/r)

−x +x

−y

y i

j m−1 n−1

x_r

a a

b b

d d

r_max r_a

θ_a r_b

r_c θ_c

M 2 M

2 N

2 N 2

−r r_max

−r_max 0

0 0

π 0

π 2

(a) (b)

2 Note that, inFig. 8.8(a), the positive direction of they-coordinate runs upwards (unlike our usual convention for image coordinates) to stay in line with the previous illustrations (and high school geometry). In practice, the consequences are minor: only the rotation angle runs in the opposite direction and thus the accumulator image in Fig. 8.8(b) was mirrored horizontally for proper display.

166

Dalam dokumen Digital Image Processing (Halaman 182-187)