Move-based Algorithms for the Optimization of an Isotropic Gradient MRF Model

(1)

Move-based Algorithms for the Optimization of an Isotropic Gradient MRF Model

Behrooz Nasihatkon Richard Hartley

5 December 2012

NICTA Funding and Supporting Members and Partners

(2)

Outline

•

Total Variation

•

Current Methods

•

The 3-clique Model

•

Move-based Algorithms

•

Main Theorem

•

Conclusion

(3)

Total Variation

• goodregularizer.

• For a functionx: Ω⊆Rⁿ →RTotal Variation is defined as TV(^x) =

Z

Ω

|∇_tx(t)|dt

• discontinuity preserving(edge preserving for images).

x₁(t) x₂(t) TV(^x1) =TV(^x2)

(4)

Total Variation

Z

Ω

|∇_tx(t)|dt

x₁(t) x₂(t) TV(^x1) =TV(^x2)

(5)

Total Variation

Z

Ω

|∇_tx(t)|dt

x₁(t) x₂(t) TV(^x1) =TV(^x2)

(6)

Modeling TV using MRFs

• Approximate Total Variation using an MRF,

• A set of nodes 1,2, . . . ,n,

• A set of labelsx= [x₁,x₂, . . . ,x_n],x_i∈ L.

• Energy function

E(x) =X

i

f_i(x_i) + ˜TV(x),

(7)

Current Models

• Approximate Magnitude of Gradient usingedge-basedpotentials.

TV˜ (x) = X

(i,j)∈C²

w_ij|x_i−x_j|

• Magnitude of Gradient (MoG) at each nodei is approximated by

MoG(i) =X

j∈Ni

w_ij|x_i−x_j|

b

bbb b

b b bbbb bb

bbb b b bbbb bb

bbb b b

b

bb b b bb

b

4Neighbourhood 8Neighbourhood 16Neighbourhood

(8)

4-connected model

MoG(i) =|x_i−x_j|+|x_i−x_k|

MoG(ⁱ) =1

b

b bbbbbb

b bbbb bi j

k

(9)

Diagonal Edges

MoG(ⁱ) =|x_i−x_j|+|x_i−x_k|+

√ 2

2 |x_i−x_l|

MoG(ⁱ) =1

b

b bbbbbb

b bbbb bi j

k ^b l

(10)

The 3-Clique Model

• The gradient vector≈

x_j−x_i x_k−x_i

,

• For ordered labelsx_i ∈ {1,2, . . . ,M} MoG(ⁱ) =

q

(x_i−x_j)²+ (x_i−x_k)²

• For general labelsx_i ∈ Lwith a semi-metricd

MoG(i) = q

d(x_i,x_j)²+d(x_i,x_k)²

b

b bbbbbb

b bbbb b

i j

k

(11)

The 3-Clique Model

x_j−x_i x_k−x_i

,

q

(x_i−x_j)²+ (x_i−x_k)²

MoG(i) = q

b

b bbbbbb

b bbbb b

i j

k

(12)

The 3-Clique Model

x_j−x_i x_k−x_i

,

q

(x_i−x_j)²+ (x_i−x_k)²

MoG(i) = q

b

b bbbbbb

b bbbb b

i j

k

(13)

The 3-clique Model

4 neighbours 8 neighbours 3-cliques

(14)

Move-Based Algorithms

min

x∈L

X

i

f_i(x_i) +γ X

(i,j,k)∈C³

q

• Move-based approachis a popular way of optimizing Multi-label MRFs.

• Optimizing the multi-label MRF iteratively by solvinga series of binary MRF optimizations.

b

b bbbbbb

b

x1x2x3x4· · ·

b

b bbbbbb

b

u1u2u3u4· · ·

(15)

The Alpha-Expansion Algorithm

• Nodes have a choice to switch toαor stay unchanged:

l_α⁰(x_i) =x_i l_α¹(x_i) =α

• lû_α(x) = [l_αû¹(x₁),l_αû²(x₂), . . . ,l_αûⁿ(x_n)]

procedureALPHA-EXPANSION(x,L) repeat

for eachα∈ Ldo u^∗ ←argmin_uE(l^u_α(x)) x ←l^u_α^∗(x)

end for untilconvergence return x

end procedure

(16)

The Alpha-Beta Swap Algorithm

• Nodes with labelsαorβ have a chance to swap.

l_α,β⁰ (x_i) =

α ifx_i ∈ {α, β}

x_i otherwise l_α,β¹ (x_i) =

β ifx_i∈ {α, β}

x_i otherwise

procedureALPHA-BETA-SWAP(x,L) repeat

for eachα, β∈ L×Ldo u^∗ ←argmin_uÊ(lû_α,β(x)) x ←lû_α,β^∗ (x)

end for untilconvergence

(17)

General Move Algorithm

• Take arbitraryl⁰andl¹

l⁰(x_i) =arbitrary l¹(x_i) =arbitrary

• The pair of functions(l⁰,l¹)is called theupdate policy.

• State Preservation Property

∀x∈ L l⁰(x) =x or l¹(x) =x

(18)

General Move Algorithm

• Take arbitraryl⁰andl¹

l⁰(x_i) =arbitrary l¹(x_i) =arbitrary

• The pair of functions(l⁰,l¹)is called theupdate policy.

• State Preservation Property

∀x∈ L l⁰(x) =x or l¹(x) =x

(19)

Solving the Binary Problem

• How to solve

u^∗ ←argmin_u^E(l^u(x))

• Energy functions consisting ofquadraticandcubicterms are solvable by graph-cuts if and only if they aresubmodular¹.

• The functionf:{0,1}×{0,1} →R^issubmodularif f(0,1) +f(1,0)≥f(0,0) +f(1,1).

• A pseudo-Boolean function ofnvariables is submodular ifany restrictionto any pair of variables is submodular.

1Kolmogorov and Zabih 2004.

(20)

Central Question

• Main Question: Given

E(x) = X

(i,j,k)∈C³

q

what choice of policy(l⁰,l¹)results in a submodularE(l^u(x))as a function ofu, so we can solve

u^∗ ←argmin_u^E(l^u(x))

(21)

Main Theorem (General Case)

E(x) = X

(i,j,k)∈C³

q

Theorem

Assume d:L × L →R^{is a}semi-metricand the update policy has the state preservation property, the energy function E_x⁰(u) =E(l^u(x))is submodular for allx,if and only iffor any three labels x,y,z∈ L

d(x,y¹)−d(x,y⁰)

d(x,z¹)−d(x,z⁰)

≥0, where x^u is a compact form for l^u(x).

(22)

Main Theorem (Ordered Labels)

E(x) =P

(i,j,k)∈C³

p(x_i−x_j)²+ (x_i−x_k)²

(Middlebury Dataset)

Proposition

WithL={0,1, . . . ,M−1}and d(x,y) =|x−y|(ordered labels), and havingstate preservation propertyfor the update policy, the energy function E_x⁰(u) =E(l^u(x))is submodular for allxif and only if(l⁰,l¹)is a mirrored update policy.

(23)

Mirrored Policy

Definition

An update policy(l⁰,l¹)is calledmirroredif

(i) ∀x ∈ A l⁰(x)<l¹(x)or∀x∈ A l⁰(x)>l¹(x), (ii) ∃µ∈ Lsuch that∀x ∈ A

l⁰(x)+l¹(x)

2 ∈ {µ, µ+¹

2, µ+1}.

µ

y¹ z⁰ z¹ y⁰

v⁰ v¹

w¹ w⁰

µ+1

(24)

Main Theorem (Unordered Labels)

E(x) =P

(i,j,k)∈C³

p1(x_i 6=x_j) +1(x_i6=x_k)

(Buffalo-Xiph.org)

Proposition

With d(x,y) =1(x6=y)(unordered labels), and assuming thestate preservation propertyfor the update policy, the energy function E_x⁰(u) =E(l^u(x))is submodular for allxif and only if for any pair of active labels y,z ∈ A, we have l⁰(y)6=l¹(z).

(25)

Mirrored Swap

ls⁰(x) =

min(x,s−x) 0≤s−x<M,

x otherwise, ^l

1 s(x) =

max(x,s−x) 0≤s−x<M,

x otherwise.

procedureMIRRORED-SWAP(x,M) repeat

for eachs∈ {1,2, . . . ,2n−3}do u^∗ ←argmin_u^E(l^u_s(x))

x ←_l^u_s(x) end for untilconvergence return x

end procedure

s/2

0 1 2 3 4 5 6 7 8

(26)

Conclusion

• Suits MRFs with thecontinuousandorderedlabels,

• Alpha-expansion cannot be applied toordered labels,

• Mirrored Swap algorithm for theordered labels,

• Forunordered labels, the submodularity holds for vaster types of binary moves, including alpha-expansion and alpha-beta swap.

(27)

Thanks

(28)

Questions?

??

? ?

(29)