Approximation Algorithm: Survivable Network Design Problem

This is the lecture notes from Chandra Chekuri's CS583 course on Approximation Algorithms. Chapter 13: Survivable Network Design Problem. You can read Chapter 14: Introduction to Cut and Partitioning Problems, here. Chapter 12: Primal Dual for Constrained Forest Problems, here.

Chapter 13

In this chapter we consider the Survivable Network Design Problem problem. The input is an undirected graph

G = (V, E)

with edge-weights

c : E \to R_{+}

and integer requirements

r (u v)

for each pair of vertices

u v

. We wrote

u v

instead of

(u, v)

to indicate that the requirement function is for unordered pairs (alternatively,

r (u, v) = r (v, u)

for all

u, v

). The goal is to find a min-cost subgraph

H = (V, F)

G

such that each the connectivity betweeen

u

and

v

H

is at least

r (u v)

. We obtain two versions of the problem: EC-SNDP if the connectivity requirement is edge-connectivity and VC-SNDP for vertex connectivity. It turns out that EC-SNDP is much more tractable than VC-SNDP and we will focus on EC-SNDP.

Figure 13.1: Example of EC-SNDP. Requirement only for three pairs. A feasible solution shown in the second figure as red edges. In this example the paths for each pair are also vertex disjoint even though the requirement is only for edge-disjointness.

For EC-SNDP there is a seminal work of Jain based on iterated rounding that yields a 2-approximation as a special case of a more general problem. Prior to his work there was an augmentation based approach that yields

2 k

and

2 H_{k}

approximations where

k = max_{u v} r (u v)

is the maximum connectivity requirement. Despite being superceded by Jain's result in terms of the ratio, the augmentation approach is important for various reasons and we will discuss both.

We first consider the LP relaxation for the EC-SNDP. We do this by setting up the requirement function

f : 2^{V} \to Z

where we let

f (S) = max_{u \in S, v \in V - S} r (u v)

. The goal is to find a min-cost subgraph

H

G

such that

δ_{H} (S) \geq f (S)

Claim 13.0.1. The requirement function

f

that captures EC-SNDP is proper and hence skew-supermodular.

Proof. It is easy to see

f

is symmetric. Consider disjoint sets

A, B

. Suppose

f (A \cup B) = k

which means that there is some

s \in A \cup B

and

t \in V - (A \cup B)

such that

r (s t) = k

. If

s \in A

then

f (A) \geq k

and if

s \in B

then

f (B) \geq k

. Therefore

max {f (A), f (B)} \geq k = f (A \cup B)

since

s \in A

s \in B

13.1 Augmentation approach

The augmentation approach for EC-SNDP is based on iteratively increasing the connectivity of pairs from

1

k

where

k = max_{u v} r (u v)

. In fact this works for any proper function

f : 2^{V} \to Z

and we will work in this generality rather than focus only on EC-SNDP.

Claim 13.1.1. Let

f

be a proper function and let

p

be an integer. Then the truncated function

g_{p} : 2^{V} \to Z

defined as

g_{p} (S) = min {p, f_{p} (S)}

is proper.

Proof. Exercise.

Lemma 13.1. Let

G = (V, E)

be graph and let

f : 2^{V} \to Z

be a proper function and let

p \geq 0

be a non-negative integer. Let

X \subseteq E

be a set of edges such that

| δ_{X} (S) | \geq g_{p} (S)

Consider the function

h_{p + 1} : 2^{V} \to {0, 1}

where

h_{p + 1} (S) = 1

iff

f (S) \geq p + 1

and

| δ_{X} (S) | = p

. Then

h_{p + 1}

is uncrossable and symmetric.

Proof. Consider the function

g_{p + 1}

which is proper and hence also skew-supermodular. For notational convenience we use

h

for

h_{p + 1}

. Suppose

h (A) = h (B) = 1

. This implies

g_{p + 1} (A) \geq p + 1

and

g_{p + 1} (B) \geq p + 1

and

| δ_{X} (A) | = δ_{X} (B) = p

g_{p + 1}

is skewsupermodular. First case is when

g_{p + 1} (A) + g_{p + 1} (B) \leq g_{p + 1} (A \cup B) + g_{p + 1} (A \cap B)

. This implies that

g_{p + 1} (A \cup B) = g_{p + 1} (B) = p + 1

. By submodularity of

| δ_{X} |

we have

| δ_{X} (A) | + | δ_{X} (B) | \geq | δ_{X} (A \cap B) | + | δ_{X} (A \cup B) |

and by feasibility of

X

for

g_{p}

we have

| δ_{X} (A \cap B) | = | δ_{X} (A \cap B) | = p

. This implies that

h (A \cap B) = h (A \cup B) = 1

Similarly, if

g_{p + 1} (A) + g_{p + 1} (B) \leq g_{p + 1} (A - B) + g_{p + 1} (B - A)

we can argue that

h (A - B) = h (B - A) = 1

via posi-modularity of

| δ_{X} |

Exercise 13.1. Let

G = (V, E)

be a graph and let

f : 2^{V} \to Z

be a proper function. Suppose

F

be a feasible cover for

g_{p}

. Let

h_{p + 1}

be the residulal uncrossable function that arises from

g_{p}

as in the preceding lemma. Let

F^{'} \subseteq E ∖ F

be a feasible cover for

h_{p + 1}

in the graph

G^{'} = (V, E ∖ F)

. Then

F \cup F^{'}

is a feasible cover for

g_{p + 1}

Lemma 13.2. Let

f

be the requirement function of an instance of EC-SNDP in

G = (V, E)

and let

p

be an integer and let

X \subseteq E

be a set of edges. There is a polynomial time algorithm to find the minimal violated sets of

g_{p}

with respect to

X

Proof. For each pair or nodes

(s, t)

find a source-minimal

s - t

mincut

S

in the graph

H = (V, X)

and a sink-minimal mincut

T

via maxflow^[1]. Let

S

be the cut. If

| δ_{X} (S) | < p

then

p

is a violated set. We compute all such minimal cuts over all pairs of vertices and take the minimal sets in this collection. We leave it as an exercise to check that the minimal violated sets of

g_{p}

are the minimal sets in this collection and will be disjoint.

Corollary 13.1. Let

f

be the requirement function of an instance of EC-SNDP in

G = (V, E)

and let

p

be an integer. Let

X

be set of edges such that

X

is feasible to cover

g_{p}

. In the graph

G^{'} = (V, E ∖ X)

and for any

F \subseteq (E ∖ X)

the minimal violated sets of

h_{p + 1}

with respect to

F

can computed in polynomial time.

Proof. The minimial violated sets of

h_{p + 1}

with respect to

F

are the same as the minimal violated sets of

g_{p + 1}

with respect to

X \cup F

\underline{Augmentation-Algorithm (G = (V, E), f)}

1. If

E

does not cover

f

output "infeasible"

k = max_{S} f (S)

is the maximum requirement

A ⟵ \emptyset

4. for

(p = 1

k)

G^{'} = (V, E ∖ A)

B. Let

g_{p}

be the function defined as

g_{p} (S) = min {f (S), p}

C. Let

h_{p}

be the uncrossable function where

h_{p} (S) = 1

iff

g_{p} (S) > | δ_{A} (S) |

D. Find

A^{'} \subseteq E ∖ A

that covers

h_{p}

G^{'}

A \leftarrow A \cup A^{'}

5. Output

A

Figure 13.2: Example to illustrate the augmentation approach. Top picture shows a set of edges that connect all pairs with connectivity requirement at least

1

. Second picture shows the residual graph in which one needs to solve the augmentation problem. Note that

s_{2}

and

t_{2}

are isolated vertices in the residual graph, however, the cuts induced by them are already satisfied by the edges chosen in the first iteration.

Theorem 13.2. The augmentation algorithm yields a

2 k

-approximation for EC-SNDP where

k

is the maximum connectivity requirement.

Proof. We sketch the proof. The algorithm has

k

iterations and in each iteration it uses a black box algorithm to cover an uncrossable function. We saw a primaldual

2

-approximation for this problem. We observe that if

F^{*}

is an optimum solution to the given instance then in each iteration

F^{*} ∖ A

is a feasible solution to the covering problem in that iteration. Thus the cost paid by the algorithm in each iteration can be bound by

2 c (F^{*})

and hence the total cost is at most

2 k

OPT. The preceding lemmas argue that the primal-dual algorithm can be implemented in polynomial time.

Remark 13.1. A different algorithm that is based on augmentation in reverse yields a

2 H_{k}

approximation where

H_{k}

is the

k^{'}

th harmonic number. We refer the reader to ^[2].

13.2 Iterated rounding based 2-approximation

In the section we describe the seminal result of Jain ^[3] who obtained a

2

-approximation for EC-SNDP via iterated rounding. He proved a more general polyhedral result. Consider the problem of covering a skew-supermodular function

f : 2^{V} \to Z

by the edges of a graph

G = (V, E)

. The natural cut covering LP relaxation for the problem is given below.

\begin{aligned} min \sum_{e \in E} c (e) x_{e} \\ \sum_{e \in δ (S)} x_{e} & \geq f (S) S \subset V \\ x_{e} & \in [0, 1] e \in E \end{aligned}

Note that upper bound constraints

x_{e} \leq 1

are necessary in the general setting when

f

is integer valued since we can only take one copy of an edge. The key structural theorem of Jain is the following.

Theorem 13.3. Let

x

be a basic feasible solution to the LP relaxation. Then there is some edge

e \in E

such that

x_{e} = 0

x_{e} \geq 1 / 2

With the above in place, and the observation that the residual function of a skew-supermodular function is again a skew-supermodular function, one obtains an interative rounding algorithm.

\underline{Cover-Skew-Supermodular (G, f)}

1. If

E

does not cover

f

output "infeasible"

A \leftarrow \emptyset, g = f

3. While

A

is not a feasible solution do

A. Find an optimum basic feasible solution

x

to cover

g

G^{'} = (V, E ∖ A)

B. If there is some

e

such that

x_{e} = 0

then

E \leftarrow E - {e}

C. Else If there is some

e

such that

x_{e} \geq 1 / 2

then

A = A \cup {e}

g = f_{A}

(recall

f_{A} (S) = f (S) - | δ_{A} (S) |

)

4. Output

A

Corollary 13.4. The integrality gap of the cut LP is at most

2

for any skew-supermodular function

f

Proof. We consider the iterative rounding algorithm and prove the result via induction on

m

the number of edges in

G

. The base case of

m = 0

is trivial since the function has to be

0

Let

x^{*}

be an optimum basic feasible solution to the LP relaxation. We have

\sum_{e \in E} c_{e} x_{e}^{*} \leq

OPT. We can assume without loss of generality that

f

is not trivial in the sense that

f (S) \geq 1

for at least some set

S

, otherwise

x = 0

is optimal and there is nothing to prove. By Theorem 13.3, there is an edge

\tilde{e} \in E

such that

x_{\tilde{e}}^{*} = 0

x_{\tilde{e}}^{*} \geq 1 / 2

. Let

E^{'} = E ∖ \tilde{e}

and

G^{'} = (V, E^{'})

. In the former case we can discard

\tilde{e}

and the current LP solution restricted to

E^{'}

is a feasible fractional solution and we obtain the desired result via induction since we have one less edge.

The more interesting case is when

x_{\tilde{e}}^{*} \geq 1 / 2

. The algorithm includes

\tilde{e}

and recurses on

G^{'}

and the residual function

g : 2^{V} \to Z

where

g (S) = f (S) - | δ_{\tilde{e}} (S) |

. Note that

g

is skew-supermodular. We observe that

A^{'} \subseteq E^{'}

is a feasible solution to cover

g

G^{'}

iff

A^{'} \cup {\tilde{e}}

is a feasible solution to cover

f

G

. Furthermore, we also observe that the fractional solution

x^{'}

obtained by restricting

x

E^{'}

is a feasible fractional solution to the LP relaxation to cover

g

G^{'}

. Thus, by induction, there is a solution

A^{'} \subseteq E^{'}

such that

c (A^{'}) \leq 2 \sum_{e \in E^{'}} c (e) x_{e}^{*}

. The algorithm outputs

A = A^{'} \cup {\tilde{e}}

which is feasible to cover

f

G

. We have

c (A) = c (A^{'}) + c (\tilde{e}) \leq c (A^{'}) + 2 c (\tilde{e}) x_{\tilde{e}}^{*} \leq 2 \sum_{e \in E^{'}} c (e) x_{e}^{*} + 2 c (\tilde{e}) x_{\tilde{e}}^{*} = 2 \sum_{e \in E} c (e) x_{e}^{*} .

We used the fact that

x_{\tilde{e}}^{*} \geq 1 / 2

to upper bound

c (\tilde{e})

2 c (\tilde{e}) x_{\tilde{e}}^{*}

2-approximation for EC-SNDP : We had already seen that the requirement function for EC-SNDP is skew-supermodular. To applyTheorem 13.3 and obtain a 2-approximation for EC-SNDP we need to argue that the LP relaxation can be solved efficiently. We observe that the LP relaxation at the top level can be solved efficiently via maxflow. We need to check that in the graph

G

with edge capacities given by the fractional solution

x

the min-cut between every pair of vertices

(s, t)

is at least

r (s, t)

. Note that the algorithm is iterative. As we proceed the function

g = f_{A}

where

f

is the original requirement function and the

A

is the set of edges already chosen.

Exercise 13.2. Prove that there is an efficient separation oracle for each step of the iterative rounding algorithm when

f

is the requirement function for a given EC-SNDP instance.

We now prove Theorem 13.3. The proof consists of two steps. The first step is a characterization of basic feasible solutions via laminar tight sets. The second step is a counting argument.

13.2.1 Basic feasible solutions and laminar family of tight sets

Let

x

be a basic feasible solution to the LP relaxation. We are done if there is any edge

e

such that

x_{e} = 0

x_{e} = 1

. Hence the interesting case is when

x

is fully fractional, that is,

x_{e} \in (0, 1)

for every

e \in E

Definition 13.5. A set

S \subseteq V

is tight with respect to

x

x (δ (S)) = f (S)

The LP relaxation is of the form

A x \geq b, x \in [0, 1]^{m}

. We number the edges as

e_{1}, e_{2}, \dots, e_{m}

arbitarily. Note that each row of

A

corresponds to a set

S

and the non-zero entries in the row corresponding to

S

are precisely for edges in

δ (S)

. For notational convenience we use

χ_{S}

to denote the

m

-dimensional row vector where

χ_{S} (i) = 0

e_{i} \notin δ (S)

and

χ_{S} (i) = 1

e \in δ (S)

. By the rank lemma, if

x

is a basic feasible solution that is fully fractional, then there are

m

tight sets

S_{1}, S_{2}, \dots, S_{m}

such that the vectors

χ_{S_{1}}, χ_{S_{2}}, \dots, χ_{S_{m}}

are linearly independent in

R^{m}

. In other words

x

is the unique solution to the system

χ_{S_{i}}^{T} x = f (S_{i}), 1 \leq i \leq m

. Note that for a given basic feasible solution

x

there can be many such bases. A key technical lemma is that one choose a nice one.

Lemma 13.3. Let

x

be a basic feasible solution to the cut covering LP relaxation of a skew-supermodular function

f

where

x_{e} \in (0, 1)

for all

e

. Then there is a laminar family

L

of tight sets

S_{1}, S_{2}, \dots, S_{m}

such that

x

is the unique solution to the system

χ_{S_{i}}^{T} x = f (S_{i})

Figure 13.3: Laminar family of tight sets.

We need an auxiliary uncrossing lemma.

Lemma 13.4. Suppose

A

and

B

are two tight sets with respect to

x

such that

A, B

cross. Then one of the following holds:

• $A \cap B, A \cup B$ are tight and $χ_{A} + χ_{B} = χ_{A \cup B} + χ_{A \cap B}$ .
• $A - B, B - A$ are tight and $χ_{A} + χ_{B} = χ_{A - B} + χ_{B - A}$ .

Proof. Since

f

is skew-supermodular

f (A) + f (B) \leq f (A \cap B) + f (A \cup B)

f (A) + f (B) \leq f (A - B) + f (B - A)

. We will consider the first case.

A, B

are tight, hence

x (δ (A)) = f (A)

and

x (δ (B)) = f (B)

. Moreover the function

h (S) = x (δ (S))

is submodular (recall that the cut capacity function in an undirected graph is symmetric submodular). Thus

x (δ (A)) + x (δ (B)) \geq

x (δ (A \cup B)) + x (δ (A \cap B))

. We also have by feasibility of

x

that

x (δ (A \cup B)) \geq f (A \cup B)

nad

x (δ (A \cap B)) \geq f (A \cap B)

. Putting together we have

x (δ (A)) + x (δ (B)) = f (A) + f (B) \leq f (A \cap B) + f (A \cup B) \leq x (δ (A \cup B)) + x (δ (A \cap B)) \leq x (δ (A)) + x (δ (B)) .

This implies that

x (δ (A \cup B)) = f (A \cup B)

and

x (δ (A \cap B)) = f (A \cap B)

. Thus both

A \cap B

and

A \cup B

are tight. Moreover we observe that

x (δ (A)) + x (δ (B)) = x (δ (A \cup B)) + x (δ (A \cap B)) + 2 x (E (A - B, B - A))

where

E (A - B, B - A)

is the set of edges between

A - B

and

B - A

. From the above tightness we see that

x (δ (A)) + x (δ (B)) = x (δ (A \cup B)) + x (δ (A \cap B))

, and since

x

is fully fractional it means that

E (A - B, B - A) = \emptyset

. This implies that

χ_{A} + χ_{B} = χ_{A \cup B} + χ_{A \cap B}

(why?).

The second case is similar where we use posimodularity of the cut function.

Proof of Lemma 13.3. One natural way to proceed is as follows. We start with tight sets

S = {S_{1}, S_{2}, \dots, S_{m}}

such that

x

is characterized as the unique solution of the equations implied by these sets. If the family

{S_{1}, S_{2}, \dots, S_{m}}

is laminar we are done. Otherwise we pick some two crossing sets, say

S_{1}, S_{2}

without loss of generality and uncross them using Lemma ??. We get a new family

S^{'}

with

m

tight sets and the number of crossings in the new family goes down by at least one (as we saw in Lemma ?? previously). Suppose we replace

S_{1}, S_{2}

S_{1} \cap S_{2}, S_{1} \cup S_{2}

. The technical issue is to argue linear independence of the vectors in the new family. This is where we need the property

χ_{S_{1}} + χ_{S_{2}} = χ_{S_{1} \cap S_{2}} + χ_{S_{1} \cup S_{2}}

. Although natural, the linear algebraic argument turns out to be a bit tedious.

Instead we use a slick argument. Let

L

be a maxmial laminar family of

x

-tight sets such that the vectors

χ_{S}, S \in L

are linearly independent. If

L = m

then we are done because we have

m

linearly independent vectors that together span

R^{m}

. Suppose

| L | < m

. Then there must be a tight set

S

such that

χ_{S}

is not spanned by the vectors in

L

. Choose a tight set

S

that is not spanned and crosses the fewest number of sets from

L

. Since

L

is maximal, there must be some set

T \in L

such that

S, T

cross (otherwise we can add

S

L

). Here we use Lemma 13.4 and consider two cases. Suppose

S \cap T, S \cup T

are tight. Note that

S \cap T, S \cup T

cross fewer sets in

L

than

S

does. By the choice of

S

, it must be the case that both

S \cap T

and

S \cup T

are spanned by

L

. However, we have

χ_{S} + χ_{T} = χ_{S \cap T} + χ_{S \cup T}

which implies that

χ_{S}

is also spanned, a contradiction. The proof for the other case when

S - T

and

T - S

are tight is similar. Thus we have

L = m

and this is the desired family.

13.2.2 Counting argument

The second key ingredient in the proof is a counting argument that exploits Lemma 13.3. An easier counting argument shows that there is an edge with

x_{e} \geq 1 / 3

in any basic feasible solution. The tight bound of

1 / 2

is more delicate and Jain's original proof is perhaps a bit hard to understand (see ^[4]). The argument has been subsequently refined and a "fractional token" based analysis ^[5] was developed and this is the proof in ^[6]. The token based analysis is flexible and powerful in iterated rounding based algorithms. In an attempt to make the proof even more transparent, the author of this notes developed yet another proof in ^[7]. We describe that proof below.

The proof is via contradiction where we assume that

0 < x_{e} < \frac{1}{2}

for each

e \in E

. We call the two nodes incident to an edge as the endpoints of the edges. We say that an endpoint

u

belongs to a set

S \in L

u

is the minimal set from

L

that contains

u

We consider the simplest setting where

L

is a collection of disjoint sets, in other words, all sets are maximal. In this case the counting argument is easy. Let

m = | E | = | L |

. For each

S \in L, f (S) \geq 1

and

x (δ (S)) = f (S)

. If we assume that

x_{e} < \frac{1}{2}

for each

e

, we have

| δ (S) | \geq 3

which implies that each

S

contains at least

3

distinct endpoints. Thus, the

m

disjoint sets require a total of

3 m

endpoints. However the total number of endpoints is at most

2 m

since there are

m

edges, leading to a contradiction.

Figure 13.4: Easy case of counting argument.

Now we consider a second setting where the forest associated with

L

has

k

leaves and

h

internal nodes but each internal node has at least two children. In this case, following Jain, we can easily prove a weaker statement that

x_{e} \geq 1 / 3

for some edge

e

. If not, then each leaf set

S

must have four edges leaving it and hence the total number of endpoints must be at least

4 k

. However, if each internal node has at least two children, we have

h < k

and since

h + k = m

we have

k > m / 2

. This implies that there must be at least

4 k > 2 m

endpoints since the leaf sets are disjoint. But

m

edges can have at most

2 m

endpoints. Our assumption on each internal node having at least two children is obviously a restriction. So far we have not used the fact that the vectors

χ_{S}, S \in L

are linearly independent. We can handle the general case to prove

x_{e} \geq 1 / 3

by using the following lemma.

Lemma 13.5. Suppose

C

is a unique child of

S

. Then there must be at least two endpoints in

S

that belong to

S

Proof. If there is no endpoint that belongs to

S

then

δ (S) = δ (C)

but then

χ_{S}

and

χ_{C}

are linearly dependent. Suppose there is exactly one endpoint that belongs to

S

and let it be the endpoint of edge

e

. But then

x (δ (S)) = x (δ (C)) + x_{e}

x (δ (S)) = x (δ (C)) - x_{e}

. Both cases are not possible because

x (δ (S)) = f (S)

and

x (δ (C)) = f (C)

where

f (S)

and

f (C)

are positive integers while

x_{e} \in (0, 1)

. Thus there are at least two end points that belong to

S

Using the preceding lemma we prove that

x_{e} \geq 1 / 3

for some edge

e

. Let

k

be the number of leaves in

L

and

h

be the number of internal nodes with at least two children and let

ℓ

be the number of internal nodes with exactly one child. We again have

h < k

and we also have

k + h + ℓ = m

. Each leaf has at least four endpoints. Each internal node with exactly one child has at least two end points which means the total number of endpoints is at least

4 k + 2 ℓ

. But

4 k + 2 ℓ = 2 k + 2 k + 2 ℓ > 2 k + 2 h + 2 ℓ > 2 m

and there are only

2 m

endpoints for

m

edges. In other words, we can ignore the internal nodes with exactly one child since there are two endpoints in such a node/set and we can effectively charge one edge to such a node.

We now come to the more delicate argument to prove the tight bound that

x_{e} \geq \frac{1}{2}

for some edge

e

. We describe invariant that effectively reduces the argument to the case where we can assume that

L

is a collection of leaves. This is encapsulated in the lemma below which requires some notation. Let

α (S)

be the number of sets of

L

contained in

S

including

S

itself. Let

β (S)

be the number of edges whose both endpoints lie inside

S

. Recall that

f (S)

is the requirement of

S

Lemma 13.6. For all

S \in L, f (S) \geq α (S) - β (S)

Assuming that the lemma is true we can do an easy counting argument. Let

R_{1}, R_{2}, \dots, R_{h}

be the maximal sets in

L

(the roots of the forest). Note that

\sum_{i = 1}^{h} α (R_{i}) = | L | = m

. Applying the claim to each

R_{i}

and summing up,

\sum_{i = 1}^{h} f (R_{i}) \geq \sum_{i = 1}^{h} α (R_{i}) - \sum_{i = 1}^{h} β (R_{i}) \geq m - \sum_{i = 1}^{h} β (R_{i}) .

Note that

\sum_{i = 1}^{h} f (R_{i})

is the total requirement of the maximal sets. And

m - \sum_{i = 1}^{h} β (R_{i})

is the total number of edges that cross the sets

R_{1}, \dots, R_{h}

. Let

E^{'}

be the set of edges crossing these maximal sets. Now we are back to the setting with

h

disjoint sets and

E^{'}

edges with

\sum_{i = 1}^{h} f (R_{i}) \geq | E^{'} |

. This easily leads to a contradiction as before if we assume that

x_{e} < \frac{1}{2}

for all

e \in E^{'}

. Formally, each set

R_{i}

requires

> 2 f (R_{i})

edges crossing it from

E^{'}

and therefore

R_{i}

contains at least

2 f (R_{i}) + 1

endpoints of edges from

E^{'}

. Since

R_{1}, \dots, R_{h}

are disjoint the total number of endpoints is at least

2 \sum_{i} f (R_{i}) + h

which is strictly more than

2 | E^{'} |

Proof of Lemma 13.6. Thus, it remains to prove the claim which we do by inductively starting at the leaves of the forest for

L

Case 1:

S

is a leaf node. We have

f (S) \geq 1

while

α (S) = 1

and

β (S) = 0

which verifies the claim.

Case 2:

S

is an internal nodes with

k

children

C_{1}, C_{2}, \dots, C_{k}

. See Fig 13.5 for the different types of edges that are relevant.

E_{c c}

is the set of edges with end points in two different children of

S

E_{c p}

be the set of edges that cross exactly one child but do not cross

S

E_{p o}

be the set of edges that cross

S

but do not cross any of the children.

E_{c o}

is the set of edges that cross both a child and

S

. This notation is borrowed from ^[6:1].

Figure 13.5:

S

is an internal node with several children. Different types of edges that play a role.

p

refers to parent set

S, c

refer to a child set and

o

refers to outside.

Let

γ (S)

be the number of edges whose both endpoints belong to

S

but not to any child of

S

. Note that

γ (S) = | E_{c c} | + | E_{c p} |

Then,

\begin{aligned} β (S) & = γ (S) + \sum_{i = 1}^{k} β (C_{i}) \\ (13.1) & \geq γ (S) + \sum_{i = 1}^{k} α (C_{i}) - \sum_{i = 1}^{k} f (C_{i}) \\ = γ (S) + α (S) - 1 - \sum_{i = 1}^{k} f (C_{i}) \end{aligned}

(13.1) follows by applying the inductive hypothesis to each child. From the preceding inequality, to prove that

β (S) \geq α (S) - f (S)

(the claim for

S

), it suffices to show the following inequality.

\begin{matrix} (13.2) & γ (S) \geq \sum_{i = 1}^{k} f (C_{i}) - f (S) + 1. \end{matrix}

The right hand side of the above inequality can be written as:

\begin{matrix} (13.3) & \sum_{i = 1}^{k} f (C_{i}) - f (S) + 1 = \sum_{e \in E_{c c}} 2 x_{e} + \sum_{e \in E_{c p}} x_{e} - \sum_{e \in E_{p o}} x_{e} + 1. \end{matrix}

We consider two subcases.

Case 2.1:

γ (S) = 0

. This implies that

E_{c c}

and

E_{c p}

are empty. Since

χ (δ (S))

is linearly independent from

χ (δ (C_{1})), \dots, χ (δ (C_{k}))

, we must have that

E_{p o}

is not empty and hence

\sum_{e \in E_{p o}} x_{e} > 0

. Therefore, in this case,

\sum_{i = 1}^{k} f (C_{i}) - f (S) + 1 = \sum_{e \in E_{c c}} 2 x_{e} + \sum_{e \in E_{c p}} x_{e} - \sum_{e \in E_{p o}} x_{e} + 1 = - \sum_{e \in E_{p o}} x_{e} + 1 < 1.

Since the left hand side is an integer, it follows that

\sum_{i = 1}^{k} f (C_{i}) - f (S) + 1 \leq 0 = γ (S)

Case 2.2:

γ (S) \geq 1

. Recall that

γ (S) = | E_{c c} | + | E_{c p} |

\sum_{i = 1}^{k} f (C_{i}) - f (S) + 1 = \sum_{e \in E_{c c}} 2 x_{e} + \sum_{e \in E_{c p}} x_{e} - \sum_{e \in E_{p o}} x_{e} + 1 \leq \sum_{e \in E_{c c}} 2 x_{e} + \sum_{e \in E_{c p}} x_{e} + 1

By our assumption that

x_{e} < \frac{1}{2}

for each

e

, we have

\sum_{e \in E_{c c}} 2 x_{e} < | E_{c c} |

| E_{c c} | > 0

, and similarly

\sum_{e \in E_{c p}} x_{e} < | E_{c p} | / 2

| E_{c p} | > 0

. Since

γ (S) = | E_{c c} | + | E_{c p} | \geq 1

we conclude that

\sum_{e \in E_{c c}} 2 x_{e} + \sum_{e \in E_{c p}} x_{e} < γ (S) .

Putting together we have

\sum_{i = 1}^{k} f (C_{i}) - f (S) + 1 \leq \sum_{e \in E_{c c}} 2 x_{e} + \sum_{e \in E_{c p}} x_{e} + 1 < γ (S) + 1 \leq γ (S)

as desired.

Tightness of the analysis: The LP relaxation has an integrality gap of

2

even for the MST problem. Let

G

be the cycle on

n

vertices with all edge costs equal to

1

. Then setting

x_{e} = 1 / 2

on each edge is feasible and the cost is

n / 2

while the MST cost is

n - 1

. Note that the optimum fractional solution here is

1 / 2

-integral. However, there are more involved examples (see Jain's paper or ^[4:1]) based on the Petersen graph where the optimum basic feasible solution is not half-integral while there are one or more edges with fractional value at least

1 / 2

. Jain's iterated rounding algorithm is an unusual algorithm in that the output of the algorithm may not have any discernible structure until it is completely done.

Running time: The strength of the iterated rounding approach is the remarkable approximation guarantees it delivers for various problems. The weakness is the high running time which is due to two reasons. First, one needs a basic feasible solution for the LP–this is typically much more expensive than finding an approximately good feasible solution. Second, the algorithm requires computing an LP solution many times. Finding faster algorithms with comparable approximation guarantees is an open research area.

The source minimal $s$ - $t$ mincut in a directed / undirected graph is unique via submodularity and can be found by computing $s$ - $t$ maxflow and finding the reachable set from $s$ in the residual graph. Similarly sink minimal set. ↩︎
Michel X Goemans and David P Williamson. “The primal-dual method for approximation algorithms and its application to network design problems”. In: Approximation algorithms for NP-hard problems (1997), pp. 144–191. ↩︎
Kamal Jain. “A factor 2 approximation algorithm for the generalized Steiner network problem”. In: Combinatorica 21.1 (2001), pp. 39–60. ↩︎
Vĳay V Vazirani. Approximation algorithms. Springer Science & Business Media, 2013. ↩︎ ↩︎
Viswanath Nagarajan, R Ravi, and Mohit Singh. “Simpler analysis of LP extreme points for traveling salesman and survivable network design problems”. In: Operations Research Letters 38.3 (2010), pp. 156–160. ↩︎
David P Williamson and David B Shmoys. The design of approximation algorithms. Cambridge university press, 2011. ↩︎ ↩︎
Chandra Chekuri and Thapanapong Rukkanchanunt. “A note on iterated rounding for the Survivable Network Design Problem”. In: 1st Symposium on Simplicity in Algorithms (SOSA 2018). Schloss DagstuhlLeibniz-Zentrum fuer Informatik. 2018. ↩︎

Approximation Algorithm: Survivable Network Design Problem

13.1 Augmentation approach

13.2 Iterated rounding based 2-approximation

13.2.1 Basic feasible solutions and laminar family of tight sets

13.2.2 Counting argument

Recommended for you

Report Article