Week 5, Lecture 1

Set Cover

Input: A set of elements B; sets S₁ , . . . , S_m ⊆ B
Output: A selection of the S_i whose union is B.
Cost: Number of sets picked.

The goal is to make the selection so that the cost is minimum.
A natural approach is the greedy one:

Repeat until all elements of B are covered:
    Pick the set S_i with the largest number of uncovered elements
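
The greedy rule above can be sketched in Python as follows. This is a minimal sketch, not a reference implementation: the function name `greedy_set_cover`, the choice to return indices, and the tie-breaking rule (lowest index wins) are assumptions made for illustration, and the subsets are assumed to be given as Python sets.

```python
def greedy_set_cover(universe, subsets):
    """Return the indices of the chosen subsets, in the order greedy picks them."""
    uncovered = set(universe)
    chosen = []
    while uncovered:
        # Index of the subset containing the most still-uncovered elements
        # (max() returns the first maximum, so ties go to the lowest index).
        best = max(range(len(subsets)), key=lambda i: len(subsets[i] & uncovered))
        if not subsets[best] & uncovered:
            # No subset makes progress: the family does not cover the universe.
            raise ValueError("the given subsets do not cover the universe")
        chosen.append(best)
        uncovered -= subsets[best]
    return chosen


print(greedy_set_cover({1, 2, 3}, [{1}, {2, 3}, {1, 2}]))  # [1, 0]
```

Each iteration scans every subset once, so this sketch favours clarity over speed.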

For example, let
Set to cover: B = {1,2,3,4,5,6}
Set family: {1,2,3,4}, {1,3,5}, {2,4,6}

According to the greedy approach, {1,2,3,4} is picked first as it has the largest number of uncovered elements. Then both remaining sets must be picked, one to cover element 5 and one to cover element 6. So, the greedy approach picks all three sets.
But the optimal solution picks just two sets: {1,3,5} and {2,4,6}.
Hence, we notice that the greedy solution is not always optimal.
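
The example can be checked mechanically by comparing the greedy choice with an exhaustive search. The sketch below is illustrative only: the helper names `greedy_cover` and `optimal_cover` are hypothetical, and the brute-force search takes exponential time, so it is suitable only for tiny instances like this one.

```python
from itertools import combinations


def greedy_cover(universe, subsets):
    """Greedy rule: repeatedly take the subset with the most uncovered elements."""
    uncovered, chosen = set(universe), []
    while uncovered:
        best = max(subsets, key=lambda s: len(s & uncovered))
        if not best & uncovered:
            raise ValueError("the given subsets do not cover the universe")
        chosen.append(best)
        uncovered -= best
    return chosen


def optimal_cover(universe, subsets):
    """Smallest cover found by trying all selections of size 0, 1, 2, ..."""
    for size in range(len(subsets) + 1):
        for combo in combinations(subsets, size):
            if set().union(*combo) >= set(universe):
                return list(combo)
    return None  # the family does not cover the universe


B = {1, 2, 3, 4, 5, 6}
family = [{1, 2, 3, 4}, {1, 3, 5}, {2, 4, 6}]
greedy = greedy_cover(B, family)
optimal = optimal_cover(B, family)
print(len(greedy), len(optimal))  # 3 2
```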

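Note - Set Cover is NP-hard (its decision version is NP-complete), so no polynomial-time algorithm for finding an optimal cover is known.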

Approximation:

Suppose B contains n elements and that the optimal cover consists of k sets. Then the greedy algorithm will use at most k ln(n) ¹ sets.

Proof:

Let n_t be the number of elements still not covered after t iterations of the greedy algorithm (so n₀ = n). Since these remaining elements are covered by the optimal k sets, by the pigeonhole principle some set must contain at least n_t/k of them. The greedy approach picks a set covering at least that many, therefore:

n_{t+1} <= n_t - n_t/k = n_t(1 - 1/k),

which by repeated application gives: n_t <= n₀(1-1/k)^t
We now use the inequality: 
1 - x <= e^(-x) for all x, with equality if and only if x = 0
This can be observed from their graphs: the line 1 - x touches the curve e^(-x) at x = 0 and lies below it everywhere else.
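
As a quick numerical sanity check of the inequality 1 - x <= e^(-x), the gap e^(-x) - (1 - x) can be evaluated on a grid of points. This is only an illustration on an arbitrarily chosen range (-3 to 3 in steps of 0.1), not a proof; the variable names are assumptions.

```python
import math

# Sample points x = -3.0, -2.9, ..., 3.0 (range and step chosen arbitrarily).
xs = [i / 10 for i in range(-30, 31)]

# Gap between the two sides of the inequality: e^(-x) - (1 - x).
gaps = [math.exp(-x) - (1 - x) for x in xs]

# The gap is never negative, and on this grid it vanishes only at x = 0.
assert all(g >= 0 for g in gaps)
assert [x for x, g in zip(xs, gaps) if g == 0] == [0.0]
```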

Thus, using the above inequality, we get:

n_t <= n₀(1-1/k)^t 

==> n_t < n₀(e^(-1/k))^t    (strict for t >= 1, since 1/k > 0)

==> n_t < n e^(-t/k)    (as n₀ = n)

At t = k ln(n) (rounded up to an integer if necessary), n_t is strictly less than n e^(-ln(n)) = 1. Since n_t is a non-negative integer, this means no elements remain to be covered.
Therefore, the greedy algorithm terminates after at most k ln(n) iterations and so picks at most k ln(n) sets, i.e. its approximation ratio is at most ln(n).
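
The bound from the proof can be traced on the earlier example, where n = 6 and the optimal cover has k = 2 sets. The sketch below records n_t after each greedy iteration and checks n_t <= n(1 - 1/k)^t; the function name `uncovered_counts` is hypothetical, and k is taken from the example rather than computed.

```python
import math


def uncovered_counts(universe, subsets):
    """Run the greedy rule and return [n_0, n_1, ...], the uncovered counts."""
    uncovered = set(universe)
    counts = [len(uncovered)]
    while uncovered:
        best = max(subsets, key=lambda s: len(s & uncovered))
        if not best & uncovered:
            raise ValueError("the given subsets do not cover the universe")
        uncovered -= best
        counts.append(len(uncovered))
    return counts


B = {1, 2, 3, 4, 5, 6}
family = [{1, 2, 3, 4}, {1, 3, 5}, {2, 4, 6}]
n, k = len(B), 2  # k = 2 is the optimal cover size stated in the example

counts = uncovered_counts(B, family)
print(counts)  # [6, 2, 1, 0]

# Each n_t respects the bound n_t <= n * (1 - 1/k)^t from the proof.
for t, n_t in enumerate(counts):
    assert n_t <= n * (1 - 1 / k) ** t

# Greedy used len(counts) - 1 = 3 sets, within k * ln(n), which is about 3.58.
sets_used = len(counts) - 1
assert sets_used <= k * math.log(n)
```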

1: ln(n) = log_e(n)
