hard problems

bad hard knap:bf knap:dp knap:dpv set cover sat

hard problems

for many problems, no known polytime algorithm

0-1 knapsack: brute force

for each subset

if weight ok and value best-so-far
- print subset, update best-so-far-value

def powerset(iterable): # all subsets
#https://docs.python.org/2/library/itertools.html#recipes
    "powerset([1,2,3]) --> () (1,) (2,) (3,) (1,2) (1,3) (2,3) (1,2,3)"
    s = list(iterable)
    return chain.from_iterable(combinations(s, r) for r in range(len(s)+1))

def knapBF(val, wt, W, verbose): # brute force
  n = len(val)
  showProblem(n, W, val, wt)
  indices = range(n) # [0 1 .. n-1]
  bestVal = 0
  if verbose:
    print('knapBF\nsubset    wt val   max wt',W)
  for indexset in powerset(indices):
    v = sum( val[t] for t in indexset )
    w = sum(  wt[t] for t in indexset )
    if w <= W and v > bestVal:
      bestVal = v
      if verbose: print(indexset, w,v)

brute force 0-1 knapsack

runtime ?

Ω(2ⁿ)

correct ?

yes: considers every subset

can we do better ? ? ?

0-1 knapsack: dynamic programming

subproblems ?

by set of allowed items
by allowed total weight

set of allowed items ?

{ } {1} {1 2} {1 2 3} ...

allowed total weight ?

0 1 2 ...

K[w][j]: best value with items any subset of {1, ..., j}, total wt ≤ w

compute K[w][j]

item j out? value K[w][j-1]
item j in? value K[w-w_j][j-1] + v_j
so set K[w][j] to max of these two

0-1 knapsack: dp
#python indexing: val[t],wt[t] corresponds to K[][t+1]
# K[w][j]  will be best value knapsack, weight at most w,
#         using subset of { item_1, item_2, ..., item_j }
# warning: K[][] needs an extra 0th column, so its indices
#   are off-by-one w.r.t. val[], wt[]
# eg. val[v], wt[t]  refer to t+1'st item
#     also K[][t+1]  refers to t+1'st item
def solveknapsack(val,wt,W):
  n = len(val)
  K = [[0 for j in xrange(n+1)] for w in xrange(W+1)]
  for j in range(1,n+1):
    for w in range(W+1):
      K[w][j] = K[w][j-1] if w < wt[j-1] \
        else max(K[w][j-1], K[w-wt[j-1]][j-1]+val[j-1])

0-1 knapsack: dp example
max value knapsack, weight at most 27 ?
 val   [ 6,  9,  7,  9,  8,  7]
  wt   [11,  6,  8, 10,  8,  9]
[ 0,  0,  0,  0,  0,  0,  0]
[ 0,  0,  0,  0,  0,  0,  0]
[ 0,  0,  0,  0,  0,  0,  0]
[ 0,  0,  0,  0,  0,  0,  0]
[ 0,  0,  0,  0,  0,  0,  0]
[ 0,  0,  0,  0,  0,  0,  0]
[ 0,  0,  9,  9,  9,  9,  9]
[ 0,  0,  9,  9,  9,  9,  9]
[ 0,  0,  9,  9,  9,  9,  9]
[ 0,  0,  9,  9,  9,  9,  9]
[ 0,  0,  9,  9,  9,  9,  9]
[ 0,  6,  9,  9,  9,  9,  9]
[ 0,  6,  9,  9,  9,  9,  9]
[ 0,  6,  9,  9,  9,  9,  9]
[ 0   ?   ?   ?   ?   ?   ?]

[ 0,  6,  9, 16, 16, 17, 17]
[ 0,  6,  9, 16, 16, 17, 17]
[ 0,  6,  9, 16, 18, 18, 18]
[ 0,  6, 15, 16, 18, 18, 18]
[ 0,  6, 15, 16, 18, 18, 18]
[ 0,  6, 15, 16, 18, 18, 18]
[ 0,  6, 15, 16, 18, 18, 18]
[ 0,  6, 15, 16, 18, 18, 18]
[ 0,  6, 15, 16, 18, 24, 24]
[ 0,  6, 15, 16, 18, 24, 24]
[ 0,  ?   ?   ?   ?   ?   ?]

[ 0,  6, 15, 16, 25, 26, 26]
[ 0,  6, 15, 22, 25, 26, 26]
[ 0,  6, 15, 22, 25, 26, 26]
[ 0,  6, 15, 22, 25, 26, 26]
-----------------------------
solution
[0, 1, 0, 1, 1, 0] val 26

dp knapsack wc superpoly

consider these instances
- each value and weight has n bits, so in [ 2^n-1 ... 2ⁿ -1 ]
- average weight ( 2^n-1+2ⁿ -1 ) / 2 ≈ .75 2ⁿ
- let W be n times average weight, so ≈ .75 n 2ⁿ
- input size Θ( n² ) bits
brute force wc time
- 2ⁿ subsets, each weight Θ(n) bits
- for each subset t of size s, perform s add'ns
- total number of add'ns is sum, over all subsets t, of size(t)-1
- sum, over all subsets t, of size(t) is n*2^n-1 (pair each subset with its complement)
- total number add'ns: n*2^n-1 - 2ⁿ in Θ( n * 2ⁿ)
- avg add'n involves n/2 weights, sum has Θ( n² ) bits, time Θ(n²)
- total alg'm time in Θ( n³ 2ⁿ )
dyn. prog. wc time
- avg wt. Θ(n²) bits
- avg add'n cost Θ(n²)
- wc W Θ(n * 2ⁿ)
- Θ(n² * n * W ) = Θ( n⁴ 2ⁿ ), worse than BF!
let t = n² so n = √ t let f(t) = t² 2^√t
- each alg. input Θ( t ) bits wc time O( f(t) )
time complexities
2^{O(log t)} polynomial
2^o(t) subexponential
2^Ω(t) exponential
here
- lim_{t → ∞} 2^{k log t} / f(t) = 0, so f(t) superpoly
- lim_{t → ∞} f(t) / (1+ε)^t = 0, so f(t) subexponential

0-1 knapsack: another DP solution

how do you like DPW if
- each weight is in { 2^{n-1}, ..., 2^n -1 } and
- each values is in { 1, ..., 2n } ?
organize subproblems by value (instead of weight)
also useful in constructing knapsack PTAS (polytime approx scheme)

def knapDPV(val,wt,V): #dynamic programming by value
  # A[v][j] is min weight of subset of items 0..j with exact val v
  infinity = 1 + sum(val) #larger than any possible sum of values
  n = len(val)
  A = [[infinity for j in xrange(n)] for v in xrange(V+1)]
  for j in range(n): A[0][j] = 0
  A[val[0]][0] = wt[0]  # end initialization
  for v in range(1,V+1):  # row    0 already initialized
    for j in range(1,n):  # column 0 "       "
      A[v][j] = A[v][j-1] if v < val[j] \
        else min(A[v][j-1], A[v - val[j]][j-1] + wt[j])

knapsack DPV hand-trace

knapsack DPV

time in O(nV), but V can be big
correctness ?

setcover

brute force set cover
from random    import randint
from itertools import chain, combinations

def powerset(iterable):
#https://docs.python.org/2/library/itertools.html#recipes
    "powerset([1,2,3]) --> () (1,) (2,) (3,) (1,2) (1,3) (2,3) (1,2,3)"
    s = list(iterable)
    return chain.from_iterable(combinations(s, r) for r in range(len(s)+1))

def settostring(s,n): # set as pretty binary string
  x = ''
  for j in range(n):
    if j in s:   x += ' *'
    else:        x += ' -'
  return x

def gensubset(a,b,n): # subset of 0..n-1, each with prob a/b
  s = set()
  for j in range(n):
    if randint(1,b) <= a: s.add(j)
  return s

def initsubsets(n,m,a,b): # m subs of 0..n-1
  # an element in a subset with probability a/b
  L = [] # list of subsets
  for _ in range(m): L.append(gensubset(a,b,n))
  sumlen = 0
  for x in L:
    sumlen += len(x)
    print settostring(x,n)
  print 'universe size', n,' avg subset size', sumlen*1.0/m
  return L

def bruteforce(n,m,L):
  indices = range(m) # [0 1 .. m-1]
  bestcover = ()
  for indexset in powerset(indices):
    subsetunion = set()
    for t in indexset:
      subsetunion |= L[t]
    # indexsets sorted by cardinality, so first found is min
    if len(subsetunion) == n:
      print 'size', len(indexset), 'cover', indexset
      for j in indexset: print settostring(L[j],n)
      return # so return once one is found

n,m,a,b = 40,25,2,11
L = initsubsets(n,m,a,b)
bruteforce(n,m,L)

set cover example
set     universe
1 2 3 4 5 6 7 8 9
 - - * - - - - * * -
 - * - - - - - * - *
 - - - - * - - * * -
 - - - - - - - - * -
 - - - - - * - * * -
 - - - - * - * * - *
 * * - * - - - - - -
 - - - - - - * * - -

min cover (0, 4, 5, 6)
 - - * - - - - * * -
 - - - - - * - * * -
 - - - - * - * * - *
 * * - * - - - - - -

another example
set                    universe
1 2 3 4 5 6 7 8 910111213141516171819
 * - * - - - * - * - - - - * - * - - * -
 - - - * - * - * - - - * - - * * - - - -
 - * - - - - * - * - - - - - - - - * - -
 - - - * - - - - - - - - - - - - - - * -
 * - - - - * - * - * - - - - - * - - - -
 - - - * - - * * - - - - - * - * - - - -
 - - - - - * - - * - - * * - - * - - * -
 - * - - * - - - * - * * * - * * - - - -
 - * - - * - - - * - - - - * * - - - * *
 * - * - - - * * - - - - - - - - - - - *
- * - - - - * - - * * - * - - - * - * -

min cover (0, 1, 2, 8, 10)
* - * - - - * - * - - - - * - * - - * -
- - - * - * - * - - - * - - * * - - - -
- * - - - - * - * - - - - - - - - * - -
- * - - * - - - * - - - - * * - - - * *
- * - - - - * - - * * - * - - - * - * -

greedy set cover

greedy set cover
Dasgupta et al. section 5.4

school location problem
  - input: some towns, and a threshold distance (e.g. 30 km)
  - problem: build schools in minimum number of towns,
      s.t. distance to school at most the threshold
  - model as a graph problem G = (V,E)
      - V = set of towns
      - E = pairs of towns at most threshold distance apart
      - for each town, its neighbor-set is set of all towns
          whose distance away is at most the threshold
      - problem: pick the minimum number of neighbor-sets that cover V
      - in other words: this is an instance of min set cover

how hard is set cover?
- answer: NP-hard, by reduction from vertex cover

satisfiability

given a boolean formula in CNF, is it satisfiable?

assume input has n variables and m clauses

brute force: 2ⁿ possible assignments

brute force: worst case time O(m2ⁿ)

sat example
( x1  or  ~x2  or  x3 )  and
( x1  or  ~x2  or ~x4 )  and
( x1  or   x2  or  x3 )  and
( x1  or   x2  or  x4 )  and
(~x1  or  ~x2  or ~x3 )  and
(~x1  or  ~x2  or  x3 )  and
(~x1  or   x2  or  x4 )  and
( x2  or  ~x3  or ~x4 )  and
( x2  or   x3  or ~x4 )

[  1, -2,  3]
[  1, -2, -4]
[  1,  2,  3]
[  1,  2,  4]
[ -1, -2, -3]
[ -1, -2,  3]
[ -1,  2,  4]
[  2, -3, -4]
[  2,  3, -4]

satisfiable (0, 1, 1, 0)

another example
[ -4, -5,  6]
[ -4,  5, -6]
[ -2,  4, -5]
[ -2,  5, -6]
[ -1, -3, -4]
[ -1, -3,  4]
[ -1,  4, -6]
[  1, -4,  5]
[  1, -3, -5]
[  1, -3,  4]
[  1,  5, -6]
[  2,  5,  6]
[  3, -5, -6]
[  4, -5,  6]
[  4,  5, -6]
[  4,  5,  6]

satisfiable (1, 1, 0, 1, 0, 0)

brute force sat solver
# I/O: literals x1 -x1 x2 -x2 ... rept'd by 1 -1 2 -2 ...
def clause_sat(clause,asnmt):
  for var in clause:
    if ((var>0 and asnmt[  var-1 ] == 1) or
        (var<0 and asnmt[-(var+1)] == 0)):
      return True
  return False

def sat(formula, asnmt):
  for clause in formula:
    if not clause_sat(clause,asnmt): return False
  return True

n, k, m = 20, 3, 90  #max m: (n choose k)(2^k)
myf = formula(n,k,m)
show(myf, False)

allAsn = product([0,1],repeat=n)
for a in allAsn:
  if sat(myf,a): print 'satisfiable', a