The vector partition problem for convex objective functions

4 downloads 78 Views 108KB Size Report
problem has broad expressive power and captures NP-hard problems even if ... 1999). Because the problem concerns the maximization of a convex function, ...
MATHEMATICS OF OPERATIONS RESEARCH Vol. 26, No. 3, August 2001, pp. 583–590 Printed in U.S.A.

THE VECTOR PARTITION PROBLEM FOR CONVEX OBJECTIVE FUNCTIONS SHMUEL ONN and LEONARD J. SCHULMAN The partition problem concerns the partitioning of a given set of n vectors in d-space into p parts to maximize an objective function that is convex on the sum of vectors in each part. The problem has broad expressive power and captures NP-hard problems even if either p or d is fixed. In this article we show that when both p d are fixed, the problem is solvable in strongly polynomial time using Ondp−1−1  arithmetic operations. This improves upon the previously known bound of 2 Ondp . Our method is based on the introduction of the signing zonotope of a set of points in space. We study this object, which is of interest in its own right, and show that it is a refinement of the so-called partition polytope of the same set of points.

1. Introduction. The partition problem concerns the partitioning of a set A = a1   an of n vectors in d-space into p parts to maximize an objective function that is convex on the sum of vectors in each part. Each

=  1   p   ordered partition  of A is associated with the d × p matrix A =  a∈ 1 a  a∈ p a, whose jth column represents the total value of vectors assigned to the jth part. The problem is to find a partition that maximizes an objective function f given by f   = cA , where c is a real convex functional on d×p . This class of problems has applications in diverse fields that include circuit layout, clustering, inventory, scheduling, and reliability—see Barnes et al. (1992), Granot and Rothblum (1991), Hwang et al. (2000), and references therein—as well as important recent applications to symbolic computation (Onn and Sturmfels 1999). In its full generality, the partition problem instantly captures NP-hard problems, and hence is presumably intractable (Hwang et al. 1999). Because the problem concerns the maximization of a convex function, it can be reduced to p the problem of maximizing the same objective over the p-partition polytope A of A defined d×p

to be the convex hull in  of all (exponentially many) matrices A corresponding to ppartitions. As there will always be an optimal solution which is a vertex of this polytope, the partition problem can be solved by picking the best among the extremal partitions of p A—those with A a vertex of A . If the convex functional c is presented by an evaluation oracle, it may be necessary in the worst case to query the oracle on each and every one of these extremal partitions. Therefore, the complexity of the partition problem is intimately related to the number of extremal partitions. As the number of partitions and matrices A is exponential in n even with both p d fixed, the filtration of extremal out of all partitions by direct enumeration is prohibitive. Barnes et al. (1992) showed that any extremal partition is separable, that is, any two distinct parts can be separated by a hyperplane. This condition was exploited by Hwang et al. (1999) who showed that, for any fixed p d, the number of separable and hence extremal partitions p is Ond 2 , and all separable and hence extremal partitions can be enumerated and the par2 tition problem solved in time Ondp . These results apply for the more general shaped partition problem, where partitions are restricted to be those whose shape  1    p  Received October 20, 1999; revised March 20, 2001. AMS 2000 subject classification. Primary: 05A, 15A, 51M, 52A, 52B, 52C, 68Q, 68R, 68U, 90B, 90C. OR/MS subject classification. Primary: mathematics/combinations. Key words. Partition, cluster, optimization, separation, convex, polytope, zonotope, signing, vertex enumeration, polynomial time, combinatorial optimization. 583 0364-765X/01/2603/0583/$05.00 1526-5471 electronic ISSN, © 2001, INFORMS

584

S. ONN AND L. J. SCHULMAN

lies in a prescribed (but arbitrary) set of shapes of n. The solution of the partition problem by this method can not be significantly accelerated: Alon and Onn (1999) showed that, for p every fixed p ≥ 2 and d ≥ 3, the number of separable partitions is in fact nd 2 . In this article we provide a more efficient resolution of the (unrestricted) partition problem p by taking a different approach and introducing the p-signing zonotope A of A defined d×p of all matrices A corresponding to p-signings  of A; to be the convex hull in  the formal definitions are provided in the next section, where we study this object and establish Theorem 2.4 which implies at once the following result, demonstrating a manyp p to-one mapping from the set of vertices of A onto the set of vertices of A : p

Theorem 1.1. For every vertex A of the signing zonotope A there exists a vertex A p of the partition polytope A such that the cone of linear functionals uniquely maximized p p over A at A is contained in the cone of linear functionals uniquely maximized over A

at A . In fact, as shown in Example 2.5, the number of extremal signings (those  with A a verp tex of A ) corresponding to each extremal partition is typically exponential. Nevertheless, there are far fewer extremal signings than separable partitions and so, combining the manyto-one correspondence implied by Theorem 1.1 with available algorithmic and extremal results on zonotopes, we are able in §3 to conclude the following improved combinatorial and algorithmic bounds for every fixed d and p ≥ 2 (see §3 for the precise statements): Corollary 3.1. The maximum number of extremal p-partitions of any set of n nonzero points in d satisfies epd n = Ondp−1−1 . Corollary 3.2. All extremal p-partitions are enumerable in strongly polynomial time using Ondp−1−1  arithmetic operations. Corollary 3.3. The p-partition problem with an oracle presented c can be solved in strongly polynomial time using Ondp−1−1  arithmetic operations and queries. We conclude our article with Corollary 3.4, which further elaborates on the relation p p between A and A , showing that, in a precise sense, the latter is a refinement of the former. Recent results of Aviran et al. (forthcoming) that study partition polytopes for p = 2 or d = 2 give e2d = nd−1  for all d ≥ 2 and ep 2 = np  for all p ≥ 3. Thus, the bound of Corollary 3.1 is tight for p = 2 but somewhat loose for d = 2. However, already for p = d = 3 the precise rate of growth is as yet unknown. Several interesting questions remain. How accurate are the bounds of Corollary 3.1 on epd n for p d ≥ 3? Are they tight for the maximum number of extremal p-signings of any set of n nonzero points in d-space (which by Theorem 1.1 is greater than or equal to the maximum number epd n of extremal ppartitions)? Could the extremal partitions be enumerated and the partition problem solved faster? 2. Extremal partitions and extremal signings. We start with some basic notation and conventions. Throughout, ei stands for the ith standard unit vector in Euclidean real space. The outer product of an ordered pair u v of vectors is the matrix u ⊗ v whose i jth entry is ui vj . The inner product of two matrices U  V of the same dimensions  is U  V = i j Ui j · Vi j . Slightly modifying standard terminology, we call a (possibly empty) subset  of real space a cone provided  ∪ 0 is closed under finite nonnegative linear combinations (i.e., is a cone in the standard sense). We make the convention that a sum over an empty set of vectors (matrices) is the zero vector (matrix) of dimension which is clear from the context. A p-partition of a set A of points in d is an ordered collection =  1   p  of p pairwise disjoint (possibly empty) sets whose union is A. We also interpret as the

THE VECTOR PARTITION PROBLEM FOR CONVEX OBJECTIVE FUNCTIONS

585

function from A to 1  p with a being the index for which a ∈

a . With each p-partition of A we associate the d × p matrix     

A = a ⊗ e a = a  a ∈ d×p  a∈A

a∈ 1

a∈ p

We are concerned with the following combinatorial optimization problem (with the reals  replaced by the rationals  when the Turing computation model is considered). Partition problem. Given positive integers p d n, a set A of n points in d , and a convex functional c  d×p −→ , find a p-partition ∗ attaining the maximum objective value, ∗ cA  = max cA   is a p-partition of A  Because c is convex, there will always be an optimal partition which is extremal, that is, whose matrix A is a vertex of the convex hull of all matrices of p-partitions, defined as follows. Partition polytope. The p-partition polytope of a set A of n points in d is defined to be the convex hull of all pn matrices of p-partitions of A, p

A = conv A  is a p-partition of A ⊂ d×p  p

While A is defined as the convex hull of exponentially many points, it was shown by Hwang et al. (1999) (for the more general class of shaped partition polytopes), using separable partitions, that for any fixed d p, the number of extremal partitions is polynomial in n and they can be enumerated in polynomial time. Here we take a different approach, based on a new object—the signing zonotope—which leads to improved bounds and to a more efficient enumeration procedure for extremal partitions. We now proceed to define this object.  A p-signing of a set A of n points in d is an n p2 -tuple  = ra s  with ra s ∈ −1 1 for all a ∈ A and 1 ≤ r < s ≤ p. We extend the domain of  to all pairs 1 ≤ r = s ≤ p by antisymmetry sa r = −ra s . With each p-signing  of A we associate the d × p matrix   ra s · a ⊗ er − es  ∈ d×p  A = a∈A 1≤r 0 C A − C A ¯ = ¯ a∈A

p

It follows that C · is uniquely maximized over A at A hence C ∈  .  Lemma 2.1 also implies that, if and ¯ are two distinct extremal p-partitions of A then p A = A ¯ , and so extremal partitions stand in bijection with vertices of A . Indeed if = ¯

¯

¯ ¯ are extremal with A = A then, taking any C ∈  =  and a ∈ A with a = a,  and C a ⊗ e a − e a  we must have by the lemma that both C a ⊗ e a − e a ¯ ¯ must be positive, which is impossible. Note however that, for nonextremal partitions, it may happen that exponentially many give the same matrix A . The second lemma is the analog of Lemma 2.1 concerning the signing zonotope. Lemma 2.2. The cone  of a p-signing  of a set A of nonzero points in d is the set of all d × p matrices C with C ra s · a ⊗ er − es  positive for all a ∈ A and 1 ≤ r < s ≤ p. p

Proof. Suppose first C ∈  hence C · is uniquely maximized over A at A . Consider any a ∈ A and 1 ≤ r < s ≤ p, and let ¯ be the signing obtained from  by flipping the sign of ra s . Then A − A¯ = 2ra s · a ⊗ er − es , hence   1 1 a C r s · a ⊗ er − es  = C A − A¯  = C A − C A¯  > 0 2 2 Conversely, suppose C is a matrix with C ra s · a ⊗ er − es  positive for all a ∈ A and 1 ≤ r < s ≤ p. Consider any signing ¯ different from . Then, for all a ∈ A and 1 ≤ r < s ≤ p, we have the inequality C ra s − ¯ ra s  · a ⊗ er − es  ≥ 0, being strict if ¯ the inequality is indeed strict for some a ∈ A and 1 ≤ r < s ≤ p, ra s = ¯ ra s . Since  = , hence    a 

C r s − ¯ ra s · a ⊗ er − es  > 0 C A − C A¯ = a∈A 1≤r C a ⊗ es if and only if ra s = 1. Thus, the values C a ⊗ e1   C a ⊗ ep are distinct. Let a be the unique index attaining maximum value C a ⊗ e a . Then, for all k = a, we a = 1, and so a is the desired unique have C a ⊗ e a > C a ⊗ ek hence  ak index.  Proposition 2.3 enables to associate with each extremal p-signing  of A a p-partition

=  of A via the interpretation of as a function from A to 1  p as follows: a = 1 for all k = a. For each a ∈ A, let 1 ≤ a ≤ p be the unique index satisfying  ak If  is not extremal then  is undefined. The following statement implies at once Theorem 1.1 stated in §1. Theorem 2.4. A p-partition of a set A of nonzero points in d is extremal if and only if =  for some extremal p-signing  of A. Moreover, the closure of the cone  of any p-partition of A is the union of closures of all cones  of p-signings  of A with = . Proof. First, consider any extremal p-signing  of A and let =  be its associated p-partition. We show that the corresponding cones satisfy  ⊆  hence, in particular,  is nonempty and is extremal. Consider any C ∈  . For all a ∈ A and k = a we a = 1 and hence have, by the definition of = , that  ak

a C  a ⊗ e a − ek  = C  ak · a ⊗ e a − ek   Since C ∈  Lemma 2.2 implies that the right-hand side is positive, hence so is the lefthand side. Since this is true for all a ∈ A and k = a, Lemma 2.1 implies that C ∈  . Next, let be any extremal p-partition. Suppose  is a p-signing for which the closure of  intersects  . Since both  and  are open,  intersects  as well. Since the cones of any two distinct extremal p-partitions are disjoint, and, as just shown in the first part of the proof above,  ⊆   , it follows that = . Now, the union over the finitely many p-signings  of the closures of the cones  is the entire space, hence contains  . Since we have shown that whenever the closure of  intersects  then

=  and  ⊆  , it follows that, as claimed, the closure of  is the union of closures of all cones  of p-signings  of A with = .  We finish this section by demonstrating, for every n, a set of n points in n each extremal p-partition of which equals  for exponentially many extremal p-signings  of this set. Example 2.5. Let p n be positive integers and let A = e1   en be the set of unit vectors in n . Then the matrices A of p-partitions of A are precisely all 0 1 -matrices p with one 1 per row hence all are vertices of A . Thus, all pn -many p-partitions of A are extremal. Next, the matrices A of extremal p-signings of A are precisely all matrices with each row a permutation of p − 1 p − 3  3 − p 1 − p. Thus, p!n -many p-signings of A p are extremal. Interestingly, here A is equivalent to the n-fold product of the permutohedron of order p. The matrix A of an extremal =  is obtained from the matrix A of  by replacing the maximal entry p − 1 in each row by 1 and replacing all other entries by 0. Therefore, every extremal partition satisfies =  for precisely p − 1!n -many extremal signings .

588

S. ONN AND L. J. SCHULMAN p

Specifically, consider the case n = p = 3, A = e1  e2  e3 ⊂ 3 . Then A ⊂ 3×3 has p 3 = 27 vertices while A ⊆ 3×3 has 3!3 = 216 vertices. Let =  e1  e2  e3  be the extremal 3-partition with A = I3 the 3 × 3-identity. Encoding each 3-signing  by the 9-tuple e e e e e e e e e 112 113 213 122 123 223 132 133 233  3

the 3 − 1!3 = 8 extremal 3-signings  with = , together with their matrices A , are 2 0−2 2 0−2 2 0−2 2 0−2 0 2−2 0 2−2 −2 2 0 −2 2 0 0−2 2 −2 0 2 0−2 2 −2 0 2 +++−+++−− +++−++−−− +++−−++−− +++−−+−−− 2−2 0 2−2 0 2−2 0 2−2 0 0 2−2 0 2−2 −2 2 0 −2 2 0 0−2 2 −2 0 2 0−2 2 −2 0 2 ++−−+++−− ++−−++−−− ++−−−++−− ++−−−+−−− p

The closure of the normal cone  of A at its vertex I3 = A is the union of the closures p of the normal cones  of A at its eight vertices given by the matrices A above. 3. Consequences. Combining Theorem 2.4 with available algorithmic and extremal results on zonotopes, we obtain the following consequences concerning extremal partitions and the partition problem we started with. The case p = 1 being straightforward, we restrict attention to p ≥ 2. Corollary 3.1. For every fixed d and p ≥ 2, the maximum number of extremal ppartitions of any set of n nonzero points in d satisfies epd n = Ondp−1−1 . Proof. It is known (see, e.g., Gritzmann and Sturmfels 1993 and references therein) that the number of vertices of any D-dimensional zonotope which is the Minkowski sum of N N −1  line segments is at most 2 D−1 . Consider any set A ofn nonzero points in d-space. k=0 k p Then the p-signing zonotope A of A is the sum of N = n p2 line segments. As is easy p to see each matrix A has row sum zero, hence A lives in the subspace of d × p matrices with this property, which has dimension D = dp − 1. By Theorem 2.4, the number of extremal p-partitions of A is less than or equal to the number of extremal p-signings of p A which—see discussion following Lemma 2.2—equals the number of vertices of A . For fixed p d we therefore obtain, as claimed, the upper bound ep d n

≤2

D−1  k=0



N −1 k

dp−1−1

= 2



k=0

p 

n

2

−1 k



  = O ndp−1−1 



Aviran et al. (forthcoming), which study partition polytopes with p = 2 or d = 2, show that e2 d n = nd−1  for every d ≥ 2 and ep2 n = np  for every p ≥ 3. Thus, the bound e2d n = Ond−1  of Corollary 3.1 is tight for all d whereas the bound ep2 n = On2p−3  is tight for no p ≥ 4. How accurate are the bounds of Corollary 3.1 on epd n for p d ≥ 3? Are they tight for the maximum number of extremal p-signings of any set of n nonzero points in d-space (which, by Theorem 2.4, is greater than or equal to the maximum number epd n of extremal p-partitions)? The next two algorithmic corollaries concern both the real and Turing computation models: An algorithm is strongly polynomial time if it uses a number of real arithmetic operations polynomially bounded in n, and for rational data runs in time polynomial in the bit size of the input.

THE VECTOR PARTITION PROBLEM FOR CONVEX OBJECTIVE FUNCTIONS

589

Corollary 3.2. For every fixed d and p ≥ 2, all extremal p-partitions of any set of n nonzero points in d-space are enumerable in strongly polynomial time using Ondp−1−1  arithmetic operations. Proof. It is known (see Gritzmann and Sturmfels 1993 and references therein) that for any fixed D there is a strongly polynomial time algorithm which, given any zonotope in  D presented as the Minkowski sum  = Nk=1 −1 1 · vk of N line segments, enumerates  all sign vectors  ∈ −1 1 N corresponding to vertices Nk=1 k vk of , using ON d−1  arithmetic operations. Fix p and d and  set D = dp − 1. Let A be any given set of n nonzero points in dspace. Let N = n p2 . For each a ∈ A and 1 ≤ r < s ≤ p let Vra s be the d × p − 1 matrix obtained by omitting the last column of the d × p matrix a ⊗ er − es . Let    = −1 1 · Vra s ⊂ d×p−1 a∈A 1≤r