You are on page 1of 10

Hardness Results for Coloring 3-Colorable 3-Uniform Hypergraphs 

Subhash Khot y
Department of Computer Science,
Princeton University
email : khot@cs.princeton.edu

Abstract 1. Introduction
In this paper, we consider the problem of coloring a 3- In this paper, we consider the problem of coloring 3-
colorable 3-uniform hypergraph. In the minimization ver- colorable 3-uniform hypergraphs. We consider both the
sion of this problem, given a 3-colorable 3-uniform hyper- minimization and maximization versions.
graph, one seeks an algorithm to color the hypergraph with A q -uniform hypergraph G = (V; E ) consists of a set of
as few colors as possible. We show that it is NP-hard to vertices V and a set of edges E . Every edge e 2 E is a
color a 3-colorable 3-uniform hypergraph with constantly size-q subset of the set of vertices (so graphs are 2-uniform
many colors. In fact, we show a stronger result that it is NP- hypergraphs). A hypergraph is said to be k -colorable if the
hard to distinguish whether a 3-uniform hypergraph with n vertices can be colored with k colors so that for every edge,
vertices is 3-colorable or it contains no independent set of not all of its vertices have the same color.
size Æn for an arbitrarily small constant Æ > 0. In the max-
imization version of the problem, given a 3-uniform hyper- Minimization Version : In the minimization version of
graph, the goal is to color the vertices with 3 colors so as the problem (also called Approximate Coloring problem),
to maximize the number of non-monochromatic edges. We we are given a k -colorable (hyper)graph where k is a small
show that it is NP-hard to distinguish whether a 3-uniform constant and we seek an algorithm to color the (hyper)graph
hypergraph is 3-colorable or any coloring of the vertices with as few colors as possible. It is well-known that it is NP-
with 3 colors has at most 89 +  fraction of the edges non- hard to test 3-colorability of graphs whereas Lovasz [19]
monochromatic where  > 0 is an arbitrarily small con- showed that it is NP-hard to test 2-colorability of 3-uniform
stant. This result is tight since assigning a random color hypergraphs. The known algorithms color k -colorable (hy-
independently to every vertex makes 89 fraction of the edges per)graphs with n
(1) colors. Table 1 gives (only) the best
non-monochromatic. known algorithmic results.
These results are obtained via a new construction of a There is a huge gap between these algorithmic results
probabilistically checkable proof system (PCP) for NP. We and the corresponding known hardness results. For 3-
develop a new construction of the PCP Outer Verifier. An colorable graphs, we only know that it is hard to color them
important feature of this construction is smoothening of the with 4 colors (see [14]). A big open problem is whether 3-
projection maps. colorable graphs are hard to color with constantly many col-
Dinur, Regev and Smyth [6] independently showed that ors. This problem seems well-beyond the scope of the cur-
it is NP-hard to color a 2-colorable 3-uniform hypergraph rent techniques. A natural question is whether one can get
with constantly many colors. In the “good case”, the hyper- any hardness results for hypergraph coloring. Indeed, Gu-
graph they construct is 2-colorable and hence their result is ruswami et al [8] showed hardness of coloring 2-colorable
stronger. In the “bad case” however, the hypergraph we 4-uniform hypergraphs with constantly many colors. The
construct has a stronger property, namely, it does not even known hardness results for (hyper)graph coloring are sum-
contain an independent set of size Æn. marized in Table 2.
A question left open by Guruswami et al [8] is whether
similar hardness results hold for 3-uniform hypergraphs (as
opposed to 4-uniform hypergraphs). We answer this ques-
 Full version of this paper is available at tion positively by proving that

Theorem 1.1 For every constant Æ > 0, it is NP-hard to
http://www.cs.princeton.edu/ khot/publications.html
y This work was done when the author was visiting the Dept of Com-
puter Science, University of California, Berkeley. distinguish whether an n-vertex 3-uniform hypergraph is
Holds for Colors with
Blum, Karger [4] 3-colorable graphs O~ (n3=14 ) colors
k-colorable graphs, O~ (n1 )
3
Karger et al [13] k+1 colors; Improvements by Halperin et al [9]
k3
Krivelevich et al [18] 2-colorable 3-uniform O~ (n1=5 ) colors
hypergraphs
Table 1 : Known algorithmic results for approximate (hyper)graph coloring.

Holds for Hardness of coloring with Assumption


Khanna et al [14] 3-colorable graphs 4 colors P 6= NP
Khot [15] k-colorable graphs, k 251 (log k) colors P 6= NP
all sufficiently large k
Guruswami et al [8] 2-colorable 4-uniform constantly many colors P 6= NP
hypergraphs
Guruswami et al [8] 2-colorable 4-uniform
( logloglogloglogn n ) colors NP 6 DTIME(nO(log log n) )
hypergraphs
NP 6 DTIME(2(log n)
O(1)
Khot [16] k-colorable 4-uniform (log n) k colors, )
hypergraphs, k  5 > 0 absolute constant
This paper 3-colorable 3-uniform constantly many colors P 6= NP
hypergraphs
Dinur et al [6] 2-colorable 3-uniform constantly many colors P 6= NP
hypergraphs
Table 2 : Known hardness results for approximate (hyper)graph coloring.

3-colorable or it contains no independent set of size Æn. In hypergraph is q -uniform, every constraint is defined on q
particular, it is NP-hard to color 3-colorable 3-uniform hy- variables. Assignment of colors to the vertices corresponds
pergraphs with constantly many colors. to assigning every variable a value from a domain of size k .
A constraint is satisfied iff all its variables are not assigned
Independent set in a hypergraph is defined as a set of the same value, or equivalently, the corresponding edge is
vertices such that no edge lies entirely within this set. In a non-monochromatic. We call this CSP to be the problem
properly colored hypergraph, the vertices colored with the NAE q;k . The optimum of the CSP is the maximum fraction
same color form an independent set. Thus saying that a hy- of the constraints that can be satisfied by any assignment.
pergraph contains no independent set of size Æn is stronger
than saying it cannot be colored with 1=Æ colors.
Note that assigning every variable a random value from
the size-k domain satisfies (1 kq1 1 ) fraction of the con-
Under a stronger complexity assumption than P6=NP, we
straints. The problem NAE q;k is said to have a Random
show the following stronger hardness result.
Threshold Property if it is NP-hard to do strictly better than
Theorem 1.2 Assuming NP 6 DTIME(2(log n)
O(1) assigning random values, or more specifically, if it is NP-
hard to distinguish whether the optimum is  1  or
), it is
hard to color a 3-colorable 3-uniform hypergraph on n ver-
tices with (log log n) colors. One can take = 1=9.  1 kq1 1 +  for arbitrarily small  > 0.
Håstad [11] showed that for the problem NAE 4;2 , a gap
Maximization Version : In the maximization version of (1; 78 + ) is hard, i.e. this problem has the random thresh-
the hypergraph coloring problem, we are given a q -uniform old property. This result was the basic starting point for the
hypergraph and k different colors. The goal is to assign one hypergraph coloring hardness result of Guruswami et al [8].
color to every vertex so as to maximize the number of edges On the other hand, Zwick [21] showed that for the problem
that are non-monochromatic. NAE 3;2 , there exists an algorithm that does strictly better
One can also think of the maximization version as a than the random assignment and this seemed to be a stum-
constraint satisfaction problem (CSP) on the Not-All-Equal bling block in extending Guruswami et al’s techniques to 3-
predicate. The vertices of the hypergraph are variables of uniform hypergraphs. Khot [17] investigated the case when
the CSP and the edges are constraints of the CSP. Since the the domain is ternary, i.e. the problem NAE 3;3 . He showed
that a certain conjecture about 2-prover games (which he We believe that the techniques in this paper will be quite
calls Unique Games Conjecture) implies that the problem useful in future. As another application of our techniques,
NAE 3;3 has the random threshold property. An open prob- we obtain simpler proofs of Håstad’s [11] hardness results
lem was whether such a result could be proven uncondition- for Max-3SAT and Set Splitting on satisfiable instances and
ally. In this paper, we solve this problem by showing that Holmerin’s [12] hardness result for vertex cover in hyper-
graphs. We omit these proofs from this paper due to space
Theorem 1.3 For every k  3, the problem NAE 3;k has limitations.
the random threshold property on satisfiable instances. In
particular, it is NP-hard to distinguish whether a 3-uniform Remark : Dinur, Regev and Smyth [6] obtained a better
hypergraph is 3-colorable or any coloring of the vertices hardness result for coloring 3-uniform hypergraphs, and our
with 3 colors has at most 89 +  fraction of the edges non- paper was partly influenced by their result. They show hard-
monochromatic where  > 0 is an arbitrarily small con- ness of coloring 2-colorable 3-uniform hypergraphs with
stant. constantly many colors. However their construction has
large independent sets (in fact independent sets of size n2 )
and in this respect, their result is weaker than Theorem
Our Techniques : Our results are obtained via new con- 1.1. Their techniques are quite different from ours. We use
structions of probabilistically checkable proofs (PCPs) for a new construction of the PCP outer verifier followed by
NP. Most PCP constructions in the recent years ([11], [8] Fourier analysis whereas Dinur et al make a clever use of
for example) can be divided into two components, the so- Kneser graphs and their construction is more combinatorial
called Outer PCP Verifier and the so-called Inner PCP Ver- in nature. It is interesting that they are able to get a hardness
ifier. The outer verifier used in all previous constructions result for 2-colorable 3-uniform hypergraphs even though
is based on the hardness of a 2-prover 1-round game ob- the problem NAE 3;2 does not have the random threshold
tained by parallel repetition of a basic 2-prover protocol for property. It would be nice, if possible, to obtain a Fourier
Max-3SAT. A crucial property of these games is that the an- analysis based proof of Dinur et al’s result.
swer of the first prover uniquely determines the answer of
the second prover if the verifier is to accept. Let us denote Overview of the paper : Section 2 introduces some of the
by (q1 ; q2 ) the pair of questions asked to the first and the tools we use in this paper. In Section 3, we describe the
second prover respectively. For every answer a of the first construction of the PCP outer verifier. We prove Theorem
prover, let q1 ;q2 (a) be the answer of the second prover if 1.1 in Section 4. Theorem 1.3 is proved in Section 5.
the verifier is to accept (this notation makes sense because
the answer of the first prover uniquely determines the an- 2 Preliminaries
swer of the second prover).
Khot [17] pointed out that one possible direction for get- 2.1. Probabilistic Checking of Proofs
ting hardness results for some open problems (Min-2CNF
deletion for example), is to show hardness of 2-prover Let PCP ;s [r(n); q (n)℄ be the class of languages L for
games with the following additional property : for every which there exists a probabilistic polynomial time verifier
question-pair (q 1 ; q2 ), and for every two distinct answers V with the following properties. The verifier has access to
a; a0 of the first prover, we have q1 ;q2 (a) 6= q1 ;q2 (a0 ). a proof . On input x with length n, the verifier uses r(n)
The main technique (which is simple but powerful) in random bits, queries q (n) bits from the proof  and then
this paper is to get a weaker analogue of the above prop- accepts or rejects. It satisfies :
erty. We will show hardness of 2-prover games where the (Completeness condition ) :
maps q1 ;q2 are “smooth”. For every question q 1 to the
first prover and every two distinct answers a; a0 of the first
x 2 L =) 9 proof  s:t: Pr [ V accepts  ℄ 
prover, we will have (Soundness condition ) :

Prq2 [ q ;q (a) 6= q ;q (a0 ) ℄  1(Smoothness property)


1 2 1 2
x 62 L =) 8 proofs ; Pr [ V accepts  ℄  s
Thus over the choice of the question q 2 to the second prover The parameters and s, 1  > s > 0, are called the
(conditional on the question to the first prover being q1 ), completeness and soundness parameters respectively.
the answers q1 ;q2 (a) and q1 ;q2 (a0 ) are distinct with high It was an amazing result by Arora and Safra [2] and
probability. Arora et al [1] that the class NP is precisely the class of
We combine this technique with the multi-layered con- languages that have a PCP verifier that uses only logarith-
struction of an outer PCP verifier by Dinur, Guruswami, mically many random bits and a constant number of queries.
Khot and Regev [5] (which they used to show q 1  We state this result below which is known as the PCP The-
hardness for vertex cover in q -uniform hypergraphs). orem.
Theorem 2.1 (The PCP Theorem, [2], [1]) Now we modify this protocol as follows : The verifier
picks a set W of u clauses at random and the set U of u
NP = PCP1; [O(log n); O(1)℄
1
2
variables as before. Then he picks a set W 0 of T u clauses at
random. He asks the first prover to give a satisfying assign-
The PCP Theorem has had a great impact on the the- ment to the set W 00 = W [ W 0 and asks the second prover
ory of inapproximability. Over the past decade, different to give a satisfying assignment to the set U 00 = U [ W 0 .
PCP constructions have led to several hardness results (in The verifier accepts iff the answers of the two provers agree
many cases optimal results) for various optimization prob- on the set U 00 .
lems, Håstad’s results for clique [10] and Max-3SAT [11] The clauses in W 0 are “dummy” in some sense and it
and Feige’s result for Set-Cover [7] to name a few. can be easily shown that this protocol too has soundness
2
(u) . Fix the set W 00 of (T + 1)u clauses, hence fixing
2.2. The Basic Outer Verifier the question asked to the first prover. The question asked
to the second prover can be equivalently viewed as picking
In this section we will explain the main technique in this a random subset W  W 00 of u clauses and taking one
paper. We begin with a result on hardness of Max-3SAT. variable from each clause in W giving a set of variables U .
The question to the second prover is U 00 = U [ (W 00 n W ).
The PCP Theorem is equivalent to saying that Max- 00 00
3SAT is MAX-SNP hard. With a suitable transformation, Let  U ;W be the projection that maps an assignment to
one can show that the hardness holds for 3SAT formulae the set W 00 to its sub-assignment to the set U 00 .
with a regular structure, i.e. every clause contains exactly The following lemma gives a crucial property of this
3 variables and every variable appears in exactly 5 clauses construction. The significance of this lemma is quite tech-
(see [7]). We call such formulae to be instances of 3-SAT-5. nical in nature and it will be evident only when we get to
Thus we have the theorem : the actual analysis in Section 4.

Theorem 2.2 For some absolute constant < 1 , it is NP- Lemma 2.4 (Smoothness property) For fixed W 00 and
hard to distinguish whether a 3-SAT-5 instance is satisfiable any two distinct assignments 1 ; 2 to W 00 ,

Pr U 00 U 00 ;W 00 (1 ) 6= U 00 ;W 00 (2 )  1 T1
(called the YES instance) or there is no assignment satisfy- h i
ing a fraction of the clauses (called the NO instance).

Let’s first review the standard construction of the outer Proof: The assignments 1 ; 2 differ on at least one clause
verifier based on the 3-SAT-5 instance given by the above C0 2 W 00 . Note that W is a random subset of W 00 with
theorem (see [3], [10], [11]). The construction can be
viewed as a 2-prover 1-round game. Let u be a parame-
jW j = u and jW 00 j = (T +1)u. Hence with probability 1
1
T over the choice of the set W , C0 62 W and consequently
ter, thought of as a large constant. The verifier picks a set C0 2 U 00 . Whenever this happens, the sub-assignments of
of u clauses at random, say W = fCi j i = 1; 2; : : : ; ug, 1 ; 2 to the set U 00 are distinct.
and asks the first prover to give a satisfying assignment to
these clauses. For every i, the verifier picks a variable xi at We combine this basic technique with the multi-layered
random from the clause Ci . Let U = fxi ji = 1; 2; : : : ; ug construction of an outer PCP verifier by Dinur, Guruswami,
be the set of these variables. The verifier asks the second Khot and Regev [5]. The combined construction appears in
prover to give an assignment to the set U . The verifier ac- Section 3. A proof for a PCP verifier is obtained by writing
cepts iff the answers of the two provers agree on the set of down the answers of the provers in the 2-prover protocol.
variables U . If we denote  U;W to be the projection that For the sake of convenience, we avoid using the language of
maps an assignment to W to its sub-assignment to U , the 2-prover games, and instead talk in terms of written proofs.
verifier accepts iff the answer of the first prover is  and the
answer of the second prover is  U;W ( ). 2.3. Long Codes over GF (k ) and Fourier Analysis
Clearly, if the 3-SAT-5 formula is satisfiable (i.e. it is a
YES instance), the provers have a strategy to make the veri- Long code (over GF (k )) over a domain M is indexed by
fier accept with probability 1 (completeness property). The all functions f : M 7! f1; !; ! 2; : : : ; ! k 1 g where ! is the
Parallel Repetition Theorem of Raz [20] gives the sound- kth root of unity, i.e. ! = e2i=k . Let
ness property :
F := ff j f : M 7! f1; !; !2; : : : ; !k 1 gg
Theorem 2.3 If the 3-SAT-5 instance is a NO instance, then
the provers in the above 2-prover 1-round protocol have no The long code A of a 2 M is defined as
strategy that makes the verifier accept with probability more
than 2
(u) . A(f ) = f (a) 8 f 2 F
C
Consider the space of all “tables” A : F 7! . In particular, A block U 2 Vi contains (L i)u variables and (T + i)u
a long code is one such table. Consider the characters  clauses, and thus a total of (L i)u + 3(T + i)u variables.
where : M 7! GF (k ). There is one such character for A satisfying assignment to the block U is an assignment to
every . The character  is a table defined by these (L i)u +3(T + i)u variables such that the assignment
Y satisfies all the clauses in U . Let MU denote the set of all
 (f ) := f (x) (x) satisfying assignments to the block U .
x2M If U is a sub-block of W , then every satisfying assign-
The characters form an orthonormal basis under the follow- ment to W can be restricted to a satisfying assignment to U .
ing definition of inner product of tables. For tables A1 ; A2 , Let the map  U;W : MW 7! MU denote this operation of
define taking a sub-assignment/restriction.
< A1 ; A2 > := jM j
1 X
A (f )A2 (f ) = Ef A1 (f )A2 (f )
 
k f 2F 1 Definition 3.3 We construct a multi-layered graph G =
G (L; T; u) as follows :
P
It follows that any table can be expanded as A = A b 
1. The graph has (L + 1) layers numbered 0 through L
P
jA j =
2
where A are the Fourier coefficients with and Vi is the set of vertices in the ith layer. Thus the
b b
<
P
A; A > . When A : F 7! f1; !; : : : ; ! g, we have
k 1
vertices in the ith layer are the blocks of type-i.
jA j = 1. The Fourier coefficients are given by
b 2
  2. There are no edges between the vertices of the same
Ab = < A;  > = Ef A(f ) (f ) layer.
In particular when  0, the value of the coefficient Ab0 is 3. For 0  i < j  L, the edges between layers i and
just Ef [A(f )℄. j are defined as follows : For a pair of blocks U 2 Vi
and W 2 Vj such that U is a sub-block of W , let
3. A New Outer PCP Verifier (U; W ) be an edge of the graph.
In this section, we explain the construction of the outer Since the 3-SAT-5 instance is regular, the bipartite graph
PCP verifier from a 3-SAT-5 instance given by Theorem 2.2. between every pair of layers is regular.
The variables will be denoted by x1 ; x2 ; : : : and the clauses
Definition 3.4 An assignment  to the graph G (L; T; u)
by C1 ; C2 ; : : :.
assigns a satisfying assignment (U ) 2 MU to every block
This construction is obtained by combining the tech-
U . If U is a sub-block of W , then we say that  satisfies an
edge (U; W ) of the graph if (U ) is a restriction of (W ).
nique in Section 2.2 with a construction due to Dinur, Gu-
ruswami, Khot and Regev [5]. The construction has param-
eters (L; T; u). Think of T  L  1.
In other words

Definition 3.1 For 0  i  L, a type-i block is defined  satisfies the edge (U; W ) if U;W ((W )) = (U )
to be the union of a set of (L i)u variables and a set of
(T + i)u clauses. The family of all type-i blocks is denoted Let OP T (G ; i; j ) denote the maximum fraction of the
by V i . edges satisfied between layers i; j by any assignment  to
these layers.
Remarks : (1) A block is union of a set of variables and a
set of clauses. The fact that the components are sets and not Dinur et al [5] proved the following theorem. The first
tuples is important. By definition, there is no order associ- property in this theorem follows from Raz’s Parallel Rep-
ated with elements of a set. (2) The construction of Dinur etition Theorem and the second property follows from the
et al [5] corresponds to the case T = 0. regular structure of the graph G . The second property is a
Definition 3.2 For 0  i < j  L, we say that a block weak notion of expansion.
U 2 Vi is a sub-block of a block W 2 Vj if one can obtain Theorem 3.5 For every constant Æ > 0 and every L 
block U by replacing (j i)u clauses fC l j l = 1; 2; ::; (j 4=Æ, the multi-layered graph G (L; T; u) has the following
i)ug in W by (j i)u variables fxl j l = 1; 2; :::; (j i)ug properties :
such that the variable xl is contained in the clause Cl for
1  l  (j i)u.  If the 3-SAT-5 instance is a YES instance, there ex-
For W 2 Vj , a random sub-block of W of type-i is ob- ists an assignment  that satisfies every edge in the
tained by choosing (j i)u clauses at random from the graph G . If the 3-SAT-5 instance is a NO instance,
(T + j )u clauses in W and replacing each clause by one of OP T (G ; i; j )  2
((j i)u)  2
(u) for every pair
the variables appearing in that clause picked at random. of layers i < j .
 Consider any k layers numbered i1 < i2 < : : : < ik We expect the proof to contain for every block U , the
where k = 4=Æ . Choose any sets Sil  Vil with long code AU (over GF (3)) of a supposed satisfying as-
jSil j = ÆjVil j for 1  l  k. Then there exist two signment to U . So this long code is over the domain MU .
layers numbered il and il0 such that the number of Let ! = e2i=3 be the cube root of unity. The long code is
edges between the sets Sil and Sil0 is at least a fraction indexed by all functions f 2 FU where
Æ3 =16 of the total number of edges between the layers
il and il0 . FU := ff j f : MU 7! f1; !; !2gg
Now we show the smoothness of the maps  U;W , simliar The verifier’s action is :
to the Lemma 2.4.  Pick 0  i < j  L at random.
Lemma 3.6 Let 0  i < j  L and W be a block of type-  Pick a random block W of type-j and its random sub-
j . Let ; 0 2 MW be two distinct satisfying assignments block U of type-i. Let  U;W : MW 7! MU be the
to W . If U is a random sub-block of W of type i, then projection between U and W . Let A = AU and B =
AW be the supposed long codes of assignments to the
Pr U U;W () = U;W (0 )  TL
  blocks U and W . These long codes are indexed by
functions f 2 FU and g 2 FW respectively.
Proof: ;  0 differ in at least one bit. The projection  U;W  Pick random functions f 2 FU ; g 2 FW . Pick a func-
preserves the variables in W , but replaces some clauses by tion  2 FW by defining for every x 2 MW
variables. If ;  0 differ on a variable in W , their projections 
under  U;W are still distinct. Otherwise they differ on some (x) := ! with probability 1/2
clause, say clause C0 . For a choice of a random sub-block !2 with probability 1/2
U , one replaces at random (j i)u clauses out of the (T +
j )u clauses in W . With probability 1 Tj +ij  1 TL , the  Let h 2 FW be defined as h := g  f Æ U;W  
clause C0 is not replaced and hence projections of ;  0 are  Accept iff
distinct.
Not-All-Equal(A(f ); B (g ); B (h))
Lemma 3.7 Let 0  i < j  L and W be a block of
type-j . Let  MW be non-empty and  2 . If U is Remark : The above construction of the hypergraph moti-
a random sub-block of W of type-i, then with probability vates the use of the multi-layered version of the outer PCP
1 j jL=T we have, verifier. Lets see what happens when the outer verifier has
only 2 layers (as is the case with the standard outer verifier
8 0 2 ; 0 6= ; U;W () 6= U;W (0 ) used in [8], [11]). Every edge of the hypergraph will have
0
one vertex in the first layer and two vertices in the second
Proof: Just apply the previous lemma to every 2
; 0 6=  and take a union bound.
layer. Coloring all the vertices in the first layer with one
color and all the vertices in the second layer with another
color will give a proper 2-coloring of the hypergraph. Thus
we won’t be able to claim that the hypergraph is not col-
4. Hardness of 3-Uniform Hypergraph Color- orable with constantly many colors. However this “cheating
ing strategy” breaks down with the multi-layered construction.

In this section, we prove Theorem 1.1. We construct a 4.1. Completeness


PCP verifier that expects a proof over the domain GF (3).
The verifier reads 3 symbols from the proof and accepts iff In a correct proof, A will be the long code of some a 2
not all of the 3 symbols are equal. We let the locations in MU and B will be the long code of some b 2 MW , with
the proof to be the vertices of a hypergraph and the tests U;W (b) = a. In that case
of the verifier (reading 3 locations) to be the edges of the
hypergraph. This defines a 3-uniform hypergraph. If the 3- A(f ) = f (a); B (g) = g(b);
SAT-5 instance is a YES instance, there exists a proof that
the verifier always accepts and hence the hypergraph is 3- B (h) = h(b) = g(b)f (U;W (b))(b) = g(b)f (a)(b)
colorable. If the 3-SAT-5 instance is a NO instance, we Hence A(f ); B (g ); B (h) can’t all be equal, since  takes
show that the hypergraph has no independent set of relative values only in the set f!; ! 2 g. Thus the test always accepts
size Æ . a correct proof. Hence the hypergraph is 3-colorable.
4.2. Soundness which can be expanded using Fourier expansions of tables
A and B as
We will show that the size of any independent set in the  X 
hypergraph is at most Æ fraction of the size of the whole hy- EU;W;f;g; Ab Bb Bb  (f )  (g)  (gf Æ )
pergraph. The hypergraph is a weighted hypergraph. The ; ;
vertices in the same layer have equal weight. The total
weight of all the vertices in any layer is L+1 1 . Since there where we denoted  =  U;W for notational convenience.
are L + 1 layers, the total weight of all the vertices in the Note that ; : MW 7! GF (3) and : MU 7! GF (3).
hypergraph is 1. The expression can be written as
Let I be any set of locations in the proof with weight Æ .  X
We will show that there is at least a constant fraction of the EU;W;f;g; Ab Bb Bb
tests for which all the 3 queries lie in the set I . This means ; ;
that for any set of vertices with weight Æ in the hypergraph, 
at least a constant fraction of the edges are contained in this  (f )  (g)  (g)  (f Æ )  () =
set. Hence there is no independent set of size Æ .  X 
EU;W;f;g; Ab Bb Bb  (f )  (g)  (f Æ )  ()
Define tables AU as follows : For every block U , ; ;

AU (f ) := 1 if the location AU (f ) 2 I This expectation is zero unless = . Also  (f Æ ) =
0 otherwise 3 ( ) (f ) if we let 3 ( ) : MU 7! GF (3) be a function
Remark : The tables AU are 0-1 tables whereas the proofs
defined as
are supposed to be over the alphabet f1; !; ! 2 g. However For y2 MU ; 3 ( )(y) :=
X
(x)
0-1 tables still make sense in the Fourier analysis we do and x : (x)=y
in fact going to 0-1 tables is a trick we exploit.
Call a block U “good” if at least Æ=2 fraction of the lo- where the sum is over GF (3)). Hence the expression sim-
cations in the table AU are set to 1. Equivalently, U is good plifies to
if the Fourier coefficient A
bU;0 of the table AU satisfies
X 
EU;W;f;g; Ab Bb 2  3 ( ) (f )  ()
AbU;0  Æ=2 ;
By an averaging argument, at least Æ=2 fraction of the
blocks U are good. Again by an averaging argument, in at
Again, the expression is zero unless = 3 ( ). Defining
“cardinality” of as
least Æ=4 fraction of the layers, at least Æ=4 fraction of the
blocks are good. The number of such layers is  Æ=4  L 
O(1=Æ) provided L > O(1=Æ2 ).
j j = jfx j x 2 MW ; (x) 6= 0gj
 
By the “weak expansion property” of the multi-layered we get E  () = ( 12 )j j . Thus the expression further
graph (see Theorem 3.5), there exist two layers i0 < j0 such simplifies to
that at least Æ=4 fraction of the blocks in each of the two
layers are good and the number of edges between the good X
Ab3 ( ) Bb 2 (
1 j j 
blocks is at least Æ 0 =
(Æ 3 ) fraction of the total number of EU;W
2) (2)
edges between the layers i0 and j0 . Fix these layers i0 and
j0 for the rest of the proof. Lemma 4.1 The terms in (2) with 6 0
3 ( ) = can be
Consider the process of picking a random block W from
bounded in magnitude by 2
(u) .
layer j0 and its random sub-block of type-i0. As noted,
with probability Æ 0 , both W and U are good. Denoting by
A = AU and B = AW , we have
Proof: Using a standard argument, we will show that if the
terms with 3 ( ) 6= 0 have “high” magnitude, then there

EU;W Ab0 Bb02

 (Æ=2)3 Æ0 = Æ00 (say) (1) exists an assignment  to the blocks in layers i0 and j0
(see Definition 3.4) which satisfies a “good” fraction of the
Now consider the probability that all the three queries edges between layers i0 and j0 . Using Cauchy-Schwartz
A(f ); B (g); B (h) lie in the set I . Every proof location is 1 inequality,
or 0 depending on whether it is in the set I or not. Hence 2 3

( 12 )j j5
this probability is the expectation X
 
EU;W 4
Ab3 ( ) Bb 2
EU;W;f;g; A(f )B (g)B (h) :3 ( )6=0
2 3
v sX We will construct a PCP verifier that reads 3 symbols
jAb ( ) j2 jBb j2 ( 21 )2j j
u X
 6u
EU;W 4 t jBb j2 75 from a proof over the alphabet GF (k ), has perfect com-
pleteness and soundness k k2 1 +  where  > 0 is an arbi-
3 2
:3 ( )6=0
v 2 3
trary constant. This clearly suffices to prove Theorem 1.3.
u
u
jAb ( )j2 jBb j2 j 1 j 5
X The verifier is based on the construction of the outer ver-
u
 tEU;W 4 3
(3) ifier in Section 3. However we do not need the “weak ex-
: ( )6=0
3 pansion property” of this construction (given by property-2
P of Theorem 3.5). We make a crucial use of Lemma 3.7.
Note that A; B are 0-1 tables and thus jA b j2  1
After picking a block Wj and its sub-block Wi with a
P
and jB b j2  1. The assignment  is defined as fol- suitable distribution, the verifier’s test is very similar to the
lows : For a block W , pick with probability jB b j2 , test given in Section 4. Note however that we use long codes
pick a random x 2 MW such that (x) 6= 0 and de- over GF (k ).
fine (W ) = x. For a block U , pick with probabil- Action of the verifier :
ity jAb j2 , pick a random y 2 MU such that (y ) 6= 0
and define (U ) = y . The summation in (3) is precisely  Pick a random block WL of type-L. For i = L; L
the probability that  ((W )) = (U ). This is bounded 1; L 2; : : : ; 1, let Wi 1 be a random sub-block of
by OP T (G ; i0 ; j0 )  2
(u) as claimed (see Theorem type-(i 1) of the (already chosen) type-i block Wi .
3.5).
 Let Ai = AWi be the supposed long code of a satis-
Next, we note that the terms in (2) with j j  log T have fying assignment to the block Wi . Let MWi be the
magnitude at most ( 12 )log T = T1 which can be assumed to set of satisfying assignments to the block Wi . For
be negligible by choosing T large enough. 0  i < j  L, let i;j = Wi ;Wj : MWj 7! MWi
Thus we are left with terms where 3 ( ) = 0 and j j  be the projection function between Wi and Wj .
log T . Fix W and consider the case when 6= 0, i.e. there
exists x 2 MW such that (x) 6= 0. Lemma 3.7 shows that  Pick 0  i < j L at random.
over the choice of a random sub-block U , with probability
1 log T  L=T , we have  Pick functions f : MWi 7! f1; !; ! 2; : : : ; ! k 1 g; g :
MWj 7! f1; !; !2; : : : ; !k 1 g at random.
8 x0 s:t: (x0 ) 6= 0; x 6= x0 we have (x) 6= (x0 )
 Pick a function  : MWj 7! f!; !2g by defining for
Whenever this happens, 3 ( ) 6= 0. Hence the probability every x 2 MWj
that 3 ( ) = 0 is at most logTT L which is negligible. Thus
the terms with 3 ( ) = 0; 6= 0 and j j  log T have 
! with probability 1/2
negligible magnitude. (x) := !2 with probability 1/2
Finally we are left with the single term
 
EU;W Ab0 Bb02  Let h : MWj 7! f1; !; !2; : : : ; !k 1 g be defined as
h := g2  f Æ i;j  
which is at least Æ 00 as observed before (Equation (1)). As-
suming the magnitude of all the terms neglected to be at  Accept iff
most Æ 00 =2, it follows that at least Æ 00 =2 fraction of the tests
between layers i0 and j0 have all the 3 queries in the set Not-all-equal(Ai (f ); Aj (g ); Aj (h))
I . In particular, I cannot be an independent set. This com-
pletes the proof. Theorem 1.2 is proved by plugging in ap-
propriate super-constant values of the parameters L; T; u in 5.1. Completeness
the analysis. We omit the details.
Clearly, the test always accepts a correct proof con-
5. Hardness of the Problem NAE3;k structed from a satisfying assignment to the 3-SAT-5 in-
stance using correct long codes.
In this section, we prove Theorem 1.3. We give a proof
in the special case when k  3 is a prime. The proof can be 5.2. Soundness
easily generalized for any odd integer k  3. However one
requires a somewhat different proof for even k and we omit We will show that if the 3-SAT-5 instance is a NO in-
stance, the test accepts with probability at most k k2 1 + .
2
it from this paper.
Lemma 5.1 If x; y; z 2 f1; !; !2; : : : ; !k 1 g, then the ex- terms with 6= 0; k ( ) = 0 and j j  O(k 2 log T ). Us-
pression ing Lemma 3.7, for a fixed Wj , over the choice of a random
sub-block Wi , the probability that k ( ) = 0 is at most
1 k12
X
xr yr z r 1 2 3
(r1 ; r2 ; r3 2 GF (k)) O(k2 log T )L which is negligible. Hence these terms can
T
r +r +r =0
1 2 3 also be ignored.
is 0 if x = y = z and 1 otherwise. Thus we are left with only those terms where = 0. The
acceptance probability is
From this lemma, it is clear that the acceptance probabil-
Pr[acc℄ = 1
1 X
Abri;10 Abrj;20 Abrj;30 + 0 (4)
ity of the verifier is k2 r1 +r2 +r3 =0
1 k12
X  
E WL ;WL 1 ;:::;W0 ; Ai (f )r1 Aj (g)r2 Aj (h)r3 where 0 takes into account the terms neglected. Let
i;j;f;g;
r1 +r2 +r3 =0 kX1
Fix WL ; WL 1 ; : : : ; W0 ; i; j for the time being and con- Abi;0 = ail  !l
sider the expectation over f; g; . For notational conve- l=0
P
nience, we will drop the E [℄ notation. Expanding the tables where ail are non-negative real numbers with kl=01 ail =
Ari 1 ; Arj 2 ; Arj 3 by their Fourier expansions, we get 1. Note that ail is just the fraction of entries in the table Ai
which are equal to ! l . It follows that
1 k12
X X
Abri; Abrj; Abrj; 1 2 3
kX1
r +r +r =0 ; ;
1 2 3 Abri;0 = ail  !rl
l=0
 (f ) (g) (g2 f) Hence the summation in (4) can be written as
The expectation over g is zero unless + 2 = 0. Let X X
 = i;j and let k ( ) be a function k ( ) : MWi 7! ail1 ajl2 ajl3 !r1l1 +r2 l2 +r3 l3
r1 +r2 +r3 =0 l1 ;l2 ;l3
GF (k) defined as X X
X = ail1 ajl2 ajl3 !r1 l1 +r2 l2 +r3 l3
8 y 2 MWi ; k ( )(y) := (x) l1 ;l2 ;l3 r1 +r2 +r3 =0
x2MWj :(x)=y The inner summation is 0 unless l1 = l2 = l3 and k 2 other-
wise. So the sum reduces to
With this definition, the expectation over f is zero unless
= k ( ). Thus the acceptance probability is kX1
k2 ail a2jl
1 k12
X X l=0
Abri;k ( ) Abrj; 2 Abj;  E [ ()℄
1 2 r3
r +r +r =0 Recall that the verifier first picks blocks
WL ; WL 1 ; : : : ; W0 and then picks i; j at random.
1 2 3

It can be proved in a similar way as [16, Lemma 8] that Taking expectation over the choice of 0  i; j  L and
applying lemma 5.2, we get
E [ ()℄ = (1
( 2 ))j j  Q( )
1
k  kX1 
Pr[acc℄ = 1 E 0i<j L ail a2jl + 0
where Q( ) is a complex number with absolute value 1. l=0

 1 k12 + L2 + 0
Therefore the acceptance probability is

1 k12
X X
Abri;k ( ) Abrj; 2 Abj; 
r3
Let fail : 0  i  L; 0  l  k 1g be a
1 2
Lemma 5.2
r +r +r =0
1 2 3 collection of non-negative reals such that

(1
( k12 ))j j  Q( )
kX1
ail = 1 8i
l=0
Using Lemma 4.1, the terms with k ( ) 6= 0 can be used
to define a “good” assignment to the blocks in layers i and Then we have
j . Therefore these terms can be assumed to be arbitrarily  kX1
 k12 L2

small in magnitude. The terms with j j  O(k 2 log T ) have E 0i<jL ail a2jl
magnitude at most T1 which is negligible. Now consider the l=0
Proof: Let ail = (1 + bil )=k so that Pkl=01 bil = 0 for [7] U. Feige. A threshold of ln n for approximating set cover.
every i. Also 1  bil  k 1. Journal of the ACM, 45(4), 634–652, July 1998.
[8] V. Guruswami, J. Håstad, and M. Sudan. Hardness of ap-
1 1
ail a2jl = 2 ail (1 + bjl )2 = 2 ail (1 + 2bjl + b2jl )
proximate hypergraph coloring. In Proc. of the 41st IEEE
k k Symposium on Foundations of Computer Science, pages
149–158, 2000. (The references are to the journal version
 k12 ail (1 + 2bjl ) = k13 (1 + bil )(1 + 2bjl ) = of this paper which is in preparation).

1 (1 + b + 2b + 2b b ) [9] E. Halperin, R. Nathaniel and U. Zwick. Coloring k-


il jl il jl colorable graphs using smaller palettes. In Proc. of the 12th
k3 Annual Symposium on Discrete Algorithms, 319–326, 2001.
Pk 1
Summing over l and noting that l=0 bil = 0, we get [10] J. Håstad. Clique is hard to approximate within n1  . In
kX1 kX1 Proc. of the 37th Annual IEEE Symposium on Foundations
1
ail a2jl  2 + 3
2
b b : of Computer Science, pages 627–636, 1996.
k l=0 il jl
Finally
l=0 k [11] J. Håstad. Some optimal inapproximability results. In Proc.
of the 29th Annual ACM Symposium on Theory of Comput-
L L

E 0i<jL bil bjl =
 1 X
( bil )2
X
b2il
 ing, pages 1–10, 1997.
2 L+1
2 i=0 i=0 [12] J. Holmerin. Vertex cover on 4-regular hyper-graphs is hard
1 2 to approximate within 2 . To appear in Proc. of the 34th
 L+1 ( (L + 1)(k 1)2 )  kL Annual ACM Symposium on Theory of Computing, 2002.
2 2 [13] D. Karger, R. Motwani, and M. Sudan. Approximate graph
coloring by semidefinite programming. In Proc. of the
35th IEEE Symposium on Foundations of Computer Science,
1994.
6. Acknowledgments
[14] S. Khanna, N. Linial and S. Safra. On the hardness of ap-
I sincerely thank Irit Dinur, Oded Regev and Cliff Smyth proximating the chromatic number. In Proc. of the 2nd Is-
for showing their hypergraph coloring result to me. Many rael Symposium on Theory and Computing Systems, ISTCS,
250–260, IEEE Computer Society Press, 1993.
thanks to Venkatesan Guruswami for several helpful discus-
sions on the subject. [15] S. Khot. Improved inapproximability results for maxclique,
chromatic number and approximate graph coloring. In Proc.
of the 42nd IEEE Symposium on Foundations of Computer
References
Science, 2001.

[1] S. Arora, C. Lund, R. Motawani, M. Sudan, and M. Szegedy. [16] S. Khot. Hardness results for approximate hypergraph color-
Proof verification and the hardness of approximation prob- ing. To appear in Proc. of the 34th Annual ACM Symposium
lems. Journal of the ACM, 45(3):501–555, 1998. on Theory of Computing, 2002.

[2] S. Arora and S. Safra. Probabilistic checking of proofs : A [17] S. Khot. On the power of unique 2-prover 1-round games.
new characterization of np. Journal of the ACM, 45(1):70– To appear in Proc. of the 34th Annual ACM Symposium on
122, 1998. Theory of Computing, 2002.

[3] M. Bellare, O. Goldreich, and M. Sudan. Free bits, pcps and [18] M. Krivelevich, R. Nathaniel and B. Sudakov. Approximat-
non-approximability. Electronic Colloquium on Computa- ing coloring and maximum independent set in 3-uniform hy-
tional Complexity, Technical Report TR95-024, 1995. pergraphs. In Proc. of the 12th Annual Symposium on Dis-
crete Algorithms, 2001.
[4] A. Blum and D. Karger. An O~ (n3=14 ) coloring algorithm for
[19] L. Lovasz. Coverings and colorings of hypergraphs. In Proc.
3-colorable graphs. Information Processing Letters, 61:49–
of 4th Southeastern Conf. on Combinatorics, Graph Theory
53, 1997.
and Computing, pages 3–12. Utilitas Mathematica Publish-
[5] I. Dinur, V. Guruswami, S. Khot and O. Regev. k 1 ing, Winnipeg, 1973.
 hardness for vertex cover in k -uniform hypergraphs. In
[20] R. Raz. A parallel repetition theorem. SIAM J. of Comput-
preparation.
ing, 27(3):763–803, 1998.
[6] I. Dinur, O. Regev and C. Smyth. The hardness of 3-uniform [21] U. Zwick. Finding almost satisfying assignments. In Proc.
hypergraph coloring. Appearing in Proc. of the 43rd An- of the 30th Annual ACM Symposium on Theory of Comput-
nual IEEE Symposium on Foundations of Computer Science, ing, pages 551–560, 1998.
2002.

You might also like