Sloppy conversion from lecture slides + extra info

Notation

This note uses $\neg x$ for negation. SAT literature often uses bar notation $\overset{x}{ˉ}$ instead, which is more compact but can look like a different variable.

Practical SAT Solving

SAT solvers work on formulas in normal form because arbitrary tree-structured formulas are hard to simplify globally (changes only affect subtrees).

Why CNF over DNF?

Both CNF and DNF can represent any propositional formula, but they have complementary strengths:

DNF (Disjunctive Normal Form): OR of ANDs, e.g., $(a \land b) \lor (c \land \neg d)$
Satisfiability is trivial (just check if any conjunction is consistent), but converting to DNF can be exponential.

CNF (Conjunctive Normal Form): AND of ORs, e.g., $(a \lor b) \land (c \lor \neg d)$
Refutability is easy to show (find an empty clause), and conversion can be done in polynomial time. CNF is the standard input format for SAT solvers (DIMACS format).

Transformation to CNF

Approach 1: Multiplication

Produces an equivalent formula. Three steps:

Eliminate derived connectives:
$ϕ \to ψ \Leftrightarrow \neg ϕ \lor ψ$ $ϕ \leftrightarrow ψ \Leftrightarrow (ϕ \to ψ) \land (ψ \to ϕ)$ $ϕ \oplus ψ \Leftrightarrow \neg (ϕ \leftrightarrow ψ)$
Convert to Negation Normal Form (NNF): push negations inward using De Morgan’s laws, eliminate double negation. Result: only $\land$ , $\lor$ , and literals.
Distribute $\lor$ over $\land$ to get CNF.

Distributivity of OR over AND

The core rule:
$A \lor (B \land C) \equiv (A \lor B) \land (A \lor C)$
Symmetrically:
$(A \land B) \lor C \equiv (A \lor C) \land (B \lor C)$
Why it works: For the left side to be true, either $A$ is true (making both clauses on the right true), or both $B$ and $C$ are true (again satisfying both clauses). The equivalence holds in both directions.

The goal is to push all $\lor$ s inside all $\land$ s. When a disjunction sits above a conjunction, distribute. Repeat until every $\lor$ is at the innermost level (forming clauses) and every $\land$ is at the outermost level (conjoining clauses).

Distributivity step-by-step

Transform $(p \land q) \lor (r \land s)$ to CNF.

This is a disjunction of two conjunctions. We need to “multiply out” like $(a + b) (c + d) = a c + a d + b c + b d$ , but with $\lor$ as multiplication and $\land$ as addition.

Step 1: Treat as $X \lor (r \land s)$ where $X = (p \land q)$
$= (X \lor r) \land (X \lor s)$ $= ((p \land q) \lor r) \land ((p \land q) \lor s)$
Step 2: Distribute in the left conjunct: $(p \land q) \lor r$
$= (p \lor r) \land (q \lor r)$
Step 3: Distribute in the right conjunct: $(p \land q) \lor s$
$= (p \lor s) \land (q \lor s)$
Result:
$(p \lor r) \land (q \lor r) \land (p \lor s) \land (q \lor s)$
Four clauses from two 2-literal conjunctions: $2 \times 2 = 4$ clauses.

General pattern

When distributing $(A_{1} \land \dots \land A_{n}) \lor (B_{1} \land \dots \land B_{m})$ :

Each $A_{i}$ must pair with each $B_{j}$ to form a clause $(A_{i} \lor B_{j})$ .

Result: $n \times m$ clauses:
$i = 1 ⋀ n j = 1 ⋀ m (A_{i} \lor B_{j})$
Example: $(a \land b \land c) \lor (d \land e)$

$3 \times 2 = 6$ clauses:
$(a \lor d) \land (a \lor e) \land (b \lor d) \land (b \lor e) \land (c \lor d) \land (c \lor e)$

Chained disjunctions

For $(A_{1} \land B_{1}) \lor (A_{2} \land B_{2}) \lor (A_{3} \land B_{3})$ :

First combine the first two terms, then combine with the third.

Step 1: $(A_{1} \land B_{1}) \lor (A_{2} \land B_{2}) = (A_{1} \lor A_{2}) \land (A_{1} \lor B_{2}) \land (B_{1} \lor A_{2}) \land (B_{1} \lor B_{2})$

Step 2: Now OR this 4-clause conjunction with $(A_{3} \land B_{3})$ . Each of the 4 clauses distributes with both $A_{3}$ and $B_{3}$ :
$4 \times 2 = 8 clauses$
In general, $(A_{1} \land B_{1}) \lor \dots \lor (A_{n} \land B_{n})$ yields $2^{n}$ clauses.

Exponential blowup

Distributivity can cause exponential growth. For example, transforming $(a_{1} \land b_{1}) \lor (a_{2} \land b_{2}) \lor \dots \lor (a_{n} \land b_{n})$ to CNF produces $2^{n}$ clauses.

Worked example

Transform $\neg (a \leftrightarrow b) \to (\neg (c \land d) \land e)$

Step 1a (remove equivalences):
$\neg ((a \to b) \land (b \to a)) \to (\neg (c \land d) \land e)$

Step 1b (remove implications):
$\neg\neg ((\neg a \lor b) \land (\neg b \lor a)) \lor (\neg (c \land d) \land e)$

Step 2 (NNF via De Morgan + double negation):
$((\neg a \lor b) \land (\neg b \lor a)) \lor ((\neg c \lor \neg d) \land e)$

Step 3 (distribute $\lor$ over $\land$ ):

We have $(P \land Q) \lor (R \land S)$ where:
$P = (\neg a \lor b)$ , $Q = (\neg b \lor a)$ , $R = (\neg c \lor \neg d)$ , $S = e$

Applying the $n \times m$ pattern: each element of the left conjunction pairs with each element of the right:
$(P \lor R) \land (P \lor S) \land (Q \lor R) \land (Q \lor S)$
Substituting back and flattening nested ORs:
$(\neg a \lor b \lor \neg c \lor \neg d) \land (\neg a \lor b \lor e) \land (\neg b \lor a \lor \neg c \lor \neg d) \land (\neg b \lor a \lor e)$

Approach 2: Tseitin Encoding

Produces an equisatisfiable formula: same satisfiability status, but not logically equivalent (introduces new variables). Polynomial complexity.

The idea: instead of distributing (which explodes), introduce a fresh variable for each subformula and constrain it to equal that subformula’s value.

Process:

Label each non-literal subformula with a fresh variable $v_{i}$
For each labeled subformula, write a definition $v_{i} \leftrightarrow (operator applied to children)$
The children are either literals or labels of subformulas (not the subformulas themselves).
AND all definitions together, then convert to CNF using Approach 1
AND the root label (asserting the whole formula must be true)

Why no blowup: Each definition has the form $v \leftrightarrow (x \circ y)$ where $\circ$ is a binary connective, or $v \leftrightarrow \neg x$ for negation. That’s at most 3 variables per definition. Converting a 3-variable equivalence to CNF produces a fixed small number of clauses (see below), no matter how large the original formula.

Converting definitions to CNF

An equivalence $v \leftrightarrow ψ$ means $(v \to ψ) \land (ψ \to v)$ . Convert each implication separately, then AND together.

Conjunction: $v \leftrightarrow (a \land b)$

$v \to (a \land b)$ : if $v$ then both $a$ and $b$
$= \neg v \lor (a \land b) = (\neg v \lor a) \land (\neg v \lor b)$

$(a \land b) \to v$ : if both $a$ and $b$ then $v$
$= \neg (a \land b) \lor v = \neg a \lor \neg b \lor v$

Result: $(\neg v \lor a) \land (\neg v \lor b) \land (v \lor \neg a \lor \neg b)$ — 3 clauses, 3 variables

Disjunction: $v \leftrightarrow (a \lor b)$

$v \to (a \lor b) = \neg v \lor a \lor b$

$(a \lor b) \to v = (\neg a \lor v) \land (\neg b \lor v)$

Result: $(\neg v \lor a \lor b) \land (v \lor \neg a) \land (v \lor \neg b)$ — 3 clauses, 3 variables

Negation: $v \leftrightarrow \neg a$

$v \to \neg a = \neg v \lor \neg a$

$\neg a \to v = a \lor v$

Result: $(\neg v \lor \neg a) \land (v \lor a)$ — 2 clauses, 2 variables

Implication: $v \leftrightarrow (a \to b)$ — same as $v \leftrightarrow (\neg a \lor b)$ , use disjunction pattern

Worked example

Transform $ϕ := \neg (a \leftrightarrow b) \to (\neg (c \land d) \land e)$

The syntax tree:

  		→  (root)
  	  /   \
  	 ¬	 ∧
  	 |	/ \
  	↔	¬   e
  	   / \   |
  	  a   b  ∧
  		/ \
  	   c   d
  	```
  	
  	 *Step 1: Label non-literal subformulas bottom-up*
  	
  	 Start at the deepest nodes and work toward the root. Each non-literal subformula gets a fresh variable:
  	
  	 Deepest level:
  	 - $v_1$ for $(c \land d)$
  	 - $v_2$ for $(a \leftrightarrow b)$
  	
  	 Next level up:
  	 - $v_3$ for $\neg v_1$ (the negation of $c \land d$)
  	 - $v_4$ for $\neg v_2$ (the negation of $a \leftrightarrow b$)
  	
  	 Next level:
  	 - $v_5$ for $(v_3 \land e)$ (the right subtree of $\to$)
  	
  	 Root:
  	 - $v_6$ for $(v_4 \to v_5)$ (the whole formula)
  	
  	 *Step 2: Write a definition for each label*
  	
  	 Each definition says "this variable equals this subformula":
  	 - $v_1 \leftrightarrow (c \land d)$
  	 - $v_2 \leftrightarrow (a \leftrightarrow b)$
  	 - $v_3 \leftrightarrow \neg v_1$
  	 - $v_4 \leftrightarrow \neg v_2$
  	 - $v_5 \leftrightarrow (v_3 \land e)$
  	 - $v_6 \leftrightarrow (v_4 \to v_5)$
  	
  	 Notice: each definition has at most 3 variables (the label + up to 2 children).
  	
  	 *Step 3: Convert each definition to CNF*
  	
  	 $v_1 \leftrightarrow (c \land d)$ — conjunction pattern:
  	 $(\neg v_1 \lor c) \land (\neg v_1 \lor d) \land (v_1 \lor \neg c \lor \neg d)$
  	
  	 $v_3 \leftrightarrow \neg v_1$ — negation pattern:
  	 $(v_1 \lor v_3) \land (\neg v_1 \lor \neg v_3)$
  	
  	 $v_5 \leftrightarrow (v_3 \land e)$ — conjunction pattern:
  	 $(\neg v_5 \lor v_3) \land (\neg v_5 \lor e) \land (v_5 \lor \neg v_3 \lor \neg e)$
  	
  	 $v_6 \leftrightarrow (v_4 \to v_5)$ — implication is $(\neg v_4 \lor v_5)$, use disjunction pattern:
  	 $(\neg v_6 \lor \neg v_4 \lor v_5) \land (v_6 \lor v_4) \land (v_6 \lor \neg v_5)$
  	
  	 (Similarly for $v_2, v_4$)
  	
  	 *Step 4: AND all clauses together, plus assert the root*
  	
  	 Final CNF: (all clauses from step 3) $\land\; v_6$
  	
  	 The unit clause $(v_6)$ forces the root to be true, which propagates constraints through all the definitions.

Why "equisatisfiable" not "equivalent"

The CNF has extra variables ( $v_{1}, \dots, v_{6}$ ) that don’t exist in the original formula. If the original is satisfiable, we can extend any satisfying assignment to the new variables (just compute what each subformula evaluates to). If the CNF is satisfiable, restricting to the original variables satisfies the original formula. But the formulas aren’t logically equivalent because they have different variable sets.

DPLL Algorithm

DPLL (Davis-Putnam-Logemann-Loveland, 1962) is the foundation of modern SAT solvers. It operates on CNF and combines:

Binary Constraint Propagation (BCP) for deterministic inference
Branching with backtracking for search

Binary Constraint Propagation (BCP)

Given a CNF formula $ϕ$ containing a unit clause $(l)$ (a clause with exactly one literal), $BCP (ϕ, l)$ :

Removes all clauses containing $l$ (they’re satisfied)

Removes $\neg l$ from all remaining clauses (it contributes nothing)

Apply repeatedly until no unit clauses remain.

BCP outcomes:

Empty CNF ( ${}$ = $⊤$ ): formula is satisfiable under current assignments
Empty clause ( $□$ = $⊥$ ): formula is unsatisfiable under current assignments (conflict)
Neither: BCP reaches fixpoint, branching required

BCP example

$ϕ = {(\neg a \lor b \lor \neg c), (a \lor b), (\neg a \lor \neg b), (a)}$

Unit clause $(a)$ : propagate $a$
$ϕ^{'} = BCP (ϕ, a) = {(b \lor \neg c), (\neg b)}$

Unit clause $(\neg b)$ : propagate $\neg b$
$ϕ^{''} = BCP (ϕ^{'}, \neg b) = {(\neg c)}$

Unit clause $(\neg c)$ : propagate $\neg c$
$ϕ^{'''} = BCP (ϕ^{''}, \neg c) = {} = ⊤$

Satisfiable with $a = 1, b = 0, c = 0$ .

DPLL Pseudocode

function DPLL(φ):
	while true:
		φ = BCP(φ)
		if φ == ⊤: return SAT
		if φ == ⊥:
			if stack.empty(): return UNSAT
			(l, φ) = stack.pop()	// backtrack
			φ = φ ∧ l			   // try other branch
		else:
			select literal l in φ
			stack.push(¬l, φ)		// save backtrack point
			φ = φ ∧ l			   // branch on l

The algorithm explores assignments depth-first, using BCP to prune the search space.

Modern SAT Solvers

DPLL alone isn’t enough. State-of-the-art solvers (Lingeling, CaDiCaL, MiniSAT, Glucose) add:

CDCL (Conflict-Driven Clause Learning): when a conflict occurs, analyze it to learn a new clause that prevents similar conflicts, then backjump non-chronologically
Variable selection heuristics: VSIDS and variants prioritize variables involved in recent conflicts
Restart strategies: periodically restart search to escape bad parts of the search tree
Phase saving: remember polarities from previous assignments
Two-watched literals: lazy data structure for efficient BCP
Preprocessing/inprocessing: simplify formula before and during search

DPLL variants extend to other logics: QBF (quantified boolean formulas), SMT (satisfiability modulo theories).

Practice Problems

Exercise: BCP on complex formula

$Φ_{0} = (b \lor \neg c) \land (\neg a \lor c) \land a \land (\neg b \lor \neg c \lor a) \land (b \lor c \lor d) \land (\neg a \lor \neg b \lor \neg c) \land d$

Unit clauses: $(a)$ and $(d)$

$Φ_{1} = BCP (Φ_{0}, a) = (b \lor \neg c) \land (c) \land (\neg b \lor \neg c) \land (b \lor c \lor d) \land (\neg b \lor \neg c) \land d$

$Φ_{2} = BCP (Φ_{1}, d) = (b \lor \neg c) \land (c) \land (\neg b \lor \neg c) \land (\neg b \lor \neg c)$

$Φ_{3} = BCP (Φ_{2}, c) = (b) \land (\neg b) \land (\neg b)$

$Φ_{4} = BCP (Φ_{3}, b) = □ = ⊥$

Conflict detected.

Exercise: BCP leading to UNSAT

$(\neg a \lor b) \land (\neg a \lor \neg b) \land a$

$BCP (\cdot, a) = (b) \land (\neg b)$
$BCP (\cdot, b) = □ = ⊥$

Unsatisfiable.

Exercise: BCP with multiple propagations

$(a \lor b \lor c \lor d) \land (\neg a) \land (\neg c \lor \neg d) \land (a \lor c \lor \neg d) \land (c \lor \neg b) \land \neg c$

$BCP (\cdot, \neg a) = (b \lor c \lor d) \land (\neg c \lor \neg d) \land (c \lor \neg d) \land (c \lor \neg b) \land \neg c$
$BCP (\cdot, \neg c) = (b \lor d) \land (\neg d) \land (\neg d) \land (\neg b)$
$BCP (\cdot, \neg d) = (b) \land (\neg b)$
$BCP (\cdot, b) = □ = ⊥$

Unsatisfiable.

Graph View

JKU - SAT solving

Practical SAT Solving

Why CNF over DNF?

Transformation to CNF

Approach 1: Multiplication

Approach 2: Tseitin Encoding

DPLL Algorithm

DPLL Pseudocode

Modern SAT Solvers

Practice Problems