0% found this document useful (0 votes)

30 views34 pages

1004 Theorem Proving 2018

Uploaded by

Karthikayani Devaraj

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

30 views34 pages

1004 Theorem Proving 2018

Uploaded by

Karthikayani Devaraj

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 34

Theorem Proving

Dave Touretzky

Read R&N Ch. 9.5-9.6

Propositional resolution
● Proof by contradiction: to prove α, assume ~α and derive FALSE.
● Sound: only valid inferences are made.
● Complete: if a sentence is valid, the proof will be found.
● Formulas must be in CNF.
○ But conversion to CNF is straightforward.

2
Resolution in First-Order Logic
Resolution is more complicated in FOL due to:

● Functions — can generate infinite models

● Variables — must use unification to match literals

○ Requires standardization of variables (variable renaming) to avoid conflicts

● Quantifiers
○ Existential quantifiers require Skolemization
○ Nesting order of quantifiers matters

● Equality
○ Paramodulation and demodulation
3
○ Equational resolution
Converting an FOL Sentence to CNF
“Everyone who loves all animals is loved by someone.”

∀x [ [ ∀y Animal(y) ⇒ Loves(x,y) ] ⇒ ∃y Loves(y,x) ]

Scoping: y and y are different variables.

4
Step 1: Eliminate implications

∀x [ [ ∀y Animal(y) ⇒ Loves(x,y) ] ⇒ ∃y Loves(y,x) ]

∀x [ [¬ ∀y Animal(y) ⇒ Loves(x,y) ] ∨ ∃y Loves(y,x) ]

∀x [ [ ¬ ∀y ¬ Animal(y) ∨ Loves(x,y) ] ∨ ∃y Loves(y,x) ]

5
Step 2: Move ¬ inward
¬∀xp becomes ∃ x ¬p

¬∃xp becomes ∀x¬p

∀ x [ [ ∃ y ¬(¬Animal(y) ∨ Loves(x,y)) ] ∨ [ ∃y Loves(y,x) ] ]

∀ x [ [ ∃ y ¬¬Animal(y) ∧ ¬Loves(x,y)) ] ∨ [ ∃y Loves(y,x) ] ]

∀ x [ [ ∃ y Animal(y) ∧ ¬Loves(x,y)) ] ∨ [ ∃y Loves(y,x) ] ]

6
Step 3: standardize variables
Rename the second y to z:

∀ x [ [ ∃ y Animal(y) ∧ ¬Loves(x,y)) ] ∨ [ ∃z Loves(z,x) ] ]

7
Step 4: Skolemization
Create a unique constant for each existentially quantified variable.

But the constant is a function of all other variables in its scope, so we must use
Skolem functions to generate these constants.

∀x [ [ Animal(f(x)) ∧ ¬Loves(x, f(x)) ] ∨ Loves(g(x), x) ]

8
Step 5: Drop universal quantifiers

[ Animal(f(x)) ∧ ¬Loves(x, f(x)) ] ∨ Loves(g(x), x)

9
Step 6: Distribute ∨ over ∧

[ Animal(f(x)) ∨ Loves(g(x), x) ] ∧ [ ¬Loves(x, f(x)) ∨ Loves(g(x), x) ]

The sentence is now in CNF. But it’s not very readable.

10
The resolution inference rule

l1 ∨ … ∨ lk , m1 ∨ … ∨ mn
------------------------------------------------------------------------------------------------------------------------------------

SUBST(θ, l1 ∨ … ∨ li-1 ∨ li+1 ∨ … ∨ lk ∨ m1 ∨ … ∨ mj-1 ∨ mj+1 ∨ … ∨ mn )

where UNIFY(li , ¬mj) = θ

11
Applying the binary resolution rule
Unify: [ Animal(f(x)) ∨ Loves(g(x), x) ] with [ ¬Loves(u,v) ∨ ¬Kills(u,v) ]

Use the unifier θ = { u / g(x), v / x } to produce

[ Animal(f(x)) ∨ ¬Kills(g(x), x) ]

Tricky bits:

● For completeness, we must resolve all subsets of literals that are unifiable,
not just pairs of literals.
● Alternative is factoring: replacing two literals by one if they are unifiable.
12
Prove that Colonel West is a criminal

Unified terms are

shown in boldface.

Straight-line
derivation because
this is a Horn
theory. There is a
contradiction but it
doesn’t mention West. 13
Did curiosity kill the cat?
1. Everyone who loves animals is loved by someone.
2. Anyone who kills an animal is loved by no one.
3. Jack loves all animals.
4. Either Jack or Curiosity killed the cat.
5. The cat is named Tuna.
6. Cats are animals.
7. Did Curiosity kill the cat?

14
Logically, did curiosity kill the cat?
1. ∀x [ ∀y Animal(y) ⇒ Loves(x,y) ] ⇒ ∃y Loves(y,x)
2. ∀x [ ∃z Animal(z) ∧ Kills(x,z) ] ⇒ ∀y ¬Loves(y,x)
3. ∀x Animal(x) ⇒ Loves(jack,x)
4. Kills(jack, tuna) ∨ Kills(curiosity, tuna)
5. Cat(tuna)
6. ∀x Cat(x) ⇒ Animal(x)
7. ¬Kills(curiosity, tuna)

15
Convert to CNF
1. Animal(f(x)) ∨ Loves(g(x), x)
¬Loves(x, f(x)) ∨ Loves(g(x), x)
2. ¬Loves(y,x) ∨ ¬Animal(z) ∨ ¬Kills(x,z)
3. ¬Animal(x) ∨ Loves(jack,x)
4. Kills(jack, tuna) ∨ Kills(curiosity, tuna)
5. Cat(tuna)
6. ¬Cat(x) ∨ Animal(x)
7. ¬Kills(curiosity, tuna)

16
Did curiosity kill the cat?

Not a straight-line
derivation because
this is not a Horn
theory. 17
Who killed the cat?
Goal: ∃w Kills(w, tuna)

Negated, in CNF: ¬Kills(w, tuna)

Can unify with both Kills(jack, tuna) or Kills(curiosity, tuna), so we derive a

contradiction without knowing who killed tuna.

18
Who killed the cat?

Kills(jack, tuna) ∨ Kills(curiosity, tuna) ¬Kills(w,tuna)

{ w / curiosity }

Kills(jack, tuna)

{ w / jack }

19
Who killed the cat?
Solutions:

1. Don’t allow the query variable w to be bound more than once in a derivation.
Backtrack on w until we find a value that gives the desired contradiction.
Example: binding w to curiosity leaves us with Kills(jack, tuna), which resolves
with the other clauses to yield a contradiction.

2. Create an “answer literal” to use in the query: ¬Kills(w, tuna) ∧ Answer(w).

When we derive a clause containing only Answer(w) for some w, report that
value as an answer to the query.
20
How to handle equality
Three approaches:

1. Axiomatize

2. Inference rules

3. Extended unification

21
Axiomatizing equality
∀x x = x
∀x,y x=y ⇒ y=x
∀x,y,z x=y ∧ y=z ⇒ x=z This produces correct
equality reasoning, but it
For all predicates P, Q, ...: generates a huge number of
conclusions, most of which
∀x,y x=y ⇒ (P(x) ⇔ P(y)) will not be useful.
∀w,x,y,z w=x ∧ y=z ⇒ (Q(w,y) ⇔ Q(x,z))
...

For all functions f, g, ...:

∀x,y x=y ⇒ (f(x) = f(y))
∀w,x,y,z w=x ∧ y=z ⇒ (g(w,y) = g(x,z))
22
...
Inference rules for equality: demodulation
x=y , m1 ∨ … ∨ mn
-----------------------------------------------------
SUB(SUBST(θ,x), SUBST(θ,y), m1 ∨ … ∨ mn)

where UNIFY(x,z) = θ and z appears somewhere in mi.

SUB(x,y,m) means replace x with y everywhere that x occurs in m.

Example:
father(father(x)) = paternal_grandpa(x)
Birthdate(father(father(bella)), 1926)

Using θ = { x / bella } we can derive:

23
Birthdate(paternal_grandpa(bella), 1926)
Inference rules for equality: paramodulation
l1 ∨ … ∨ lk ∨ x=y , m1 ∨ … ∨ mn
----------------------------------------------------------------------
SUB(SUBST(θ,x), SUBST(θ,y), l1 ∨ … ∨ lk ∨ m1 ∨ … ∨ mn)

Handles non-unit clauses where one of the terms is an equality.

Example: from P(f(x,b), x) ∨ Q(x) and f(a,y)=y ∨ R(y)

We have θ = { x / a, y / b}

We derive: P(b,a) ∨ Q(a) ∨ R(b)

Paramodulation yields a complete inference procedure. 24

Equality via extended unification
The third way to handle equality is to modify the unification algorithm to allow
unification of expressions that are provably equal.

For example, equational unification could allow (1+2) to unify with (2+1) using
the empty substitution.

This approach is used in CLP (Constraint Logic Programming) systems.

25
Resolution strategies
1. Unit preference

2. Set of support

3. Input resolution

4. Subsumption

26
Unit preference strategy
Which clauses should we resolve first?

If we resolve a unit clause with another clause, the result is always a shorter
clause. Since we’re trying to derive a contradiction (empty clause), shorter is
better.

So choose unit clauses first.

Unit resolution requires a unit clause in every step. Incomplete in general, but
complete for Horn theories, where it resembles forward chaining.

27
“Set of support” strategy
Require that every resolution step involve at least one element from a special “set
of support”. New resolvents are added to this step.

Provides a way to focus attention on formulas relevant to the goal. Inference will
be incomplete if the set is not chosen carefully.

If the set of support starts out with just the negation of the query, it generates a
goal-directed proof tree that may be easier for humans to understand.

28
Input resolution strategy
The “input set” consists of the sentences of the KB plus the query.

The input resolution strategy requires every resolution step to include a sentence
from the input set.
● Complete for Horn theories.
● Incomplete in general.

In linear resolution we allow P and Q to be resolved together if either P is in the

original KB or P is an ancestor of Q in the proof tree.
● Linear resolution is complete.

29
Subsumption strategy
Eliminate all sentences that are subsumed by (i.e., are more specific than) a
sentence already in the KB.

If we have P(x) in the KB, don’t add P(a) or P(a) ∨ Q(b).

The goal is to keep the size of the KB small, which reduces the search space.

In HW4 we will explore a version of this idea.

30
Uses of theorem proving
● Prove mathematical theorems
● Design of digital circuits
● Verification of complex hardware, including entire CPUs.
● “Automatic programming”: synthesizing a program based on a formal
specification
○ Not practical for general programs
○ Works in specialized areas such as scientific computing code (e.g., vectorization)
○ “Hand-guided” synthesis has been used successfully for algorithm design

31
Theorem proving at Intel
These slides are based on a presentation by John Harrison of Intel:
https://www.cl.cam.ac.uk/~jrh13/slides/arw-04apr02/slides.pdf

● The 1994 FDIV (floating point division) bug in the Intel Pentium processor
cost the company $500 million.

● Today new products are developed more quickly: less time to find bugs.

32
Increased complexity makes bugs more likely
John Harrison (Intel):

● A 4-fold increase in pre-silicon bugs in Intel processor designs per generation.

● Approximately 8000 bugs introduced during design of the Pentium 4.

● Pre-silicon bug detection rates are now at least 99.7%.

But that still leaves ~ 24 uncaught bugs.

33
Approaches to formal verification of chips
1. Symbolic simulation

2. Temporal logic model checking (see Ed Clark’s Turing Award)

3. General theorem proving

Intel uses a combination of these techniques.

Hybrid theorem prover that includes mathematical knowledge about floating point
representations.

CS103 Midterm 1 Reference Sheet
No ratings yet
CS103 Midterm 1 Reference Sheet
2 pages
23ad1504 Keis Unit 2 Notes
No ratings yet
23ad1504 Keis Unit 2 Notes
21 pages
Unit 1 - Toc
No ratings yet
Unit 1 - Toc
77 pages
AI Unit 3
No ratings yet
AI Unit 3
154 pages
07 Logic
No ratings yet
07 Logic
82 pages
Knowledge Reasoning
No ratings yet
Knowledge Reasoning
40 pages
Fallsem2015 16 Cp3066 Qz01ans PDNF and PCNF
No ratings yet
Fallsem2015 16 Cp3066 Qz01ans PDNF and PCNF
28 pages
Wa0001.
No ratings yet
Wa0001.
37 pages
CS6364 Lecture6 - Ch08 FOL - Rev4
No ratings yet
CS6364 Lecture6 - Ch08 FOL - Rev4
55 pages
Artificial Intelligence PPT-7 - Inference in FOL
No ratings yet
Artificial Intelligence PPT-7 - Inference in FOL
46 pages
Chapt09-Inference in First-Order Logic
No ratings yet
Chapt09-Inference in First-Order Logic
107 pages
CH 09
No ratings yet
CH 09
35 pages
CH 09
No ratings yet
CH 09
37 pages
Inference in First-Order Logic
No ratings yet
Inference in First-Order Logic
34 pages
The Facts:: Limitation of Propositional Logic
No ratings yet
The Facts:: Limitation of Propositional Logic
16 pages
Ai 3,4,5 Vtu nOTES
No ratings yet
Ai 3,4,5 Vtu nOTES
22 pages
Unit-Iii: Propositional Logic (PL)
No ratings yet
Unit-Iii: Propositional Logic (PL)
33 pages
7 - 1-The-Resolution-Refutation-Method - Examples
No ratings yet
7 - 1-The-Resolution-Refutation-Method - Examples
31 pages
Proof by Resolution
No ratings yet
Proof by Resolution
10 pages
Formal Derivation of Some Incompleteness Theorems - Charles Volkstorf 1990 2 19
No ratings yet
Formal Derivation of Some Incompleteness Theorems - Charles Volkstorf 1990 2 19
17 pages
4.7resolution in FOL
No ratings yet
4.7resolution in FOL
35 pages
AI07
No ratings yet
AI07
42 pages
Part1 Logic and Part2 Function
No ratings yet
Part1 Logic and Part2 Function
15 pages
Firstorderlogic JSN
No ratings yet
Firstorderlogic JSN
31 pages
Unit 3 Topic 6 Resolution
No ratings yet
Unit 3 Topic 6 Resolution
16 pages
Knowledge Representation Using Logic
No ratings yet
Knowledge Representation Using Logic
55 pages
Ai Online
No ratings yet
Ai Online
10 pages
Resolution, Frws and BCKWRD Chaining
50% (2)
Resolution, Frws and BCKWRD Chaining
17 pages
Atp BW
No ratings yet
Atp BW
31 pages
Lec13 Fol
No ratings yet
Lec13 Fol
38 pages
Predicate Logic Exercise
No ratings yet
Predicate Logic Exercise
8 pages
Knowledge Representation
No ratings yet
Knowledge Representation
8 pages
Pred Logic
No ratings yet
Pred Logic
86 pages
Bab 1 Logic and Proof
No ratings yet
Bab 1 Logic and Proof
10 pages
Quiz3 Review Sol PDF
No ratings yet
Quiz3 Review Sol PDF
18 pages
ECS 20 Chapter 4, Logic Using Propositional Calculus: P Is False. If P Is False, Then P Is True
No ratings yet
ECS 20 Chapter 4, Logic Using Propositional Calculus: P Is False. If P Is False, Then P Is True
8 pages
Logical
No ratings yet
Logical
58 pages
Unit-3 - Unification Resolution
No ratings yet
Unit-3 - Unification Resolution
23 pages
16 FirstOrderLogic
No ratings yet
16 FirstOrderLogic
79 pages
Resolution Frws and BCKWRD Chaining
No ratings yet
Resolution Frws and BCKWRD Chaining
17 pages
Logic Fol 2
No ratings yet
Logic Fol 2
43 pages
CS 2710, ISSP 2160: Inference in First-Order Logic
No ratings yet
CS 2710, ISSP 2160: Inference in First-Order Logic
45 pages
KRR4 Notes
No ratings yet
KRR4 Notes
7 pages
Game Playing in AI
No ratings yet
Game Playing in AI
30 pages
Sol Logic InCS
No ratings yet
Sol Logic InCS
12 pages
12.2.1 Resolution Principle (1) : - Resolution Refutation Proves A Theorem by
No ratings yet
12.2.1 Resolution Principle (1) : - Resolution Refutation Proves A Theorem by
31 pages
Logic Tutorial
No ratings yet
Logic Tutorial
6 pages
Artificial Intelligence 8. The Resolution Method: Course V231 Department of Computing Imperial College, London Jeremy Gow
No ratings yet
Artificial Intelligence 8. The Resolution Method: Course V231 Department of Computing Imperial College, London Jeremy Gow
30 pages
CMSC 471 Fall 2002: Class #15/16 - Monday, October 21 / Wednesday, October 23
No ratings yet
CMSC 471 Fall 2002: Class #15/16 - Monday, October 21 / Wednesday, October 23
49 pages
Inference in First-Order Logic
No ratings yet
Inference in First-Order Logic
43 pages
First-Order Logic: CS472 - Fall 2007 Thorsten Joachims
No ratings yet
First-Order Logic: CS472 - Fall 2007 Thorsten Joachims
8 pages
15 KB Systems Part3 6up
No ratings yet
15 KB Systems Part3 6up
7 pages
CS 2742 (Logic in Computer Science) : 3 Resolution
No ratings yet
CS 2742 (Logic in Computer Science) : 3 Resolution
3 pages
An Kit Shah
No ratings yet
An Kit Shah
10 pages
CS2742 Midterm Test 1 Study Sheet Propositional Logic
No ratings yet
CS2742 Midterm Test 1 Study Sheet Propositional Logic
3 pages
Ai Chap 5 Soln
No ratings yet
Ai Chap 5 Soln
6 pages

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

1004 Theorem Proving 2018

Uploaded by

1004 Theorem Proving 2018

Uploaded by

Theorem Proving

Read R&N Ch. 9.5-9.6

● Functions — can generate infinite models

● Variables — must use unification to match literals

∀x [ [ ∀y Animal(y) ⇒ Loves(x,y) ] ⇒ ∃y Loves(y,x) ]

Scoping: y and y are different variables.

∀x [ [ ∀y Animal(y) ⇒ Loves(x,y) ] ⇒ ∃y Loves(y,x) ]

∀x [ [¬ ∀y Animal(y) ⇒ Loves(x,y) ] ∨ ∃y Loves(y,x) ]

∀x [ [ ¬ ∀y ¬ Animal(y) ∨ Loves(x,y) ] ∨ ∃y Loves(y,x) ]

¬∃xp becomes ∀x¬p

∀ x [ [ ∃ y ¬(¬Animal(y) ∨ Loves(x,y)) ] ∨ [ ∃y Loves(y,x) ] ]

∀ x [ [ ∃ y ¬¬Animal(y) ∧ ¬Loves(x,y)) ] ∨ [ ∃y Loves(y,x) ] ]

∀ x [ [ ∃ y Animal(y) ∧ ¬Loves(x,y)) ] ∨ [ ∃y Loves(y,x) ] ]

∀ x [ [ ∃ y Animal(y) ∧ ¬Loves(x,y)) ] ∨ [ ∃z Loves(z,x) ] ]

∀x [ [ Animal(f(x)) ∧ ¬Loves(x, f(x)) ] ∨ Loves(g(x), x) ]

[ Animal(f(x)) ∧ ¬Loves(x, f(x)) ] ∨ Loves(g(x), x)

[ Animal(f(x)) ∨ Loves(g(x), x) ] ∧ [ ¬Loves(x, f(x)) ∨ Loves(g(x), x) ]

The sentence is now in CNF. But it’s not very readable.

SUBST(θ, l1 ∨ … ∨ li-1 ∨ li+1 ∨ … ∨ lk ∨ m1 ∨ … ∨ mj-1 ∨ mj+1 ∨ … ∨ mn )

where UNIFY(li , ¬mj) = θ

Use the unifier θ = { u / g(x), v / x } to produce

Unified terms are

Negated, in CNF: ¬Kills(w, tuna)

Can unify with both Kills(jack, tuna) or Kills(curiosity, tuna), so we derive a

Kills(jack, tuna) ∨ Kills(curiosity, tuna) ¬Kills(w,tuna)

2. Create an “answer literal” to use in the query: ¬Kills(w, tuna) ∧ Answer(w).

For all functions f, g, ...:

where UNIFY(x,z) = θ and z appears somewhere in mi.

Using θ = { x / bella } we can derive:

Handles non-unit clauses where one of the terms is an equality.

Example: from P(f(x,b), x) ∨ Q(x) and f(a,y)=y ∨ R(y)

We derive: P(b,a) ∨ Q(a) ∨ R(b)

Paramodulation yields a complete inference procedure. 24

This approach is used in CLP (Constraint Logic Programming) systems.

So choose unit clauses first.

In linear resolution we allow P and Q to be resolved together if either P is in the

If we have P(x) in the KB, don’t add P(a) or P(a) ∨ Q(b).

In HW4 we will explore a version of this idea.

● A 4-fold increase in pre-silicon bugs in Intel processor designs per generation.

● Approximately 8000 bugs introduced during design of the Pentium 4.

● Pre-silicon bug detection rates are now at least 99.7%.

But that still leaves ~ 24 uncaught bugs.

2. Temporal logic model checking (see Ed Clark’s Turing Award)

3. General theorem proving

Intel uses a combination of these techniques.

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.