0% found this document useful (0 votes)

257 views43 pages

Normal Forms For Context Free Grammars

1. The document discusses normal forms for Context-Free Grammars (CFGs). 2. It explains that every CFG can be transformed into Chomsky Normal Form or Greibach Normal Form through simplification techniques like eliminating useless symbols, epsilon productions, and unit productions. 3. The goal is to show that every Context-Free Language is generated by a CFG with productions that are either of the form A → BC or A → a, where A, B, C are variables and a is a terminal symbol.

Uploaded by

Prashant Kumar

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

257 views43 pages

Normal Forms For Context Free Grammars

Uploaded by

Prashant Kumar

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 43

Normal forms for ContextFree Grammars

Context-Free Grammar

In linguistics and computer science, a context-free grammar (CFG) is a formal grammar in which every production rule is of the form Vw where V is a non-terminal symbol and w is a string consisting of terminals and/or non-terminals.

The term "context-free" expresses the fact that the non-terminal V can always be replaced by w, regardless of the context in which it occurs. A formal language is context-free if there is a context-free grammar that generates it.

Context-Free Grammar

Context-free grammars are powerful enough to describe the syntax of most programming languages; in fact, the syntax of most programming languages is specified using context-free grammars. On the other hand, context-free grammars are simple enough to allow the construction of efficient parsing algorithms which, for a given string, determine whether and how it can be generated from the grammar.

Context-Free Grammar

Not all formal languages are context-free. A well-known counter example is n bn cn : n >= 0 } {a the set of strings containing some number of a's, followed by the same number of b's and the same number of c's.

Context-Free Grammar

Just as any formal grammar, a context-free grammar G can be defined as a 4-tuple: G = (Vt ,Vn ,P,S) where Vt is a finite set of terminals Vn is a finite set of non-terminals P is a finite set of production rules S is an element of Vn, the distinguished starting non-terminal.

elements of P are of the form

Vn ( Vt U Vn) *

A language L is said to be a Context-Free-Language (CFL) if its grammar is Context-Free. More precisely, it is a language whose words, sentences and phrases are made of symbols and words from a Context-Free-Grammar. Usually, CFL is of the form L=L(G).

Example 1

A simple context-free grammar is given as:

SaSb|

where | is used to separate multiple options for the same non-terminal, and stands for the empty string. This grammar generates the language { an bn : n >= 0 } , which is not regular.

Regular languages

A regular language is a formal language (i.e., a possibly infinite set of finite sequences of symbols from a finite alphabet) that satisfies the following equivalent properties: it can be accepted by a deterministic finite state machine it can be accepted by a nondeterministic finite state machine it can be accepted by an alternating finite automaton it can be described by a regular expression it can be generated by a regular grammar it can be generated by a prefix grammar

Regular languages
The collection of regular languages over an alphabet is defined recursively as follows: the empty language is a regular language. the empty string language { } is a regular language. For each a , the singleton language { a } is a regular language. If A and B are regular languages, then A B (union), A B (concatenation), and A* (Kleene star) are regular languages. No other languages over are regular.

Finite languages
Finite languages are: A specific subset within the class of regular languages is the finite languages - those containing only a finite number of words. These are obviously regular as one can create a regular expression that is the union of every word in the language, and thus are regular.

Example 2

A context-free grammar for the language consisting of all strings over {a,b} which contain a different number of a's to b's is

SU|V U TaU | TaT V TbV | TbT T aTbT | bTaT |

Here, T can generate all strings with the same number of a's as b's, U generates all strings with more a's than b's and V generates all strings with fewer a's than b's.

Example 3

Another example of a context-free language is This is not a regular language, but it is context free as it can be generated by the following context-free grammar:

S b S bb | A AaA|

Normal forms

Every context-free grammar that does not generate the empty string can be transformed into an equivalent one in Chomsky normal form or Greibach normal form. "Equivalent" here means that the two grammars generate the same language. Because of the especially simple form of production rules in Chomsky Normal Form grammars, this normal form has both theoretical and practical implications. For instance, given a context-free grammar, one can use the Chomsky Normal Form to construct a polynomial-time algorithm which decides whether a given string is in the language represented by that grammar or not (the CYK algorithm).

Properties of context-free languages

An alternative and equivalent definition of contextfree languages employs non-deterministic pushdown automata: a language is context-free if and only if it can be accepted by such an automaton. A language can also be modeled as a set of all sequences of terminals which are accepted by the grammar. This model is helpful in understanding set operations on languages. The union and concatenation of two context-free languages is context-free, but the intersection need not be. The reverse of a context-free language is contextfree, but the complement need not be.

Properties of context-free languages

Every regular language is context-free because it can be described by a regular grammar. The intersection of a context-free language and a regular language is always context-free. There exist context-sensitive languages which are not context-free. To prove that a given language is not context-free, one may employ the pumping lemma for contextfree languages. The problem of determining if a context-sensitive grammar describes a context-free language is undecidable.

Normal forms for Context-Free Grammars

The goal is to show that every CFL (without ) is generated by a CFG in which all productions are of the form A BC or A a, where A, B, C are variables, and a is a terminal.

Normal forms for Context-Free Grammars

A number of simplifications is inevitable: The elimination of useless symbols, variables or terminals that do not appear in any derivation of a terminal string from the start symbol. The elimination of -productions, those of the form A for some variable A. The elimination of unit productions, those of the form A B for variables A and B.

Eliminating useless symbols

A symbol X is useful for Grammar G = {V, T, P, S}, if there is some derivation of the form S >* a X b >* w , where w T* X V or X T The sentential form of a X b might be the first or last derivation If X is not useful, then X is useless

Eliminating useless symbols

Characteristics of useful symbols (for instance X): X is generating if X >* w for some terminal string w. Every terminal is generating since w can be that terminal itself, which is derived by 0 steps. X is reachable if there is a derivation S >* a X b for some a and b. A symbol which is useful is surely to be both generating and reachable.

Eliminating useless symbols

Eliminating the symbols which are not generating first followed by eliminating the symbols which are not reachable from the remaining grammar, this will generate a grammar consisting of only useful symbols.

Eliminating useless symbols

Example 7.1 If we have the following grammar:

Eliminating useless symbols

Example 7.1 Notice that a and b generate themselves terminals, S generates a, and A generates b. B is not generating. After eliminating B:

Eliminating useless symbols

Example 7.1 Notice that only S and a are reachable after eliminating the non-generating B. A is not reachable; so it should be eliminated. The result :

This production itself is a grammar that has the same result, which is {a}, as the original grammar.

Computing the generating and reachable symbols

Basis: Every Symbol of T is obviously generating; it generates itself. Induction: If we have a production A a, and every symbol of a is already known to be generating, then A is generating; because it generates all and only generating symbols, even if a = ; since all variables that have as a production body are generating. Theorem: The previous algorithm finds all and only the Generating symbols of G

Computing the generating and reachable symbols

Basis : For a grammar G = {V, T, P, S} S is surely reachable. Induction: If we discovered that some variable A is reachable, then for all productions with A in the head (first part of the expression), all the symbols of the bodies (second part of the expression) of those productions are also reachable. Theorem: The above algorithm finds all and only the Reachable symbols of G

Eliminating useless symbols

So far, the first step, which is the elimination of useless symbols is concluded. Now, for the second part, which is the elimination of -productions.

Eliminating -productions

The strategy is to have the following: if L is CFG, then L {} is also CFG This is done through discovering the nullable variables. A variable for instance A, is nullable if: A >* . Whenever A appears in a production body, A might or might not derive

Eliminating -productions

Basis: If A is a production of G, then A is nullable Induction: If there is a production B C1 C2 Ck such that each C is a variable and each C is nullable, then B is nullable

Eliminating -productions

Theorem: For any grammar G, the only nullable symbols are the variables that derive in previous algorithm Proof: for one step : A must be a production, then this implies that A is discovered as nullable (as in basis). for N > 1 steps: the first step is A C1 C2 Ck , each Ci derives by a sequence < N steps. By the induction, each Ci is discovered by the algorithm to be nullable. So by the inductive step, A is eventually found to be nullable.

Eliminating -productions
If a grammar G1 is constructed by the elimination of -productions using the previous method of grammar G, then

L(G1) = L(G) - {}

Eliminating unit productions

The last part concerns the eliminating of unit productions Any production of the form A B , where A and B are variables, is called a unit production. These production introduce extra steps in the derivations that obviously are not needed in there.

Eliminating unit productions

Basis: (A, A) is a unit pair of any variable A, if A >* A by 0 steps. Induction: Lets (A, B) be a unit pair, and let B C is a production, where A, B, and C are variables, then we can conclude that (A, C) is also a unit pair. Theorem: The previous algorithm (basis and induction) finds exactly all the unit pairs for any grammar G.

Eliminating unit productions

Example 7.12

Eliminating unit productions

Example 7.12

Eliminating unit productions

Example 7.12
After eliminating the unit productions, the generated grammar is:

This grammar has no unit productions and still generates the same expressions as the previous one.

Chomsky Normal Form

Conclusion of all three elimination stages: Theorem: If G is a CFG which generates a language that consists of at least one string along with , then there is another CFG G1 such that: L{G1} = L{G} {} , no -productions, and G1 has neither unit productions nor useless symbols

Chomsky Normal Form

Proof: Start by performing the elimination of -productions. Then perform the elimination of unit productions, so the resulting grammar wont introduce any -productions since the new bodies are still identical to some bodies of the old grammar. Finally, perform the elimination of useless symbols, and since this eliminates productions and symbols, it will never reintroduce any -productions nor unit productions

Chomsky Normal Form

Every nonempty CFL without has grammar G in which all productions are in one of the following forms: A BC , where A, B, and C are variables or A a , where A is a variable and a is a terminal Also G doesnt contain any useless symbols A grammar complying to these forms is called a Chomsky Normal Form (CNF).

Chomsky Normal Form

The construction of CNF is performed through: Arrangement of all bodies of length 2 or more to contain only variables. Breaking bodies of length 3 or more into a cascade productions, where each one has a body consisting of 2 variables.

Chomsky Normal Form

Example 7.15

Chomsky Normal Form

Example 7.15 First: we introduce new variables to represent terminals:

Chomsky Normal Form

Example 7.15 Second: We make all bodies either a single terminal or multiple variables:

Chomsky Normal Form

Example 7.15 Last step: we make all bodies either a single terminal or two variables:

CS 311 Final Notes
No ratings yet
CS 311 Final Notes
3 pages
CourseOutline TheoryofAutomata
No ratings yet
CourseOutline TheoryofAutomata
6 pages
CSC510 Discrete Structures Assignment 1
75% (4)
CSC510 Discrete Structures Assignment 1
7 pages
MITWPU - Unit 3-Theory of Computation
No ratings yet
MITWPU - Unit 3-Theory of Computation
72 pages
Higher Order Metametaphysics
No ratings yet
Higher Order Metametaphysics
32 pages
09 CFL
100% (1)
09 CFL
62 pages
Flat 1
No ratings yet
Flat 1
16 pages
Unit 3-FLAT
No ratings yet
Unit 3-FLAT
80 pages
Unit-4 Context Free Grammar
No ratings yet
Unit-4 Context Free Grammar
106 pages
Cs606 Collection of Old Papers
0% (2)
Cs606 Collection of Old Papers
18 pages
Chapter 3 Lexical Analyser
No ratings yet
Chapter 3 Lexical Analyser
29 pages
Schuller39s Geometric Anatomy of Theoretical Physics Lectures 1 17 PDF
100% (1)
Schuller39s Geometric Anatomy of Theoretical Physics Lectures 1 17 PDF
114 pages
Module 4 Notes
No ratings yet
Module 4 Notes
19 pages
18 Context-Free Grammars
No ratings yet
18 Context-Free Grammars
13 pages
Lec 8
No ratings yet
Lec 8
91 pages
Unit 3-Theory of Computation
No ratings yet
Unit 3-Theory of Computation
77 pages
17 Context Free Languages With Examples
No ratings yet
17 Context Free Languages With Examples
74 pages
Propositions and Truth Table
No ratings yet
Propositions and Truth Table
21 pages
Unit III Regular Grammar
No ratings yet
Unit III Regular Grammar
54 pages
UNIT-3: 08/02/23 UNIT 2 - Context-Free Grammar 1
No ratings yet
UNIT-3: 08/02/23 UNIT 2 - Context-Free Grammar 1
69 pages
Unit 3 CFG
No ratings yet
Unit 3 CFG
65 pages
Compact L5 Final 1
No ratings yet
Compact L5 Final 1
7 pages
CD Lab Manual
No ratings yet
CD Lab Manual
7 pages
Exercise 1.3 Logic BONIFACIO, LLOVIT, TE
No ratings yet
Exercise 1.3 Logic BONIFACIO, LLOVIT, TE
2 pages
BITS-Pilani 1 Semester 2022-23 MATH F213 (Discrete Mathematics)
No ratings yet
BITS-Pilani 1 Semester 2022-23 MATH F213 (Discrete Mathematics)
25 pages
CH5 Simplification of Context-Free Grammars and Normal Forms
No ratings yet
CH5 Simplification of Context-Free Grammars and Normal Forms
53 pages
Btech Cs 4 Sem Theory of Automata and Formal Language Ncs402 2019
No ratings yet
Btech Cs 4 Sem Theory of Automata and Formal Language Ncs402 2019
2 pages
Chomsky Hierarchy1 1
No ratings yet
Chomsky Hierarchy1 1
21 pages
Grammar
No ratings yet
Grammar
131 pages
Tafal Unit-2,3,4 Theory Important Questions
No ratings yet
Tafal Unit-2,3,4 Theory Important Questions
19 pages
ATC Module 3
No ratings yet
ATC Module 3
38 pages
Atcd - 21CS51 - M3
No ratings yet
Atcd - 21CS51 - M3
36 pages
Tut 1
No ratings yet
Tut 1
1 page
Theory of Programming Languages: An Overview Lecture # 1
No ratings yet
Theory of Programming Languages: An Overview Lecture # 1
38 pages
Week 1 - Review of Discrete Structure 1 - Presentation - PDF - 2
No ratings yet
Week 1 - Review of Discrete Structure 1 - Presentation - PDF - 2
33 pages
Homework and Exams
No ratings yet
Homework and Exams
8 pages
01 Task Performance 1/prelim Exam - ARG
No ratings yet
01 Task Performance 1/prelim Exam - ARG
2 pages
Compiler Design SUBJECT CODE: 203105351: Prof. Kapil Raghuwanshi
No ratings yet
Compiler Design SUBJECT CODE: 203105351: Prof. Kapil Raghuwanshi
66 pages
Structures and Enumerations
No ratings yet
Structures and Enumerations
11 pages
Grammar and Language: Grammar: It Is System That Specifies
No ratings yet
Grammar and Language: Grammar: It Is System That Specifies
40 pages
Module 1 DSGT
No ratings yet
Module 1 DSGT
87 pages
Constructing Formal Proof Using The 19 Rules of Inference
No ratings yet
Constructing Formal Proof Using The 19 Rules of Inference
16 pages
Artificial Intelligence: Lecture 2: First Order Logic 2 Solve Problems
No ratings yet
Artificial Intelligence: Lecture 2: First Order Logic 2 Solve Problems
97 pages
CH 4 - Context Free Languages Amd Grammars
No ratings yet
CH 4 - Context Free Languages Amd Grammars
86 pages
Unit 2
No ratings yet
Unit 2
86 pages
Unit 3 TOC
No ratings yet
Unit 3 TOC
80 pages
Grammar
No ratings yet
Grammar
8 pages
UNIT-2 TOc by Krishnendu
No ratings yet
UNIT-2 TOc by Krishnendu
44 pages
Normal Forms
No ratings yet
Normal Forms
76 pages
TOC UNIT 3 Dbatu Book
No ratings yet
TOC UNIT 3 Dbatu Book
22 pages
KnowledgeRepresentation PDF
No ratings yet
KnowledgeRepresentation PDF
8 pages
TOC-DEC-19 StrangeR
No ratings yet
TOC-DEC-19 StrangeR
8 pages
Atc Module 3 Notes
No ratings yet
Atc Module 3 Notes
38 pages
Unit-3 Part Ii
No ratings yet
Unit-3 Part Ii
13 pages
Notes CFG
No ratings yet
Notes CFG
25 pages
Grammar
No ratings yet
Grammar
31 pages
Fopl
No ratings yet
Fopl
29 pages
Algebra Notes
No ratings yet
Algebra Notes
56 pages
Lesson 6 3rd Release
No ratings yet
Lesson 6 3rd Release
15 pages
Chapter 4 and 5
No ratings yet
Chapter 4 and 5
71 pages
Chapter 4 and 5
100% (1)
Chapter 4 and 5
71 pages
Unit Iv Context Free Languages
No ratings yet
Unit Iv Context Free Languages
74 pages
Toc 3
No ratings yet
Toc 3
65 pages
"Context-Free Grammar" From John Martin (3 Edition)
No ratings yet
"Context-Free Grammar" From John Martin (3 Edition)
40 pages
Context Free Grammars: Bachelor of Technology Computer Science and Engineering
No ratings yet
Context Free Grammars: Bachelor of Technology Computer Science and Engineering
10 pages
Unit-3 Part Ii
No ratings yet
Unit-3 Part Ii
15 pages
Chapter 2 Chapter 2 Mathematics As A Language - Lesson 2 Elementary Logic
No ratings yet
Chapter 2 Chapter 2 Mathematics As A Language - Lesson 2 Elementary Logic
11 pages
Motivation For Formal Grammars
No ratings yet
Motivation For Formal Grammars
15 pages
Context Free Grammar
No ratings yet
Context Free Grammar
5 pages
FL&T Unit 3 - 1 - 1724732026415
No ratings yet
FL&T Unit 3 - 1 - 1724732026415
17 pages
Schemes Philo
No ratings yet
Schemes Philo
29 pages
Inference in First-Order Logic: FOL Inference Rules For Quantifier
No ratings yet
Inference in First-Order Logic: FOL Inference Rules For Quantifier
4 pages
Normal Forms For Context Free Grammars
No ratings yet
Normal Forms For Context Free Grammars
54 pages
Lec 3
No ratings yet
Lec 3
76 pages
Context-Free Grammars Example Grammar: Arithmetic Expressions
No ratings yet
Context-Free Grammars Example Grammar: Arithmetic Expressions
4 pages
Theory of Computation: Lecture 7: Context-Free Grammar
No ratings yet
Theory of Computation: Lecture 7: Context-Free Grammar
21 pages
Lect 11
No ratings yet
Lect 11
7 pages
Normal Forms For Context Free Grammars
No ratings yet
Normal Forms For Context Free Grammars
43 pages
Chapter Three
No ratings yet
Chapter Three
110 pages
Types of Grammars:: Grammar
No ratings yet
Types of Grammars:: Grammar
10 pages
Normal Forms: CS154 Chris Pollett Mar 12, 2007
No ratings yet
Normal Forms: CS154 Chris Pollett Mar 12, 2007
8 pages
2 Contex Free Language
No ratings yet
2 Contex Free Language
13 pages
Discrete Mathematics For Computer Science
100% (1)
Discrete Mathematics For Computer Science
92 pages
Context Free Grammars: Unit - Iii
No ratings yet
Context Free Grammars: Unit - Iii
17 pages
FALLSEM2020-21 CSE2002 TH VL2020210106983 Reference Material III 27-Aug-2020 CFGPDANOTES PDF
100% (1)
FALLSEM2020-21 CSE2002 TH VL2020210106983 Reference Material III 27-Aug-2020 CFGPDANOTES PDF
79 pages
TOC Unit 4 PDF
100% (1)
TOC Unit 4 PDF
23 pages
CS6503 Theory of Computations Unit 2
67% (3)
CS6503 Theory of Computations Unit 2
47 pages
2.1 Context-Free Grammars
No ratings yet
2.1 Context-Free Grammars
42 pages
Creating Melodies
From Everand
Creating Melodies
Stefan Hollos
No ratings yet
Introduction to Formal Languages
From Everand
Introduction to Formal Languages
György E. Révész
2/5 (1)

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

Normal Forms For Context Free Grammars

Uploaded by

Normal Forms For Context Free Grammars

Uploaded by

Normal forms for ContextFree Grammars

elements of P are of the form

A simple context-free grammar is given as:

SU|V U TaU | TaT V TbV | TbT T aTbT | bTaT |

Properties of context-free languages

Properties of context-free languages

Normal forms for Context-Free Grammars

Normal forms for Context-Free Grammars

Eliminating useless symbols

Eliminating useless symbols

Eliminating useless symbols

Eliminating useless symbols

Example 7.1 If we have the following grammar:

Eliminating useless symbols

Eliminating useless symbols

Computing the generating and reachable symbols

Computing the generating and reachable symbols

Eliminating useless symbols

Eliminating unit productions

Eliminating unit productions

Eliminating unit productions

Eliminating unit productions

Eliminating unit productions

Chomsky Normal Form

Chomsky Normal Form

Chomsky Normal Form

Chomsky Normal Form

Chomsky Normal Form

Chomsky Normal Form

Example 7.15 First: we introduce new variables to represent terminals:

Chomsky Normal Form

Chomsky Normal Form

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.