TOC Theory
1. Symbols
• Definition: A symbol is the smallest indivisible unit from which strings are built, such as a letter, a digit, or any other single character.
• Example: In a binary language, the symbols could be {0, 1}. In a more general language, symbols could include letters (A, B, C, etc.) or any characters.
2. Alphabet
• Definition: An alphabet is a finite, non-empty set of symbols. It is the basic building block of
formal languages. The notation often used is Σ (sigma), which represents the alphabet.
• Example: For example, the alphabet Σ = {a, b} contains two symbols: 'a' and 'b'.
3. Power of an Alphabet
• Definition: The power of an alphabet, written Σ^k, is the set of all strings of length k that can be formed using the symbols of the alphabet. If the alphabet has n symbols, then the number of distinct strings of length k that can be formed from it is n^k.
• Example: If the alphabet is Σ = {a, b}, then for strings of length 2 the possible strings are {aa, ab, ba, bb}. There are 2^2 = 4 strings of length 2.
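As a quick illustration of Σ^k, the following sketch (a Python example assuming a small two-symbol alphabet) enumerates all strings of a given length over Σ = {a, b}:

```python
from itertools import product

def power_of_alphabet(sigma, k):
    """Return the set Sigma^k: all strings of length k over the alphabet sigma."""
    return {"".join(p) for p in product(sigma, repeat=k)}

sigma = {"a", "b"}
print(sorted(power_of_alphabet(sigma, 2)))   # ['aa', 'ab', 'ba', 'bb']
print(len(power_of_alphabet(sigma, 3)))      # 2^3 = 8
```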
4. Strings
• Definition: A string is a finite sequence of symbols from a given alphabet. Strings can be of
any length, including zero (the empty string, usually denoted as ε).
• Example: If Σ = {a, b}, then "abba", "aa", and "b" are all examples of strings formed from this
alphabet.
5. Language
• Definition: A language is a set of strings formed from an alphabet. Languages can be finite or
infinite, and they can be defined by specific rules or patterns. The concept of language is
central to formal language theory.
• Example: For the alphabet Σ = {a, b}, a language L could be defined as L = {a, ab, aab, aaab,
...}, which consists of strings with one or more 'a's followed by zero or more 'b's.
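A minimal membership test for a language like the one above, assuming the reading "one or more 'a's followed by zero or more 'b's" (regular expression a+b*), might look like this sketch:

```python
import re

# Hypothetical helper: membership test for L = { strings matching a+b* }.
L_PATTERN = re.compile(r"a+b*\Z")

def in_language(s: str) -> bool:
    return L_PATTERN.match(s) is not None

for s in ["a", "ab", "aab", "aaab", "ba", ""]:
    print(s or "ε", in_language(s))   # "ba" and the empty string are rejected
```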
Lexical Analyzer
The primary role of the lexical analyzer is to read the input source code and convert it into a sequence of tokens. Here are the key functions and responsibilities of a lexical analyzer (a small scanner sketch follows the list):
1. Input Processing:
o The lexical analyzer reads the source code character by character and organizes the
input for further processing.
2. Token Generation:
o It identifies meaningful sequences of characters (lexemes) and classifies them into
tokens. Tokens are categorized into types such as keywords, identifiers, literals,
operators, and punctuation.
3. Error Detection:
o The lexical analyzer checks for errors in the source code, such as illegal characters or
malformed tokens, and generates appropriate error messages.
4. Symbol Table Management:
o It may maintain a symbol table to keep track of identifiers, their types, and other
attributes that are used during compilation.
5. Output:
o The lexical analyzer outputs a stream of tokens to the parser for further syntactic
analysis, thus acting as an interface between the source code and the parser.
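To make these responsibilities concrete, here is a minimal hand-written scanner sketch (Python, with hypothetical token names) that reads characters, groups them into lexemes, reports illegal characters, and yields a token stream for a parser:

```python
def tokens(source):
    """Very small character-by-character scanner: yields (token_type, lexeme) pairs."""
    i = 0
    while i < len(source):
        ch = source[i]
        if ch.isspace():                      # skip whitespace
            i += 1
        elif ch.isalpha() or ch == "_":       # identifier or keyword lexeme
            j = i
            while j < len(source) and (source[j].isalnum() or source[j] == "_"):
                j += 1
            lexeme = source[i:j]
            kind = "KEYWORD" if lexeme in {"int", "if", "while"} else "IDENTIFIER"
            yield (kind, lexeme)
            i = j
        elif ch.isdigit():                    # integer literal
            j = i
            while j < len(source) and source[j].isdigit():
                j += 1
            yield ("NUMBER", source[i:j])
            i = j
        elif ch in "+-*/=;":                  # single-character operators / punctuation
            yield ("OPERATOR" if ch in "+-*/=" else "PUNCTUATION", ch)
            i += 1
        else:                                 # error detection: illegal character
            raise ValueError(f"illegal character {ch!r} at position {i}")

print(list(tokens("int count = 0;")))
```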
Specification of Token
A token is defined as a categorized string of characters that represents a basic unit of meaning in the
source code. Each token consists of two main components:
1. Token Type:
o This is a category that identifies the class of the token, such as keyword, identifier,
literal, operator, or punctuation.
2. Lexeme:
o This is the actual sequence of characters in the source code that corresponds to the
token. For instance, in the expression int count = 0;, the lexeme for the identifier
token would be count, and for the keyword token, it would be int.
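A token can therefore be modeled as a (token type, lexeme) pair. As a sketch (the type names below are illustrative), the statement int count = 0; would yield the following stream:

```python
from typing import NamedTuple

class Token(NamedTuple):
    type: str     # token type (category)
    lexeme: str   # the actual characters from the source

# Tokens for: int count = 0;
stream = [
    Token("KEYWORD", "int"),
    Token("IDENTIFIER", "count"),
    Token("OPERATOR", "="),
    Token("LITERAL", "0"),
    Token("PUNCTUATION", ";"),
]
```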
Recognition of Token
Token recognition involves several steps, typically implemented using regular expressions and finite
automata. Here’s how it works:
1. Regular Expressions:
o Regular expressions define the patterns for different token types. For example:
▪ Identifiers: [a-zA-Z_][a-zA-Z0-9_]*
▪ Operators: [+\-*/]
▪ Comments: //.*|/\*.*?\*/
2. Finite Automata:
o The regular expressions are converted into a finite automaton (an NFA or DFA)
whose states and transitions recognize the corresponding token patterns.
3. Tokenization Process:
o As the lexical analyzer reads the input, it transitions through states in the finite
automaton based on the input characters. When it reaches an accepting state, it
recognizes the corresponding token and stores it in the output.
4. Error Handling:
o If the lexer encounters an input that does not match any token definition, it raises
an error, indicating that the input is not valid.
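Putting these steps together, a small regex-driven tokenizer (a sketch using Python's re module; the token names and the extra NUMBER and "=" patterns are illustrative additions to the patterns listed above) could be:

```python
import re

# (token type, pattern) pairs based on the regular expressions above.
TOKEN_SPEC = [
    ("COMMENT",    r"//[^\n]*|/\*.*?\*/"),
    ("IDENTIFIER", r"[a-zA-Z_][a-zA-Z0-9_]*"),
    ("NUMBER",     r"\d+"),
    ("OPERATOR",   r"[+\-*/=]"),
    ("SKIP",       r"\s+"),
]
MASTER = re.compile("|".join(f"(?P<{name}>{pat})" for name, pat in TOKEN_SPEC), re.DOTALL)

def tokenize(code):
    pos = 0
    while pos < len(code):
        m = MASTER.match(code, pos)
        if not m:                                   # error handling: no pattern matches
            raise SyntaxError(f"unexpected character {code[pos]!r} at position {pos}")
        if m.lastgroup not in ("SKIP", "COMMENT"):  # discard whitespace and comments
            yield (m.lastgroup, m.group())
        pos = m.end()

print(list(tokenize("x = y + 42 // trailing comment")))
```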
Summary
In summary, the lexical analyzer serves as the first stage of the compilation process, transforming
raw source code into a structured stream of tokens, which are essential for syntactic analysis. By
defining and recognizing tokens through regular expressions and finite automata, the lexical analyzer
efficiently processes the input while also ensuring that errors are detected early in the compilation
process.
String Relations
In the context of formal languages and automata theory, relations on strings refer to the various
ways in which strings can be compared, combined, or manipulated. These relations can help in
defining languages, parsing strings, and constructing automata. Here’s an overview of several key
relations and operations on strings:
1. Equality Relation
• Definition: Two strings s₁ and s₂ are said to be equal if they consist of the same sequence of characters.
2. Substring Relation
• Definition: A string s₁ is a substring of s₂ if s₁ appears as a contiguous sequence of characters within s₂.
• Prefix: A string s₁ is a prefix of s₂ if s₂ can be expressed as s₁s₃, where s₃ is another string (which can be empty).
• Suffix: A string s₁ is a suffix of s₂ if s₂ can be expressed as s₃s₁, where s₃ is another string (which can be empty).
3. Concatenation Relation
• Definition: The concatenation of two strings s₁ and s₂ is the string formed by appending s₂ to the end of s₁.
• Notation: The concatenation is denoted as s₁ · s₂ or simply s₁s₂.
4. Length Relation
• Definition: The length of a string s is the number of characters in it, denoted |s|.
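These basic relations (equality, substring, prefix, suffix, concatenation, length) map directly onto built-in string operations; a small Python sketch:

```python
s1, s2 = "ab", "abba"

print(s1 == "ab")            # equality: same sequence of characters
print(s1 in s2)              # substring: "ab" occurs inside "abba"
print(s2.startswith(s1))     # prefix:  abba = ab + "ba"
print(s2.endswith("ba"))     # suffix:  abba = "ab" + ba
print(s1 + "ba" == s2)       # concatenation: s1 . "ba" gives "abba"
print(len(s2))               # length: |abba| = 4
```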
5. Language Relation
• Definition: A language is a set of strings formed from a specific alphabet. The relation can be
defined based on properties of the strings in the language.
• Example: Let L = { s | s contains an even number of a's }.
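For instance, membership in the language above (an even number of a's) can be checked with a simple count; a sketch:

```python
def in_L(s: str) -> bool:
    """True if s contains an even number of 'a's."""
    return s.count("a") % 2 == 0

print([w for w in ["", "a", "aa", "aba", "baab"] if in_L(w)])  # ['', 'aa', 'aba', 'baab']
```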
6. Homomorphism Relation
• Definition: A homomorphism is a mapping from one alphabet to another that preserves the
structure of the strings.
• Example: If we define a mapping h where h(a) = x and h(b) = y, then h(ab) = xy.
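A homomorphism can be implemented as a symbol-by-symbol substitution extended over concatenation; a sketch using the mapping from the example:

```python
def homomorphism(h, s):
    """Apply a symbol mapping h to each symbol of s and concatenate the images."""
    return "".join(h[c] for c in s)

h = {"a": "x", "b": "y"}
print(homomorphism(h, "ab"))    # xy
print(homomorphism(h, "abba"))  # xyyx
```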
7. Equivalence Relation
• Definition: An equivalence relation on strings is a relation that partitions the set of strings
into equivalence classes. Two strings s₁ and s₂ are equivalent if they satisfy certain conditions.
• Example: In the context of regular languages, two strings x and y are equivalent with respect
to a language L if no string z distinguishes them, that is, xz is in L exactly when yz is in L.
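As a concrete check of this equivalence (assuming the "even number of a's" language from above), two strings are equivalent exactly when no appended suffix separates them, which for that language reduces to having the same parity of a's. A bounded-length sketch of the distinguishability test:

```python
from itertools import product

def in_L(s):                      # L = strings with an even number of 'a's
    return s.count("a") % 2 == 0

def equivalent(x, y, sigma=("a", "b"), max_len=4):
    """Approximate equivalence check: no suffix z (up to max_len) separates x and y."""
    for k in range(max_len + 1):
        for z in map("".join, product(sigma, repeat=k)):
            if in_L(x + z) != in_L(y + z):
                return False
    return True

print(equivalent("aa", "b"))   # True: both contain an even number of 'a's
print(equivalent("a", "ab"))   # True: both contain an odd number of 'a's
print(equivalent("a", "aa"))   # False: the empty suffix already separates them
```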