UNIT 4 Taxonomies and Ontologies
UNIT 4 Taxonomies and Ontologies
Overview of Taxonomies
What is a Taxonomy?
A taxonomy is a structured classification system used to categorize and organize concepts,
items, or terms into a hierarchical format based on their relationships and shared
characteristics.
It provides a tree-like structure, where:
• Broad categories are at the top (root)
• More specific subcategories or instances are at the lower levels (branches and
leaves)
Definition:
A taxonomy is a controlled vocabulary arranged in a hierarchical structure that helps in
organizing information based on parent-child (is-a) relationships.
Components of a Taxonomy
Component Description
Features of Taxonomies
• Hierarchical: Follows a tree structure (general to specific)
• Simple semantics: Represents “is-a” or “type of” relationships
• Easy to understand and implement
• Useful for filtering, browsing, and indexing
Applications of Taxonomies
• Search Engines – Improve categorization of search results
• Content Management – Organize articles, blogs, and knowledge bases
• E-commerce – Product navigation and filtering
• Healthcare – Classifying diseases and treatments (e.g., ICD codes)
• Semantic Web – Lays foundation for building ontologies
Conclusion
• Taxonomies are essential building blocks in knowledge organization.
• They provide a simple but effective structure to classify and navigate information.
• While limited in semantic depth, they are widely used across industries and form a
foundation for more complex systems like ontologies.
Library
Hierarchical structure with
2 Taxonomy Basic ("is-a") classification,
parent-child relationships.
Product categories
1. Controlled Vocabulary
• Just a list of terms with definitions.
• No hierarchy or semantic relations.
• Ensures consistency in naming.
Example:
Terms: Student, Teacher, Course, Exam
2. Taxonomy
• Adds a hierarchical structure.
• Captures “is-a” relationships.
• Used in information retrieval and navigation.
Example:
Vehicle
├── Car
├── Truck
└── Motorcycle
“Car” is-a Vehicle.
3. Thesaurus
• Builds on taxonomy.
• Includes:
o Synonyms (same meaning)
o Antonyms (opposite meaning)
o Related terms
o Scope notes
Example:
For the term “Car”:
• Synonym: Automobile
• Broader term: Vehicle
• Related term: Engine
Used in search systems to enhance semantic search.
4. Ontology
• Most expressive and machine-understandable.
• Defines:
o Classes (concepts)
o Individuals (instances)
o Properties (relationships)
o Axioms (rules and constraints)
• Supports reasoning, inference, and interoperability.
Conclusion
• The Ontology Spectrum describes the evolution of knowledge structures from basic
term lists to complex ontologies.
• Each level increases in semantic expressiveness and computational capabilities.
• Understanding this spectrum helps in choosing the right tool for knowledge
organization and AI-based decision making.
Component Description
Occurrences Link a topic to its relevant information (e.g., webpages, documents, media)
Define the context in which a topic or association is valid (e.g., language, user
Scopes
group)
Element Example
Limitations
• Not as widely adopted as RDF/OWL in the Semantic Web.
• Complexity in authoring and maintaining large topic maps.
• Requires consistent use of subject identifiers for effective merging.
Conclusion
• Topic Maps are a powerful way to represent and connect knowledge based on
human understanding.
• They provide a semantic network of topics and associations, with links to actual
content.
• Though less popular than RDF/OWL in the Semantic Web, they remain valuable for
information management, navigation, and indexing.
Overview of Ontologies
What is an Ontology?
In computer science and artificial intelligence, an ontology is a formal, explicit specification
of a shared conceptualization of a domain.
In simpler terms, an ontology defines the concepts, relationships, and rules that exist in a
particular area of knowledge and provides a machine-understandable structure.
Component Description
Instances
Actual examples of classes (e.g., "John", "Harry Potter")
(Individuals)
Component Description
Properties
Describe features of classes (e.g., hasName, hasAuthor)
(Attributes)
Ontology Language Used to represent the ontology (e.g., OWL, RDF, RDFS)
Purpose of Ontologies
• To model knowledge in a structured, formal way.
• To allow machines to understand and process relationships and concepts.
• To support reasoning and inference (e.g., deducing new facts).
• To enable semantic interoperability across systems.
Concept Example
Ontology vs Taxonomy
Ontology Languages
Language Description
OWL (Web Ontology Rich language with logic-based features (classes, properties,
Language) restrictions, etc.)
Applications of Ontologies
• Semantic Web – Enabling intelligent search and data linking
• AI & Machine Learning – Representing domain knowledge for reasoning
• Healthcare – E.g., SNOMED CT (medical ontology)
• E-commerce – Structured product information (e.g., Schema.org)
• Enterprise Systems – Knowledge management and interoperability
• Education – Intelligent tutoring systems
Advantages of Ontologies
• Shared understanding of domain knowledge
• Enables semantic search and reasoning
• Promotes data integration and reuse
• Enhances interoperability between systems
• Supports intelligent agents and automation
Challenges
• Creating ontologies can be time-consuming
• Requires domain expertise
• May become complex for large domains
• Consistency and version control can be difficult
Conclusion
• Ontologies are powerful tools for representing, sharing, and reasoning about domain
knowledge in a machine-readable way.
• They go far beyond simple data models by encoding rich semantics, enabling
intelligent systems to make inferences and automate tasks.
• In the Semantic Web, ontologies play a central role by linking data meaningfully
across diverse systems.
Definition
Syntax refers to the rules and formal grammar that define how symbols, words, and
statements are legally combined in a knowledge representation language.
Syntax defines how knowledge looks — how it is written and encoded.
Purpose of Syntax
• Ensures machine-readability of ontologies.
• Defines the correct format for writing statements.
• Enables parsing and validation by software tools.
Examples of Syntax:
2. Structure
Definition
Structure refers to the arrangement of components such as classes, properties, instances,
and how they are interrelated in a hierarchical or graph format.
Structure defines the internal organization of the ontology or knowledge base.
Purpose of Structure
• Helps to organize knowledge logically.
• Defines how concepts relate to each other (e.g., "Student is a Person").
• Provides navigation paths through the knowledge base.
Component Description
Definition
Semantics refers to the meaning of the symbols and structures in the knowledge
representation — what they represent in the real world.
Semantics is about interpreting what the syntax and structure mean.
Purpose of Semantics
• Allows machines to understand and reason with knowledge.
• Supports logical inference (e.g., deducing that if A is a Student, A is also a Person).
• Ensures shared understanding across systems.
Examples of Semantics
• RDF defines that a triple (Subject-Predicate-Object) asserts a fact.
• OWL defines logical semantics: e.g., disjointness, equivalence, domain/range,
restrictions.
Example:
Student ⊆ Person
means: Every Student is a Person.
Semantics enables reasoners to infer facts like:
• If John is a Student → John is also a Person.
4. Pragmatics
Definition
Pragmatics deals with the use and context of knowledge — how, why, and by whom the
information is used.
Pragmatics answers: "What is the goal or intended use?"
Purpose of Pragmatics
• Ensures knowledge is relevant and applicable to the user or system.
• Allows context-awareness (e.g., language, culture, domain).
• Helps define policies, rules, and usage scenarios.
Examples of Pragmatics
Context Use
Example:
The word “Apple” may mean:
• A fruit (in biology context)
• A company (in technology context)
Pragmatics helps resolve this based on the situation.
Conclusion
• These four layers — Syntax, Structure, Semantics, and Pragmatics — form the
foundation of semantic technologies and ontology-based systems.
• They enable machines to store, understand, reason, and act on knowledge
meaningfully and contextually.
• Mastering these aspects is crucial for designing intelligent, interoperable systems in
fields like AI, Semantic Web, and knowledge engineering.
Expressing Ontologies Logically
What Does It Mean?
To express ontologies logically means to define the meaning, constraints, and
relationships within a domain using formal logic — so that machines can understand,
reason, and infer new knowledge automatically.
Concept Description
First-Order Logic Allows quantifiers (e.g., "for all", "there exists") and more
(FOL) expressiveness
3. Disjoint Classes
Male ⊓ Female = ⊥
Meaning: No individual can be both Male and Female
4. Existential Quantification
Professor ⊆ ∃teaches.Course
Meaning: Every professor teaches at least one course
5. Universal Quantification
Parent ⊆ ∀hasChild.Person
Meaning: All children of a parent must be persons
:Student rdfs:subClassOf
owl:subClassOf Student is a subclass of Person
:Person
:Male owl:disjointWith
owl:disjointWith Male and Female are disjoint
:Female
Limitations
• Logic adds complexity to design and understanding
• Trade-off between expressiveness and computational performance
• Some real-world concepts may be hard to express formally
Conclusion
• Expressing ontologies logically transforms static data into actionable, inferable
knowledge
• Using Description Logic and tools like OWL, we can build intelligent systems that
understand, validate, and reason with knowledge
• This logical foundation is essential in AI, Semantic Web, healthcare, law, and other
domains where precision and inference matter
It is the bridge between human intelligence (rich, abstract knowledge) and machine
understanding (structured, logical models).
Definition:
“Knowledge Representation is the study of how to represent knowledge in a symbolic form
that a computer system can utilize to solve problems intelligently.”
— Brachman & Levesque
Feature Meaning
4. Types of Knowledge in KR
Knowledge about
Meta-knowledge "I know that I know this"
knowledge
Common-sense
Everyday world knowledge "Water is wet"
knowledge
1. Semantic Networks
• Graph structure with nodes (concepts) and edges (relations)
• Good for hierarchical and associative relationships
Example:
[Dog] —is a→ [Animal]
[Dog] —has→ [Tail]
2. Frames
• Data structure for representing “stereotyped situations”
• Like objects in OOP: consist of slots and values
Example:
Frame: Car
Slots:
- Wheels: 4
- Fuel: Petrol
4. Logic-Based Representation
• Uses propositional or first-order logic
• Enables powerful, formal reasoning
Example (First-order logic):
∀x (Bird(x) → CanFly(x))
5. Ontologies
• Formal representation of concepts and relationships in a domain
• Machine-readable and logic-based (e.g., OWL, RDF)
Example:
“Professor is a subclass of Person”
“Professor teaches Course”
6. Reasoning in KR Systems
A major advantage of representing knowledge is that machines can reason with it, i.e., draw
new knowledge from existing facts using inference techniques.
Example:
If we know “All humans are mortal” and “Socrates is a human” →
A system can infer “Socrates is mortal”
Field Application
8. Challenges in KR
• Ambiguity and vagueness in natural language
• Scalability – large amounts of knowledge
• Incomplete/inconsistent knowledge
• Complexity of inference
• Context-sensitivity (same word, different meaning in different contexts)
Conclusion
• Knowledge Representation is the foundation of intelligent systems, enabling them
to understand, infer, and act on real-world information.
• It provides the structure and meaning needed to convert raw data into usable
intelligence.
• Using different techniques (semantic networks, rules, ontologies, logic), KR helps
systems reason, learn, and interact intelligently with users.