0% found this document useful (0 votes)
108 views31 pages

Jaff Seminar

This document discusses search engines. It defines a search engine as a web-based tool that allows users to locate information on the World Wide Web. It describes the key components of search engines including databases of web page references, indexing robots that crawl the web, interfaces for users to submit queries and display results, and information retrieval systems. The document also provides examples of some popular search engines like Google, discusses the architecture and functioning of search engines, and covers some challenges in building search engines and using them effectively.

Uploaded by

Jaffar Rockstar
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
108 views31 pages

Jaff Seminar

This document discusses search engines. It defines a search engine as a web-based tool that allows users to locate information on the World Wide Web. It describes the key components of search engines including databases of web page references, indexing robots that crawl the web, interfaces for users to submit queries and display results, and information retrieval systems. The document also provides examples of some popular search engines like Google, discusses the architecture and functioning of search engines, and covers some challenges in building search engines and using them effectively.

Uploaded by

Jaffar Rockstar
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 31

HKBK COLLEGE OF ENGINEERING

ISE DEPT.

A Seminar On

Under the guidance of By


Dr. SYED MUSTAFA A JAFFAR SADIQ 1HK16IS034
WHAT IS SEARCH ENGINE?
A search engine is a web-based tool that enables users to locate
information on the World Wide Web.
Search engines utilize automated software applications (referred to as
robots, bots, or spiders) that travel along the Web, following links from
page to page, site to site.
The information gathered by the spiders is used to create a searchable
index of the Web.
The Father of the search engine is Jonathon Fletcher.
EXAMPLES OF SEARCH ENGINES:
History of Search Engines
The first tools used for searching in the internet was Archie in
1990 by Alan Emtage.
In 1991, two new search programs Verconica and Jughead.
In september 1993, W3Catalog was the first web primitive
search engines.
COMPONENTS OF SEARCH ENGINES:
÷Four Components:
¯•A database of references to web pages
¯•An indexing robot that crawls the WWW

¯• An interface

° Enables users to submit queries


° Displays results
¯•Information retrieval system
÷ Each is unique, but are mostly the same
1.A Database
● Search-engine databases are optimized for
dealing with data that may be long,
semistructured, or unstructured, and they
typically offer specialized methods such as full-
text search, complex search expressions, and
ranking of search results.
2. Web Crawler
• A Web crawler, sometimes called a spider or spiderbot and often
shortened to crawler, is an Internet bot that systematically browses
the World Wide Web, typically for the purpose of Web indexing
• A web crawler, or spider, is a type of bot that's typically operated by
search engines like Google and Bing. Their purpose is to index the
content of websites all across the Internet so that those websites can
appear in search engine results.
3. Search Engine Interfaces
֥ Gathers input from users
֥ Presents results from the IR system

¯• Often in ranked order


° Input
¯ User requirements
° Search expression, search limits
° Presentation style
° Presentation format , search type
° Output
¯ Results
¯ Descriptions
¯ Clusters
4. Information Retrieval System
• Information retrieval (IR) is the process of obtaining information system
resources that are relevant to an information need from a collection of those
resources.

• An IR system is a software system that provides access to books, journals


and other documents; stores and manages those documents.

• For an internet search engine, data retrieval is a combination of the user-


agent (crawler), the database, and how it's maintained, and the search
algorithm. The users then views and interacts with the query interface
Importance of search engines
• Search engines are important because with over 8
billion web pages available, it would be impossible
to search for the information that is specifically
needed.
• This is why search engines are used to filter the
information that is on the internet and transform it
into results that each individual can easily access
and use within the matter of seconds.
Types of Search Engines:
es:
• There are three main types of search engine tools:
1. Crawler based search engines
2. Search directories or indexes
3. Hybrid search engines
4. Meta search engines
1. CRAWLER BASED SEARCH ENGINES :-
•These types of search engines use a "spider" or a "crawler"
to search the Internet.
•Thecrawler digs through individual web pages, pulls out
keywords and then adds the pages to the search engine's
database.
•Google and Yahoo are examples of crawler search engines.
•Crawler-based search engines are good when you have a
specific search topic.
2. SEARCH DIRECTORIES OR INDEXES:-
• Directories depend on human editors to create their
listings or the database.
• Yahoo Directory, Open Directory and Look Smart are
few examples.
• Human-powered directories are good when you are
interested in a general topic of search
3. HYBRID SEARCH ENGINES : -
Hybrid search engines are search engines that use both
crawler based searches and directory searches to obtain their
results .
Example:- Yahoo.com- Google.com
4. META SEARCH ENGINES:-
• A meta search engine is a tool that helps to locate
Information available via the WWW.
• It provide a single interface that enables users to search many
different search engines, indexes and databases.
• Thus Meta search engines are capable of searching several
search engine databases at once.
Eg…. • Dogpile - http://www.dogpile.com
• SavvySearch - http://www.search.com
• MetaCrawler -http://www.metacrawler.com
Search Engine Architecture
The search engine architecture comprises of the three
basic layers listed below : -
•Content collection and refinement.
•Search core
•User and application interfaces
Search Engine Architecture
HOW TO USE SEARCH ENGINES:
Examples
• +BLACK+BLUE: The search results will contain documents
which contain the word black and the word
blue.
• BLACK-BLUE: Those documents will be returned which
contain the word black but not the word blue.
• “BLACK BLUE”: Those documents will be returned which
include the phrase black blue. (placed together).
• BLACK OR BLUE: Those documents will be returned which
contain the term black or the term blue.
Difficulties of Building a Search
Engine
• Build by Companies and hide the technical detail
• Distributed data
• High percentage of volatile data
• Large volume
• Unstructured and redundant data
• Quality of data
• Heterogeneous data
• Dynamic data
• How to specify a query from the user
• How to interpret the answer provided by the system
User Problems
• Do not exactly understand how to provide a sequence of
words for the search
• Not aware of the input requirement of the search engine.
• Problems understanding Boolean logic, so the users cannot
use advanced search
• Novice users do not know how to start using a search engine
• Do not care about advertisements
• Around 85% of users only look at the first page of the result,
so relevant answers might be skipped
Searching Guidelines
• Specify the words clearly (+, -)
• Use Advanced Search when necessary
• Provide as many particular terms as possible. If looking for a company,
institution, or organization, try: www.name [.com | .edu | .org | .gov |
country code]
• Some searching engine specialize in some areas
• If the user use broad queries, try to use Web directories as starting points
• The user should notice that anyone can publish data on the Web, so
information that they get from search engines might not be accurate.
The Largest Search Engines
GOOGLE ARCHITECTURE
User Interfaces Of a Search Engine
Query Interface
•A box is entered a sequence of words (AltaVista uses union, HotBot
uses intersection)
•Complex query interfaces (e.g. Boolean logic, phrase search, title
search, URL search, date range search, data type search)

Answer Interface
•Relevant pages appear on the top of the list
•Each entry in the list includes a title of the page, an URL, a brief
summary, a size , a date and a written language
Advantages of using Search Engine:
• Easy to acquire different information from different topics
• determine specifically information
• Sorting and adding value to the information

Disadvantages of using Search Engine:


• Too many websites visitors
• Too many websites leads/sales
Conclusion:
• The usefulness of a search engine depends on the relevance of
the results it gives back.
• While there may be millions of Web pages that include a
particular word or phrase, some pages may be more relevant,
popular, or authoritative than others.
REFERENCES:
● https://libguides.astate.edu/c.php?g=14516&p=78177

● https://www.kook.com.au/n/components-of-search-engine-optimisation/

● http://sifaka.cs.uiuc.edu/~wang296/Course/IR_Fall/docs/PDFs/Search
%20Engine%20Architecture.pdf

You might also like

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy