DOS TAE 1
Application of Distributed Operating System:
Case Study on Google Search Engine
Google is a US-based corporation with its headquarters in Mountain View, California (the
Googleplex), offering Internet search and broader web applications and earning revenue largely
from advertising associated with such services. The name is a play on the word googol, the
number 10^100 (that is, 1 followed by a hundred zeros), emphasizing the sheer scale of information
available on the Internet today.
Google’s mission is to tame this huge body of information: ‘to organize the world’s information
and make it universally accessible and useful’ [www.google.com]. Google was born out of a
research project at Stanford University, with the company launched in 1998. Since then, it has
grown to have a dominant share of the Internet search market, largely due to the effectiveness of
the underlying ranking algorithm used in its search engine (discussed further below).
Significantly, Google has diversified, and as well as providing a search engine is now a major
player in cloud computing. From a distributed systems perspective, Google provides a
fascinating case study with extremely demanding requirements, particularly in terms of
scalability, reliability, performance and openness.
Google’s mission:
‘To organize the world’s information and make it universally accessible and useful’.
The role of the Google search engine is, as for any web search engine, to take a given query and
return an ordered list of the most relevant results that match that query by searching the content
of the Web. The challenges stem from the size of the Web and its rate of change, as well as the
requirement to provide the most relevant results from the perspective of its users.
The underlying search engine consists of a set of services for crawling the Web and indexing and
ranking the discovered pages.
Crawling: The task of the crawler is to locate and retrieve the
contents of the Web and pass the contents onto the indexing subsystem. This is performed by a
software service called Googlebot, which recursively reads a given web page, harvesting all the
links from that web page and then scheduling further crawling operations for the harvested links
(a technique known as deep searching that is highly effective in reaching practically all pages in
the Web). It is important for search engines to be able to report accurately on breaking news or
changing share prices. Googlebot therefore takes note of the change history of web pages and
revisits frequently changing pages with a period roughly proportional to how often the pages
change. With the introduction of Caffeine in 2010 [googleblog.blogspot.com II], Google has
moved from a batch approach to a more continuous process of crawling intended to offer more
freshness in terms of search results.
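The recursive crawling strategy described above can be sketched as a breadth-first traversal of the link graph. The following is a minimal illustration, not Googlebot’s actual implementation: it uses a small in-memory dictionary as a hypothetical stand-in for the Web, whereas a real crawler would fetch pages over HTTP, respect robots.txt, and schedule revisits by change frequency.

```python
from collections import deque

# A toy in-memory "web": page -> list of outgoing links.
# All page names here are hypothetical stand-ins for real URLs.
WEB = {
    "a.com": ["b.com", "c.com"],
    "b.com": ["c.com"],
    "c.com": ["a.com", "d.com"],
    "d.com": [],
}

def crawl(seed):
    """Breadth-first crawl: read a page, harvest all its links,
    and schedule every not-yet-visited link for further crawling."""
    visited = set()
    frontier = deque([seed])
    order = []
    while frontier:
        page = frontier.popleft()
        if page in visited:
            continue
        visited.add(page)
        order.append(page)                # "retrieve the contents"
        for link in WEB.get(page, []):    # harvest the links
            if link not in visited:
                frontier.append(link)     # schedule further crawling
    return order

print(crawl("a.com"))  # reaches every page transitively linked from the seed
```

Starting from a single seed, the traversal reaches practically all pages reachable through links, which is why the technique is effective at covering the Web.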
Indexing: Indexing produces what is known as an inverted index, mapping words appearing in
web pages and other textual web resources (including documents in .pdf, .doc and other formats)
onto the positions where they occur in documents, including the precise position in the document
and other relevant information such as the font size and capitalization (which is used to
determine importance, as will be seen below). The index is also sorted to support efficient
queries for words against locations. As well as maintaining an index of words, the Google search
engine also maintains an index of links, keeping track of which pages link to a given site. This is
used by the PageRank algorithm. The inverted index allows us to discover web pages that include
the search terms ‘distributed’, ‘systems’ and ‘book’ and, by careful analysis, to discover pages
that include all of these terms. For example, the search engine will be able to identify that the
three terms can all be found on amazon.com, www.cdk5.net and indeed many other web sites.
Using the index, it is therefore possible to
narrow down the set of candidate web pages from billions to perhaps tens of thousands,
depending on the level of discrimination in the keywords chosen.
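The structure of an inverted index, and the narrowing-down step it enables, can be sketched as follows. This is a simplified illustration with a hypothetical toy corpus (the document texts are invented); Google’s real index additionally records font size, capitalization and other per-occurrence metadata, and is sorted and distributed across many machines.

```python
# Toy corpus: document id -> text (hypothetical content).
DOCS = {
    "amazon.com": "distributed systems book store",
    "www.cdk5.net": "distributed systems concepts and design book",
    "news.example": "breaking news and share prices",
}

def build_index(docs):
    """Inverted index: word -> {document: [positions where it occurs]}."""
    index = {}
    for doc, text in docs.items():
        for pos, word in enumerate(text.split()):
            index.setdefault(word, {}).setdefault(doc, []).append(pos)
    return index

def query_all(index, words):
    """Candidate pages containing *every* query word: intersect the
    per-word document sets, narrowing billions down to a short list."""
    sets = [set(index.get(w, {})) for w in words]
    return set.intersection(*sets) if sets else set()

index = build_index(DOCS)
print(query_all(index, ["distributed", "systems", "book"]))
```

Note that the index also retains each word’s positions, which is what later lets the ranking stage reason about keyword proximity within a page.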
Ranking: The problem with indexing on its own is that it provides no information about the
relative importance of the web pages containing a particular set of keywords. All modern search
engines therefore place significant emphasis on a system of ranking whereby a higher rank is an
indication of the importance of a page and it is used to ensure that important pages are returned
nearer to the top of the list of results than lower-ranked pages. As mentioned above, much of the
success of Google can be traced back to the effectiveness of its ranking algorithm, PageRank
[Langville and Meyer 2006]. PageRank is inspired by the system of ranking academic papers
based on citation analysis. In the academic world, a paper is viewed as important if it has a lot of
citations by other academics in the field. Similarly, in PageRank, a page will be viewed as
important if it is linked to by a large number of other pages (using the link data mentioned
above). PageRank also goes beyond simple ‘citation’ analysis by looking at the importance of
the sites that contain links to a given page. Ranking in Google also takes a number of other
factors into account, including the proximity of keywords on a page and whether they are in a
large font or are capitalized (based on the information stored in the inverted index).
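The core idea of PageRank, that a page is important if important pages link to it, can be sketched as a power iteration over the link graph. This is a textbook-style simplification with a hypothetical four-page graph and the commonly cited damping factor of 0.85; Google’s production ranking combines PageRank with many other signals, as noted above.

```python
def pagerank(links, damping=0.85, iterations=50):
    """Iterative PageRank: each page shares its current rank among the
    pages it links to, so rank flows toward heavily cited pages, and
    links from high-ranked pages carry more weight than links from
    low-ranked ones."""
    pages = list(links)
    n = len(pages)
    rank = {p: 1.0 / n for p in pages}          # start with equal ranks
    for _ in range(iterations):
        new = {p: (1 - damping) / n for p in pages}
        for p, outs in links.items():
            if outs:
                share = rank[p] / len(outs)      # split rank over outlinks
                for q in outs:
                    new[q] += damping * share
            else:                                # dangling page: spread evenly
                for q in pages:
                    new[q] += damping * rank[p] / n
        rank = new
    return rank

# Toy link graph: page -> pages it links to (hypothetical).
links = {"a": ["b", "c"], "b": ["c"], "c": ["a"], "d": ["c"]}
rank = pagerank(links)
# "c" is linked to by the most pages, so it comes out most important
```

This mirrors the citation analogy in the text: "c" gathers links from three pages and ends with the highest rank, while "d", which nothing links to, ends with the lowest.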
Submitted By:
Nidhi Laddha (8)