Guesstimates
Guesstimates
They are estimates based on a mixture of guesswork and calculation. The process of problem-solving
is fairly simple. Look at the parameters that might affect the given problem and arrive at the estimated
quantity. An example could be finding the number of red lights in Delhi.
General Framework
1. Understand the question (Clarify) – Try and extract the exact information from the interviewer on
what he wants you to calculate. In the previous example, for all you know, the interviewer might be
interested in finding out the total number of red light bulbs in Delhi. Hence, it is imperative to be on the
same page as the interviewer. Always ask first before you attempt any guesstimate.
2. Devise a Logical Approach (Structure) – There is no foolproof way to approach a guesstimate. You
can solve a guesstimate using several approaches – top-down, bottom-up, process mapping, layout
centric, or critical comparison. The trick here is to go with the approach that helps you minimize your
assumptions, a simple rule of thumb while you approach guesstimates.
3. Decide which Approach to take (Analyze) – Once you have devised the approaches, it is time to
decide which approach you want to use. Ideally, think of 2-3 steps ahead of you in the approach you
decide to use and see how it works.
4. Put the numbers and the assumptions (Conclude) – This is the final and the trickiest step. Once you
get this right, you have cracked the guesstimate. However, once you practice enough, you develop a
knack for solving it.
5. Do a sanity check – itis a method to check if the answer from the guesstimate is in the bounds of
reasonableness
Strategies to solve Guesstimate:
1. Pareto Principle (80:20 Rule): The idea behind this strategy is to split what we are calculating in
terms of majority and minority. The basic idea is to calculate the major portion first, sideline the minority,
and compute it later.
2. Process Mapping: It is about deciding whether to use the push or pull approach. The same problem
can be approached from the consumption and production side.
For Example, while finding the amount of chocolate used in India in a day.
We can have 2 approaches for the same.
● Consumption side: Estimating the number of end consumers (to avoid double-counting) and
the units consumed by each consumer.
● Production side: Estimating the amount of cocoa produced, chocolate factories in a city and
extrapolating it by appropriating the share based on the population of the country.
3. Data-centric Approach: Appropriating the right numbers in the structure when it is ready. To find
numbers of an unknown category; you could assume a safe figure by considering a similar category.
4. Layout and then fill the numbers: Prepare a comprehensive exhaustive layout for the guesstimate
and then start filling in the numbers.
5. MECE (Mutually Exclusive and Collectively Exhaustive): Ensure that the buckets do not overlap with
each other and are comprehensive when taken together.
1. Household Approach: The category let's say, Cars are bought as a household purchase and hence
we calculate the number of cars as per the number of households.
2. Population Approach: Example: A category like a pen is bought for individual consumption and is
based on the number of people. Hence, we proceed with the guesstimate about the number of people.
3. Structural Approach: Example: To find the number of aeroplanes landing in India in a single day,
the bottleneck would be the runway as it controls the entire operation.
This approach includes solving the guesstimate with a structure or logical breakdown, which is suitable
for the question. Structure could be top-down, bottom-up, process mapping, layout centric, or critical
comparison. The below mentioned figure describes a top-down structure approach, where you move
from left to right, to estimate the market size.
1. Customer Segment:
- Rural-Urban (Geography-wise)
- Gender Split
- Age Split
- Income Split
- Willingness
For Example: How much sunflower oil is used in India in one year?
Here, a good idea would be to use a Geographic wise filter, as the consumption of sunflower oil is
largely dependent on the region. Typically, we can break the region into 4 categories:
East
West
North
South
Each of these regions can be further broken down into Tier I, II, III & IV regions. Now, we can use this
filter and proceed further by making logical assumptions backed by past experience.
Note that whatever, you have calculated for one region can be used for other regions by using a suitable
multiplying factor.
2. Market Sizing:
- In terms of Volume, or
- In terms of Revenue
Whenever you are tasked with market sizing problems, ensure that you clear with the interviewer what
exactly they are looking at, whether market size in terms of volumes or revenues. For example, the
toothbrush market in India could be X units and Y USD. It is also helpful if you can clear the scope of
calculation (Weekly, Monthly, Quarterly, Yearly, etc.)
3. Type of Product/Service
Usage of household products like TV, refrigerator, cars is completely different from individual products
like toothbrushes, soap, cigarettes, etc. Whenever you are presented with a guesstimate, ensure that
the first step after identifying the target customer is understanding the usage of the product. With
regards to services ensure that you consider different scenarios, for example, to find the number of
movie tickets sold in India every day, you need to consider the occupancy rate, which will be higher on
weekends and for night shows.
5. Types of product:
- Household products
- Individual products
Usage of household products like TV, refrigerator, cars is completely different from individual products
like toothbrushes, soap, cigarettes, etc. Whenever you are presented with a guesstimate, ensure that
the first step after identifying the target customer is understanding the usage of the product. With
regards to services, ensure that you consider different scenarios, for example, to find the number of
movie tickets sold in India every day, you need to consider the occupancy rate, which will be higher on
weekends and for night shows.
6. Services:
- Peak and non-peak hours
- Weekends and Weekdays
Considering these factors provides additional depth to the calculations, which many of us tend to
neglect. These factors are especially important for services (restaurants, movie theatres, malls) where
demand is a function of day and time. To make the calculation more straightforward, you can consider
two factors- peak and non-peak demand.
Final Tips
1. Structured Thinking: The importance of this point can’t be stressed enough – as this lays the
edifice for solving a guesstimate. This is a trait, which is appreciated by the interviewer.
2. Sharp Communication: Remember the 3V’s – Voice, Veracity, and Vocabulary. These traits come
into play while you present your solution to the interviewer.
3. Smart Creativity: Build your own stories for making small assumptions, but be logical.
4. Know some basic facts/ data
5. Assumptions always take buy-in from the interviewer for your assumptions and clearly explain
the rationale for the same.
The above is meant as a simple guideline, good enough for case interviews and not an accurate
representation. As income increases essential spending on Food decreases, while discretionary
spending increases.
You do not need to remember the exact numbers, just remember expenditure on food is on average
around 40%.
Solved Examples
1. Schools in Delhi
Preliminary Questions:
Should I consider both private and public schools?
- Yes
Can we eliminate dummy schools that are used by private coaching centers?
- Yes
Assumptions
- School going age would be 6 to 19 years
- Population of Delhi ~ 2 Cr
- There are 10% large sized schools (Private) 40% medium sized schools (Govt + Private ), and 50%
small sized schools (Govt + MCD)
Overall Strategy
(1) We can estimate the number of students in a school: Number of Grades x Avg Number of sections
x Avg Students per class
(2) Calculate the total number of students in Delhi by age segregation
Therefore - # schools in Delhi = Total number of students in Delhi / Average number of students in a
school in Delhi = 52,00,000 / 1,150 ~ 4500 Schools
Home Task -
Preliminary questions
Can we assume this to be an average McDonald’s as opposed to one in a specific location like an
Airport?
- Yes.
Do we assume that the outlet serves other items like wraps/puffs etc.? We will discount the people
having wraps/puffs instead of burgers.
- Assume that burgers are the only item on the menu apart from fries and coke.
It is very useful to scope out the problem correctly. This has 2 advantages.
(1) You need to identify the constraints you are working under so as to analyze only what is
required by the client and move through the case interview faster.
(2) The interviewer notes that you have thought about various aspects of a case. Although
he/she might not ask you to delve into the nitty-gritties, he/she is assured that if you’re hired
and assigned to a case, you will cover issues exhaustively.
Overall strategy
(1) Restaurant
Strategy
I would like to approach this problem from a supply point of view. I want to add that supply in this context
means not the burgers that can be manufactured but the maximum customers that can be seated in
McDonalds on a given day. I will further try to understand their consumption patterns to arrive at total
burgers sold.
I’m assuming that an average McDonalds has about 50 seats and is open from 10 am to 10pm. The
consumption patterns of burgers are different throughout the day
The total number of burgers = (# hours) x (# people per hour) x (# burgers per person)
Let’s say every person eats at McDonalds for 20 minutes. There are 3 (60min/20min) people occupying
a seat every hour. Also, in my experience since burgers in India are slightly smaller, I will assume 20%
of the people eat 2 burgers and 80% of eat 1 burger. That is an average of
Since the occupancy varies according to time of day, I would like to do the math separately for each
hour. Do you think that works?
- Sure. That seems reasonable.
For simplicity’s sake, let’s take 3 scenarios: 100% occupancy (high traffic), 50% occupancy (medium
traffic), and 25% occupancy (low traffic).
The kind of insight demonstrated by the candidate above is what you want to aim for in
guesstimates. Not only should it be important but something that can easily be incorporated in
your solution.
Total number of burgers sold in a day in a restaurant is 1600 (or ~1500 burgers).
(2) Takeaways
The takeaway counter has a queue during high traffic hours and it will be useful to bring in my own
experience at these counters to estimate the time that each exchange takes.
In my view,
Time taken for every person = 75 s = 1.25 min
I have assumed that the takeaway orders are slightly more than the restaurant orders.
Assuming that the traffic of people is the similar for takeaways as well,
High traffic: 5 hours * 100% traffic * 70 burgers = 350 burgers
Medium traffic: 5 hours * 50% traffic * 70 burgers = 175 burgers
Low traffic: 2 hours * 25% traffic * 70 burgers = 35 burgers
Total number of burgers sold in a day in a restaurant is 560 (or ~600 burgers).
Hence, the total number of burgers sold every day at an average McDonald’s is about 2200 burgers
- Impressive. Is there anything else you would like to add?
I think the bottleneck during high traffic hours is the supply i.e. the manufacturing rate of the burgers
themselves. The kitchen would be running at nearly full capacity during these hours.
Supply
Keeping in mind that the McDonald’s model is a made-to-assemble* one,
Time required to make a burger = (1) Sourcing components + (2) Heating + (3) Assembly + (4) Delivery
*A made-to-assemble model is one where the individual components are ready or ‘made’ and
require only assembling to make the finished product
(*Assuming 2 crate of patties (each having 30) take about 5 minutes to cook.)
Assuming 3 people working in the kitchen during high traffic hours, the outlet produces ~6 burgers per
minute or which is in the same ballpark as the demand. I concede that due to incomplete knowledge
about the industry, I may have made some errors in assumption.
- That is just fine. I like your approach.
Preliminary Questions
There are two types of air-conditioners- window and split. Shall I include both?
- Yes.
In what unit would you want me to estimate the market size- #ACs, INR, etc.?
- Good question. Estimate the market size in terms of tonnage. A 1 ton AC can typically cool a small
room.
Air Conditioners are used for residential cooling and commercial cooling in Mumbai. Residential cooling
involves cooling in home and apartments. Commercial cooling would involve Office Spaces, Malls,
Restaurants, Hospitals, Schools, etc. Shall I calculate the market size including all these?
- For simplicity, just calculate Residential and Office Space cooling. We can neglect the rest.
Overall Strategy
We need to calculate the market size for air-conditioners in tons. The market size would involve the
#ACs required to replace old ones along with the additional demand for new ACs.
Market size in tons = Demand to replace old ACs + Demand due to growth in market size
= (#ACs in tons) / (Avg. life of an AC) + (#ACs in tons) x (Growth rate)
We can take the average life of an AC to be 10 years. Can we take the growth rate in absence of any
other information as the growth in GDP of India which is about 5%?
- Take the growth rate to be 10%.
Alright, what remains to be calculated is the (#ACs in tons) presently in Mumbai. We are looking at
demand in
● Residential Segment
● Office Space Segment
Residential segment
We can take the average family size to be 4 in Mumbai. Though there are families of bigger and smaller
sizes, we will use this simplification. Hence for a population of 20MN we get 5MN families.
The next aspect we need to look at is the affordability of an Air Conditioner. An AC costs roughly INR
25,000 to 40,000 depending on the tons.
A Middle Class (MC) family has an income in the range of 16,000 to 32,000 pm. The average household
income would around 20,000pm. Note that the average is skewed towards the lower limit. This
would probably be the salary of junior most clerks in offices. Given the AC cost is more than a month’s
salary, I do not think that the Middle Class would be able to afford ACs. Obviously the Below Poverty
Line (BPL) and Lower Middle Class (LMC) families would not be able to afford it as well.
Now we need to see the Upper Middle Class (UMC) families. Their income is greater than 32,000 pm.
The average income of this segment would be roughly 50,000 pm or about 6 lakhs per annum. The
average UMC family would live in a 2 BHK home. I think such a family would be able to afford at least
a 1 ton AC.
In this segment there would be families which can even afford up to 3ACs, two 1 ton ACs for each
bedroom and a 2 ton AC for the living room. This is the case for my family, and our average household
income would be around 100,000pm. Based on this we can split the UMC into two segments.
● Income between 32,000pm-100,000pm. Let’s assume this will be 80% of the UMC segment. We will
assume on average a family in this segment owns a 1 ton AC.
● Income greater than 100,000pm. Based on our assumption this will be 20% of the UMC segment.
We will assume the average family in this segment owns 4 tons of ACs
Hence we get,
#Tons in residential segment= 5MN x (10%) x (80%) x (1 ton) + 5MN x (10%) x (20%) x (4 tons)
=400,000 + 400,000 =800,000 tons.
The people working in air-conditioned offices typically would have white collar jobs. Though there will
be cases where white collar professionals do not work under ACs and blue-collar workers have air
conditioning, we will neglect these exceptions. (White collar employees are those which typically work
in offices, they are accountants, bankers, bureaucrats, etc. Blue collar employees would be mechanics,
electricians, manual laborers, etc.)
The people with white collar jobs would be part of the Middle Class and Upper Middle Class in the age
group of 22-60. Assuming an equal spread of the population from 1-80 yrs., roughly 50% of the
population will lie in the 22-60 bracket.
We can assume that all men and half the women would be working.
This gives us a total of 2MN men + 1MN women = 3MN people.
In a typical office, people work in cubicles. Apart from the area of a cubicle, an air-conditioned office
has several common areas such as receptions, washrooms, lunch areas. If we were to divide the total
area of an office with the total number of employees, we can say that an office area roughly the same
as a ‘bedroom’ would be equivalent to two employees. A ‘bedroom’ typically has an area of 200 sq. ft.
Hence each employee is equivalent to 100 sq. ft. area in an office. Also we will require a 1 ton AC to
cool a room of 200 sq. ft.
Hence,
#ACs in tons= (#People working in AC offices) x(Office area per person)x(Tons required per unit area)
= (3MN) x (100 sq. ft.) x (1 ton / 200 sq. ft.) = 1.5MN tons
Combining the office segment with residential segment, we get the total AC tons to be 2.3MN.
Market Size in tons = (#ACs in tons) / (Avg. life of an AC) + (#ACs in tons) x (Growth rate)
= (2.3MN tons) / (10 yrs.) + (2.3MN tons) x (10% per year)
= 460,000 tons / year
- Great. We can close the case here.
Preliminary questions:
Should we consider domestic refrigerators only or should we include commercial refrigerators too?
- For this question, include only domestic refrigerators.
Market Penetration
Poor% 0 0 0
As the question asks the number of refrigerators sold every year, we have to take care of following
points too:
i) People buying refrigerators due to replacement (as their old machine is not working)
ii) People buying refrigerators as their first refrigerator
Refrigerators sold in a year = refrigerators due to replacement + New Demand due to growth
= (Total refrigerators ( i.e. market size) / Avg Life) + (Growth rate * Total machines (i.e. market size))
= (5.17/10)+(5.17*0.05) crore = 0.775 crores = 77.5 lakhs
Preliminary Questions
Overall Strategy
I would like to first estimate how many flight routes Air India operates in. Then we can see what would
be frequency of each route and accordingly find the no. of aircrafts required to service the route. Finally
we can sum up the required aircrafts for each route and get the result.
- Sounds good, go ahead.
We can assume that every metro has 5 tier-two orbital cities, for example there is no direct flight from
Mumbai-Chandigarh. It is routed via Delhi. Hence Chandigarh is an orbital city of Delhi. We will assume
there are direct flights only between a metro and its orbital tier-two cities, not between a metro and
different metro’s orbital tier-two city.
3. Tier-two to Tier-two (direct flights are rare, since they usually routed via Metros and hence
accommodated in previous two types of routes)
There may be some flights which have multiple stopovers. We can consider them to be equivalent to
two flights (if there is one stop over) under the types of routes mentioned above and continue with the
analysis.
If we take a high intensity metro route like Mumbai to Delhi, based on my experience, there is a flight
of Air India departing every hour.
If I assume that a flight leaves from Mumbai at 6am. It reaches Delhi at 8am. Then there is a 1 hour
turnaround time which includes maintenance of the aircraft. Then the flight leaves from Delhi at 9am.
Similarly after reaching Mumbai, the flight is ready to depart at 12pm (Noon).
At 6 am simultaneously a flight departs from Delhi, reaches Mumbai at 8am and after a 1 hr. turnaround
time departs from Mumbai at 9am.
This would mean that we need 3 flights departing from Mumbai at 6am, 7am & 8am and 3 flights
departing from Delhi at 6am, 7am & 8am to achieve a 1 hour flight frequency from 6am to 12pm
(Midnight). Therefore a route like Mumbai-Delhi requires 6 flights.
However all Metro to Metro routes are not this busy. Mumbai-Delhi is the busiest route. If we take
Kolkata to Hyderabad, it is a less busy route and the flight frequency would be more like 1 flight per 3
hours by Air India. This would mean a requirement of only 2 flights to cover the route.
On an average we can assume a requirement of 4 flights per metro route leading to an average
frequency of -a flight every 1.5 hours.
This would mean 4X15=60 aircrafts on the Metro to Metro route.
A flight departing at 6am from Delhi will reach Chandigarh at 7am, would depart from Chandigarh at
8am, would reach Delhi at 9am and would again leave from Delhi at 10am.
Hence with one flight we can achieve a flight frequency of 4 flights per day. Based on my experience
(You can ask the interviewer for his experience, makes the interview conversational) even 4
flights might be too much and Air India may be having just 3 per day to save on operating expenses.
Additionally it's possible that one air-craft might be catering to two low frequency Metro--Tier 2 routes,
however we will neglect that and assume each route will have one dedicated aircraft at least. That leads
to 30x1=30 Aircrafts.
Hence we have a total of 60+30=90 aircrafts of Air India meant for domestic routes in active service.
- Alright, good job.
Preliminary Questions
What kind of taxis, call cabs like Meru and OLA? Or the spot taxis like Black and Yellows?
- Calculate the spot taxis, only, mainly the Black and Yellows.
There is an overlap of demand between call cabs and spot taxis. However, let’s assume it is insignificant
since call cabs are used for longer distances like going to airport etc., and have a more premium income
segment.
- Fair enough.
Overall Strategy
For this let us choose the morning time, since we know most people need to leave home to go to work
or college at that time. This is better than choosing some other time like afternoon/evening where
demand can come for a wide variety of purposes (like going for shopping, theatre) and will make it
difficult to compute. We can compute the
(Total # Taxi fares) in the morning time (7am-12pm) and divide by number of morning hours to arrive
at (Total #Taxi fares per hour) in the morning.
We need to consider the following factors when calculating (Total # Taxi fares) in the morning
● Population
● Income Split
● Age split
● Gender
● Alternatives: Buses, personal cars, bikes, walking, auto-rickshaws, other vehicles
● Region: Auto-Rickshaws are not allowed in South Bombay, so use of taxis is greater there
● No. of passengers in a taxi at a time
● Journey time
Of this target population of 7.5mn, we can now find how many require transportation through autos or
taxis. Later we can compute, how many of them prefer taxis.
This step may be counter-intuitive. Try to directly go ahead with finding taxi demand instead of
(taxi + auto) demand. You would find (as we did) the need to do the income-split analysis, below,
separately for South Bombay and Rest of Mumbai. The approach given does the income-split
analysis together, and then applies a factor for market share of taxis within (taxis + autos). The
lesson here is that in the middle of a case you can pull back, tweak your approach for
simplification purpose, and then resume forward.
We need to now apply a factor, for people only using taxis out of these.
In South Bombay, autos are legally not allowed, so people there will only use taxis. South Bombay
roughly represents 20% of Mumbai’s population. In the rest of Mumbai, most people use auto-rickshaws
since their more easily available and cheaper. Let’s say 20% of people outside South Bombay use taxis
in the morning.
Now, we need to apply a factor for carpooling. This differs by income-segment. In the middle-class
segment on an average two people would use a taxi at a time. Since this is the largest segment of
demand, we will consider the average carpooling number to be 2.
Now, let’s say the demand is scattered over morning 7am-12noon, with peak-hour being 9am-10am
constituting 30% of the demand.
During peak hour of 9am-10am, there is requirement of = (500,000)x(30%) =150,000 taxis per hour
An average fare time during the peak hour would be =Time with passenger + Time without passenger
=40mins+5mins = 45mins
Avg. fares per taxi in a hour = 1 hour / (Average fare time) = 60 mins / 45mins = 1.33
Hence the total # taxis= (Total #Taxi fares per hour) / (Avg. fares per taxi in a hour)
=150,000 / 1.33
~110,000 taxis
Sanity-Check
As a sanity check let’s see what the ratio of taxis and passenger cars is:
We know population of Mumbai is 20MN. This will roughly equal to 5MN households.
We can assume that according to the income segment, the top 20% can afford cars.
This will give us about 1MN passenger cars.
The ratio of taxis to passenger cars would then be ~1:10 which seems to be in the right ball-park based
on experience.
Note the above computation of passenger cars is not how you would do it if you were asked an
entire guesstimate on it. However it is a useful ‘quick and dirty’ method for the sanity check,
here.
1. Underground and On-ground - Since the underground/ground level stations will not have any pillars,
we will assume the number of pillars for them to be zero. The connections to the preceding and
subsequent stations, however, will be taken into account
2. Pillars at station - will have more number of pillars, owing to the extra strength required for the stability
of the station premises, therefore, we will assume 8 pillars for each station above ground level.
3. Turn/ split - The number of pillars increases for every turn/split(like the one after Yamuna Bank) in
the tracks. We will assume the increase on the basis of the length of the track.
4. For every 100m, there is a pillar. Therefore, there are 10 pillars in each km.
Basis the above assumptions, the length of the various Metro Lines, and subsequently, distance
between two stations is assumed to be as follows:
Now calculating th eno of pillars according to the DMRC data and our assumptions:
Now, once the number of pillars are calculated, we can estimate the cost of painting the pillars as well
as the revenue. The following figures have been taken arbitrarily
- Assume the cost of painting each pillar to be Rs 1200.
- Assume the revenue from each advertisement placed to be 2500.
Further specifications could have made the estimation more precise, following could be considered:
1. Assigning a higher revenue to popular areas (Cost of advertising would be higher) and lesser
revenue for those less popular.
2. The above step would lead to the segregation of the metro line’s distance into densely
populated/less populated areas. Alternatively, we may also choose to list the most well known
stations where the advertising cost is bound to be higher.
3. Inculcate the metro lines still in construction and their pillars.
4. We could have also assumed specific dimensions for the diameter and the height of the pillars,
subsequently arriving at its area and multiplying the same with the cost of painting per square unit
area.
8. The area of India
The first thing to come to your mind should be the map of India. Look at it carefully:
Since we need the diagonals for the area, we need to calculate the distance between Northernmost &
the Southernmost points and Easternmost and Westernmost points.
(The best case scenario is that you know these values. If you don’t you have a way out.)
We simply need to find out something that involves speed and time — a mode of transport. Pick from
— road, rail or air. Go ahead with trains.
(They move at about 60–80 km/h and then consider all the unnecessary stoppages. Also, ease of
calculation with 50.)
Time taken to travel by trains from:
Number of cups in the past week = number of cups per day × 7 (for 7 days per week).
The assumption treats each day equally, although calculation would be more precise if we mention that
there are likely fewer cups consumed on the weekend, as people are not in the office and in general
might feel less of a need to drink coffee.
Number of cups per day: here our guess is that of the remaining 60% of people, half drink 2 cups per
day, a quarter drink 4 cups per day, and a quarter drink 1 cup per day.
This averages out to 2 × 0.5 + 4 × 0.25 + 1 × 0.25 = 2.25 cups per coffee drinker per day.
Therefore the calculation is = 60% × 2.25 × 300,000,000 = 405 million cups each day
405 million cups × 7 days per week = 2.84 billion cups per week
(you could round it to approximately 2.8 billion cups)
Note: the interviewer could then ask questions around how many cups are drunk at home or the office
versus bought from a store, or other similar variations. The interviewer might also ask your thoughts on
coffee trends and also how much revenue this would mean, etc.
11. Gmail Queries per second
Preliminary questions:
Okay, I’m going to define a query as a Gmail operation. It could be a read, write or search operation.
First, I’m going to estimate the number of Gmail users. Then, I’ll estimate how often they use the service
and how many operations they perform.
- Go on.
The next number I need is the percentage of people that use Gmail as their primary account:
When I think about my friends, I’d say about 70 % use Gmail as their primary account. I know several
that use other services. Given that my friends are more tech-savvy and American-centric than others,
let’s say worldwide only 20 % use Gmail as their primary account.
That gives us, 0.2 *(7 billion people)* 0.4= 560 million
Finally, to get number of Gmail queries per second, we do the following math:
= (560 million Gmail users) * (44 Gmail operations per week per user) * (1 week per 604,800 sec)
= 40.7k queries per second
For this question, I’d like to start with stating assumptions and then doing my top down calculations.
Calculations:
The number of people looking to buy a new phone each year is
= (315 mn people of US)x(90% people having cell phone)x(50% will be buying a new phone this year)
= 142 million
The number of people that will buy the iPhone each year:
= (142 mn people that will be buying a new phone this year) x (60% will get a smartphone) x (40% of
the smartphone buyers will buy an iPhone)
= 34 million
13. Estimate how much it costs to run Flickr for a 20GB user.
Just a quick clarification, when you say 20GB user, are you saying that the user is using 20GB of
storage, or they bought a plan, which allows them up to 20GB storage?
- The latter.
I assume you want me to calculate costs of actual usage. That is, if they signed up for a 20GB plan,
and they use something less than 20GB. That’s the number you want?
- Yes.
Okay. Let’s say that each Flickr user uploads 10 pictures per week. 1 picture is roughly 5 MB in size.
= 10 * 5 MB = 50 MB each week. =~ 2.5 GB of storage for the year.
But what we really need is the average cost at any given point in the year. So let’s just take the midpoint,
and the average user, in its first year has about 1.25 GB.
From a bandwidth perspective, we usually show the optimized version of the photo. Let’s assume
optimization can reduce file sizes to 40%. So now the user sees, 5*0.4 = 2 MB version.
Assume each picture gets viewed on avg 10 times/ month, so 10x2 MB = 20 MB/mo. gets transferred.
We assumed that the user uploads 10 pictures per week = 520 photos per year.
Let’s take the midpoint again, we get 260 photos. = 260 * 20 MB = 5.2GB per month.
I’m going to evaluate the number of elevators we need based on how many passengers we need to
transport.
To start, I’ll estimate how many people use the elevators and how long they’re willing to wait. In the
elevator industry, it’s assumed that people are willing to wait 20 seconds for an elevator. It’s also
important to know how many people enter the building during peak times. I forgot to ask: is this building
for office, retail, or residential use?
- This is an office building.
If the first part was all about elevator demand, this second part is about elevator
capacity. I’d like to estimate:
➔ How many people can fit an elevator?
➔ What’s the elevator’s average speed?
➔ How long does an elevator stay open before closing and resuming travel?
It may also be important to know whether there are alternative transportation methods such as stairs
or escalators. For the sake of simplicity, we’ll set that aside.
Let’s assume this is a 50 story office building and that each floor contains 120 employees. In total, that’s
6,000 people in a 50 story office building.
People get to work between 8a.m.- 9a.m. The 6,000 people arrive uniformly during this 60 min. duration.
That’s 6000/ 60 = 100 people per minute.
To determine how many elevators are needed to transport 33 people once every 20 seconds, we need
to determine the effective throughput of an elevator.
Let’s assume there are 12 feet per floor and the elevator travels 20 ft /sec.
Also assume that in a 50 story building, the elevator stops 12 times on its way up, and each stop takes
about 20 seconds.
6.75 minutes will have elapsed before those first set of 3.3 elevators make their way back
As already assumed, every minute 100 people use the elevator.
So, in 6.75 minutes = (6.75*100) more people will come
Now for those people = (6.75*100)/10 = 67.5 more elevators will be required
So total 67.5+3.3 = ~ 71 elevators are required
Sanity check:
Seattle’s Columbia Tower has 83 floors, and they have 46 elevators
Comments: Candidate asked some good clarifying questions and demonstrated customer empathy. Nice
approach of evaluating demand first, then capacity. A potential follow-up question is discussing elevator
algorithms to service incoming requests.
Preliminary Questions
Should I also include revenue generated by eatables sold along coffee too?
-Yes, consider eatables and coffee both for your analysis.
Assumptions:
- Since Starbucks is US-based, we consider that it generates 50% of its global revenue from its home
country.
- Operating Hours are 6 AM to 10 PM.
- Average time to make a regular starbucks coffee is 5 minutes
- At maximum efficiency, 3 orders can be processed simultaneously in a normal store
- Average price of a normal coffee in Starbucks is $3
- For takeaways, people only take coffee.
Overall Strategy:
Takeaway Orders
Order density can be segmented as - Peak (100%), Medium (50%) and Low (25%). Peak time for
takeaways are in the morning and evening as the working population grab coffee while commuting.
Density is medium around lunch and low for the rest of the day.
During Peak Time, the store works at maximum efficiency; 3 coffees are prepared in 5 Minutes during
peak time. Therefore, 36 Coffees/hour. Coffees sold in Medium Time - 18/hour Coffees sold in Low
Time - 9/hour
Revenue Generated from Takeaways = No. of orders in a day x Average price of a takeaway order =
270 * 3 ~ 810$
In-Store Orders
For these, the peak time is usually after noon, when most people take breaks from their work and after
office timings.
Total In-Store Orders in a day in a normal store = 36 x 6 + 18 x 5 + 9 x 5 ~ 350 Orders/Day
Average In-Store Order Cost = $5 (including eatables)
Assuming that each Starbucks location caters to 2000 people in that locality,
Total # starbucks stores in USA = 30 crores/2000 = 15,000 Stores