OSTA WS2024 Tutorial Session 01
OSTA WS2024 Tutorial Session 01
OSTA – WS 2024
Dr. Omer CAYIRLI
Tutorial Session I
1
Exercises
variables
➢ Table 1.7 shows data for eight
phones. The Overall Score, a
measure of the overall quality for
the phone, ranges from 0 to 100.
➢ Voice Quality has possible ratings
of poor, fair, good, very good, and
excellent.
➢ Talk Time is the manufacturer’s
claim of how long the phone can be
used when it is fully charged.
❖ Americans tend to dine out multiple times per week. The number of times a
sample of 20 families dined out last week provides the following data.
6 1 5 3 7 3 0 3 1 3
4 1 2 4 1 0 5 6 3 1
a. Compute the mean and median.
b. Compute the first and third quartiles.
c. Compute the range and interquartile range.
d. Compute the variance and standard deviation.
e. The skewness measure for these data is .34. Comment on the shape of this
distribution. Is it the shape you would expect? Why or why not?
f. Do the data contain outliers?
5
Exercises
❖ The average waiting time for a patient at an El Paso physician’s office is just over 29 minutes,
well above the national average of 21 minutes.
❖ In order to address the issue of long patient wait times, some physician’s offices are using
wait tracking systems to notify patients of expected wait times.
❖ Patients can adjust their arrival times based on this information and spend less time in
waiting rooms.
❖ The data show wait times (minutes) for a sample of patients at offices that do not have an
office tracking system and wait times for a sample of patients at offices with an office
tracking system.
a. What are the mean and median patient wait times for offices with a wait tracking system? What are the mean and
median patient wait times for offices without a wait tracking system?
b. What are the variance and standard deviation of patient wait times for offices with a wait tracking system? What
are the variance and standard deviation of patient wait times for visits to offices without a wait tracking system?
c. Do offices with a wait tracking system have shorter patient wait times than offices without a wait tracking system?
Explain.
d. Considering only offices without a wait tracking system, what is the z-score for the tenth patient in the sample?
e. Considering only offices with a wait tracking system, what is the z-score for the sixth patient in the sample? How
does this z-score compare with the z-score you calculated for part (d)?
f. Based on z-scores, do the data for offices without a wait tracking system contain any outliers? Based on z-scores,
do the data for offices with a wait tracking system contain any outliers?
6
Exercises
a. Use the data in the table above and the relative frequency method to determine probabilities that a randomly
selected line of code will need to be edited for each programmer.
b. If you randomly select a line of code from Liwei, what is the probability that the line of code will require editing?
c. If you randomly select a line of code from Sherae, what is the probability that the line of code will not require
editing?
d. Which programmer has the lowest probability of a randomly selected line of code requiring editing? Which
programmer has the highest probability of a randomly selected line of code requiring editing?
10
Exercises
11
Exercises
❖High school seniors with strong academic records apply to the nation’s most selective colleges in
greater numbers each year. Because the number of slots remains relatively stable, some colleges reject
more early applicants.
❖Suppose that for a recent admissions class, an Ivy League college received 2851 applications for early
admission. Of this group, it admitted 1033 students early, rejected 854 outright, and deferred 964 to the
regular admission pool for further consideration.
❖In the past, this school has admitted 18% of the deferred early admission applicants during the regular
admission process.
❖Counting the students admitted early and the students admitted during the regular admission process,
the total class size was 2375.
❖Let E, R, and D represent the events that a student who applies for early admission is admitted early,
rejected outright, or deferred to the regular admissions pool.
a. Use the data to estimate P(E), P(R), and P(D).
b. Are events E and D mutually exclusive? Find P(E ∩ D).
c. For the 2375 students who were admitted, what is the probability that a randomly selected student was
accepted during early admission?
d. Suppose a student applies for early admission. What is the probability that the student will be admitted for
early admission or be deferred and later admitted during the regular admission process?
12
Exercises
❖To better understand how husbands and wives feel about their finances, Money magazine conducted a
national poll of 1010 married adults age 25 and older with household incomes of $50,000 or more.
❖Consider the following example set of responses to the question, “Who is better at getting deals?”
a. Develop a joint probability table and use it to answer the following questions.
b. Construct the marginal probabilities for Who Is Better (I Am, My Spouse, We Are Equal). Comment.
c. Given that the respondent is a husband, what is the probability that he feels he is better at getting deals than his
wife?
d. Given that the respondent is a wife, what is the probability that she feels she is better at getting deals than her
husband?
e. Given a response “My spouse” is better at getting deals, what is the probability that the response came from a
husband?
f. Given a response “We are equal,” what is the probability that the response came from a husband? What is the
probability that the response came from a wife? 13
Exercises
❖The prior probabilities for events A1 and A2 are P(A1) = .40 and P(A2) = .60. It is
also known that P(A1 ∩ A2) = 0. Suppose P(B ∣ A1) = .20 and P(B ∣ A2) = .05.
a.Are A1 and A2 mutually exclusive? Explain.
b.Compute P(A1 ∩ B) and P(A2 ∩ B).
c.Compute P(B).
d.Apply Bayes’ theorem to compute P(A1 ∣ B) and P(A2 ∣ B).
❖ The prior probabilities for events A1, A2, and A3 are P(A1) = .20, P(A2) = .50, and
P(A3) = .30. The conditional probabilities of event B given A1, A2, and A3 are P(B ∣
A1) = .50, P(B ∣ A2) = .40, and P(B ∣ A3) = .30.
a.Compute P(B ∩ A1), P(B ∩ A2), and P(B ∩ A3).
b.Apply Bayes’ theorem, to compute the posterior probability P(A2 ∣ B).
c.Use the tabular approach to applying Bayes’ theorem to compute P(1 ∣ B), P(A2 ∣ B), and P(A3 ∣ B).
14
Exercises
❖A consulting firm submitted a bid for a large research project. The firm’s management initially felt they
had a 50–50 chance of getting the project. However, the agency to which the bid was submitted
subsequently requested additional information on the bid. Past experience indicates that for 75% of the
successful bids and 40% of the unsuccessful bids the agency requested additional information.
a. What is the prior probability of the bid being successful (that is, prior to the request for additional information)?
b. What is the conditional probability of a request for additional information given that the bid will ultimately be
successful?
c. Compute the posterior probability that the bid will be successful given a request for additional information.
❖A local bank reviewed its credit card policy with the intention of recalling some of its credit cards. In the
past approximately 5% of cardholders defaulted, leaving the bank unable to collect the outstanding
balance. Hence, management established a prior probability of .05 that any particular cardholder will
default. The bank also found that the probability of missing a monthly payment is .20 for customers who
do not default. Of course, the probability of missing a monthly payment for those who default is 1.
a. Given that a customer missed one or more monthly payments, compute the posterior probability that the
customer will default.
b. The bank would like to recall its card if the probability that a customer will default is greater than .20. Should the
bank recall its card if the customer misses a monthly payment? Why or why not?
15
Exercises
18
Statistics
OSTA – WS 2024
Dr. Omer CAYIRLI
Tutorial Session I
19