PR Example Sol
PR Example Sol
system retrieves the following ranking: d5, d3, d21, d36, d30, d45, d80, d28, d23, and
d12.
1. What are the precision and recall values at each retrieved document for this ranking?
Recall Precision
d5 0/6 = 0 0/1=0
d3 1/6 = 0.166 1/2 = 0.50
d21 2/6 = 0.33 2/3 = 0.66
d36 2/6 = 0.33 2/4 = 0.50
d30 3/6 = 0.50 3/5 = 0.60
d45 3/6 = 0.50 3/6 = 0.50
d80 3/6 = 0.50 3/7 = 0.42
d28 3/6 = 0.50 3/8 = 0.37
d23 3/6 = 0.50 3/9 = 0.33
d12 4/6 = 0.66 4/10 = 0.40
2. Interpolate the precision scores at 11 recall levels. Note: The interpolated precision at
the j-th standard recall level is the maximum known precision at any recall level between
the j-th and (j + 1)-th level: P( rj ) = max P( r )
r j ≤ r ≤ r j +1
4. What is the R-precision? (precision at first R retrieved documents where R is the total
number of relevant documents)
5. If we have two users, the first user decided that d12, d15, d21 are relevant to the query,
and the second user decided that d3, d12, d15, d21, d22 are relevant to the query, what is
the coverage ratio and the novelty ratio for this user? (Remember that the coverage ratio
is the proportion of relevant items retrieved out of the total relevant documents known to
a user prior to the search. The novelty ratio is the proportion of retrieved items, judged
relevant by the user, of which they were previously unaware.)
Explanation:
User1 knows 3 documents (d12, d15, d21). The system retrieves 4 relevant documents, 2
of them are the ones expected by user1 (d12, d21). Therefore the coverage ratio for user1
is 2 retrieved out of 3 expected. Among the 4 relevant documents retrieved by the system
2 are known to user1 and 2 are new. So the novelty ratio for user1 is 2 out of 4.
For user2, please compute yourself.