IR Evaluation
We can evaluate IR systems along several different aspects:
1. Retrieval effectiveness (standard IR evaluation)
• Relevance of search results
2. System quality
a) Indexing speed (e.g., how many documents per hour?)
b) Search speed (search latency as a function of index size)
c) Coverage (document collection size and diversity)
d) Expressiveness of the query language
3. User utility
• User happiness based on relevance, speed, and user interface
• User return rate, user productivity (difficult to measure)
• A/B test: a slight change to a deployed system, visible to a fraction of users
• Differences are evaluated using clickthrough log analysis (see the sketch after this list)
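As a rough sketch of such a clickthrough comparison (the log format, names, and data below are assumptions for illustration, not part of the original notes):

```python
# Sketch: compare clickthrough rates (CTR) of two variants of a deployed system.
# The log is assumed to be an iterable of (variant, clicked) pairs.
from collections import Counter

def clickthrough_rates(log):
    impressions, clicks = Counter(), Counter()
    for variant, clicked in log:
        impressions[variant] += 1          # every log entry is one impression
        clicks[variant] += int(clicked)    # count only entries that were clicked
    return {v: clicks[v] / impressions[v] for v in impressions}

log = [("A", True), ("A", False), ("A", False),
       ("B", True), ("B", True), ("B", False)]
print(clickthrough_rates(log))             # {'A': 0.33..., 'B': 0.66...}
```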
Evaluation Criteria
• Effectiveness
• How “good” are the returned documents?
• Efficiency
• Retrieval time, indexing time, index size.
• Usability
• Learnability, flexibility
Reusable Test Collection
• Collection of documents
• Should be representative.
• Sample of information needs.
• Should be randomized and representative.
• Usually formalized as topic statements.
• Known relevance judgments.
• Assessed by humans.
• Binary judgments make evaluation easier.
Good Effectiveness Measures
• Should capture some aspects of what the user wants.
• The measure should be meaningful.
• Should be easily replicated by other researchers.
• Should be easily comparable.
• Expressed as a single number.
Effectiveness evaluation measures
• Set-based measures
• Rank-based measures
Set-based measures
• The IR system returns a set of retrieved results without ranking.
• There is no fixed number of results per query.
• Suitable for Boolean search.
Precision and recall
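Formally, where “relevant” is the set of relevant documents for a query and “retrieved” is the set the system returns, the standard set-based definitions are:

```latex
\mathrm{Precision} = \frac{|\mathrm{relevant} \cap \mathrm{retrieved}|}{|\mathrm{retrieved}|}
\qquad
\mathrm{Recall} = \frac{|\mathrm{relevant} \cap \mathrm{retrieved}|}{|\mathrm{relevant}|}
```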
Trade-off between R&P
• Precision
• The ability to retrieve top-ranked documents that are mostly relevant.
• Recall
• The ability to retrieve all of the relevant items.
• Retrieving more documents tends to increase recall but lower precision, and vice versa.
F-measure
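The F-measure combines precision and recall into a single number: the weighted harmonic mean of the two, where the balanced form F1 weights them equally:

```latex
F_\beta = \frac{(1 + \beta^2)\, P R}{\beta^2 P + R}
\qquad
F_1 = \frac{2 P R}{P + R}
```

A minimal sketch of these set-based measures in Python, assuming results and relevance judgments are available as sets of document IDs (the function name and example data are illustrative, not from the notes):

```python
def precision_recall_f1(retrieved, relevant):
    """Set-based precision, recall, and balanced F-measure (F1)."""
    hits = len(retrieved & relevant)       # relevant documents actually retrieved
    precision = hits / len(retrieved) if retrieved else 0.0
    recall = hits / len(relevant) if relevant else 0.0
    # Harmonic mean of P and R; defined as 0 when no relevant document is retrieved.
    f1 = 2 * precision * recall / (precision + recall) if hits else 0.0
    return precision, recall, f1

# Example: 3 of the 5 retrieved documents are relevant, out of 6 relevant overall.
p, r, f = precision_recall_f1({1, 2, 3, 4, 5}, {3, 4, 5, 7, 8, 9})
print(f"P={p:.2f}  R={r:.2f}  F1={f:.2f}")   # P=0.60  R=0.50  F1=0.55
```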