IRT - Query Processing
Part One
Dr Ali Al-Ibrahim
Reformulation of Query
Manual
• Add or remove search terms
• Change Boolean operators
Automatic
• Remove search terms
• Change weighting of search terms
• Add new search terms
Relevance Feedback:
Results of a Search
[Figure: scatter plot of search results; x marks hits from the search, ∆ marks the search itself.]
Relevance Feedback (Concept)
[Figure: scatter plot showing hits from the original search (x), the search point (∆), and a second set of points (o).]
Theoretically Best Query
[Figure: scatter plot with the optimal query placed among the relevant documents (o) and away from the non-relevant documents (x).]
Rocchio's Modified Query
Query Modification
\[
Q_1 = Q_0 + \frac{1}{n_1} \sum_{i=1}^{n_1} R_i - \frac{1}{n_2} \sum_{i=1}^{n_2} S_i
\]
\[
Q_1 = \alpha Q_0 + \beta \frac{1}{n_1} \sum_{i=1}^{n_1} R_i - \gamma \frac{1}{n_2} \sum_{i=1}^{n_2} S_i
\]
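The two query modification formulas can be sketched in a few lines of Python. This is a minimal illustration, not the lecturer's implementation. It assumes that Q0 is the initial query vector, that the R_i are the vectors of the n1 documents judged relevant, and that the S_i are the vectors of the n2 documents judged non-relevant. The names `centroid` and `modify_query` are hypothetical. With alpha = beta = gamma = 1 the function reduces to the first formula.

```python
from typing import List, Sequence

Vector = List[float]


def centroid(vectors: Sequence[Sequence[float]], dim: int) -> Vector:
    """Mean of the given vectors; the zero vector if none were given."""
    if not vectors:
        return [0.0] * dim
    return [sum(v[k] for v in vectors) / len(vectors) for k in range(dim)]


def modify_query(q0: Sequence[float],
                 relevant: Sequence[Sequence[float]],
                 nonrelevant: Sequence[Sequence[float]],
                 alpha: float = 1.0,
                 beta: float = 1.0,
                 gamma: float = 1.0) -> Vector:
    """Q1 = alpha*Q0 + beta*mean(R_i) - gamma*mean(S_i) (assumed reading)."""
    dim = len(q0)
    r_mean = centroid(relevant, dim)      # (1/n1) * sum of R_i
    s_mean = centroid(nonrelevant, dim)   # (1/n2) * sum of S_i
    return [alpha * q0[k] + beta * r_mean[k] - gamma * s_mean[k]
            for k in range(dim)]


# Toy three-term vocabulary; all vectors are made up for illustration.
q0 = [1.0, 0.0, 0.0]
relevant = [[1.0, 1.0, 0.0], [1.0, 0.0, 0.0]]
nonrelevant = [[0.0, 0.0, 1.0]]

q1_plain = modify_query(q0, relevant, nonrelevant)
q1_weighted = modify_query(q0, relevant, nonrelevant,
                           alpha=1.0, beta=0.5, gamma=0.25)
print(q1_plain)     # [2.0, 0.5, -1.0]
print(q1_weighted)  # [1.5, 0.25, -0.25]
```

The second call shows how smaller beta and gamma values keep the modified query closer to the original one.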
Query Operations
Chapter 5
Part Two
Operations
Introduction
Relevance Feedback
Query Expansion
Term Reweighting
Automatic Local Analysis
Query Expansion using Clustering
Automatic Global Analysis
Query Expansion using Thesaurus
Similarity Thesaurus
Statistical Thesaurus
Query Operations Introduction
Relevance Feedback
• After the initial retrieval results are presented, allow the user to provide feedback on the relevance of one or more of the retrieved documents.
• Use this feedback information to reformulate the query.
• Produce new results based on the reformulated query.
• This allows a more interactive retrieval process.
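The retrieve, judge, reformulate, and re-rank cycle described above can be sketched as follows. This is only an illustration under stated assumptions: documents and queries are plain term-weight vectors, ranking uses cosine similarity, and `reformulate` is a deliberately simple, hypothetical stand-in that adds the vectors of the documents marked relevant to the query. The document vectors and the function names are invented for the example.

```python
import math


def cosine(a, b):
    """Cosine similarity of two equal-length vectors (0.0 if either is zero)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def rank(query, docs):
    """Return document ids sorted by decreasing similarity to the query."""
    return sorted(docs, key=lambda d: cosine(query, docs[d]), reverse=True)


def reformulate(query, docs, relevant_ids):
    """Hypothetical reformulation: add each relevant document vector to the query."""
    new_query = list(query)
    for doc_id in relevant_ids:
        new_query = [q + w for q, w in zip(new_query, docs[doc_id])]
    return new_query


# Made-up collection over a three-term vocabulary.
docs = {
    "Doc1": [1.0, 0.0, 0.0],
    "Doc2": [1.0, 1.0, 0.0],
    "Doc3": [0.0, 1.0, 1.0],
}
query = [1.0, 0.0, 0.0]

first_pass = rank(query, docs)               # initial retrieval
revised = reformulate(query, docs, ["Doc2"])  # user marks Doc2 as relevant
second_pass = rank(revised, docs)            # results for the reformulated query

print(first_pass)   # ['Doc1', 'Doc2', 'Doc3']
print(second_pass)  # ['Doc2', 'Doc1', 'Doc3']
```

After the feedback step, the document the user marked as relevant moves to the top of the ranking.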
Relevance Feedback Architecture
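[Figure: the query string and the document corpus feed the IR system, which returns ranked documents (1. Doc1, 2. Doc2, 3. Doc3, ...). User feedback on these documents (Doc1 ⇓, Doc2 ⇑, Doc3 ⇓) drives query reformulation. The revised query goes back to the IR system, which returns re-ranked documents (1. Doc2, 2. Doc4, 3. Doc5, ...).]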
Query Reformulation
Query Reformulation for VSR
Optimal Query
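Assume that the set of relevant documents C_r is known.
Then the best query, the one that ranks all and only the relevant documents at the top, is:
\[
q_{opt} = \frac{1}{|C_r|} \sum_{\forall d_j \in C_r} d_j - \frac{1}{N - |C_r|} \sum_{\forall d_j \notin C_r} d_j
\]
where N is the total number of documents in the collection and |C_r| is the number of relevant documents.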
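As a worked illustration of the optimal query formula, the sketch below computes it for a tiny made-up collection. It assumes that every document is a term-weight vector, that N is the size of the collection, and that the relevant set is given as a set of document ids. The function name `optimal_query` and the data are hypothetical.

```python
def optimal_query(doc_vectors, relevant_ids):
    """Mean of the relevant vectors minus mean of the non-relevant vectors.

    doc_vectors: dict mapping document id -> term-weight vector
    relevant_ids: set of ids assumed to form the relevant set C_r
    """
    n_docs = len(doc_vectors)  # N in the formula
    relevant = [v for d, v in doc_vectors.items() if d in relevant_ids]
    others = [v for d, v in doc_vectors.items() if d not in relevant_ids]
    if not relevant or not others:
        raise ValueError("need at least one relevant and one non-relevant document")
    dim = len(relevant[0])
    return [
        sum(v[k] for v in relevant) / len(relevant)
        - sum(v[k] for v in others) / (n_docs - len(relevant))
        for k in range(dim)
    ]


# Made-up collection of four documents over two terms.
collection = {
    "d1": [1.0, 0.0],
    "d2": [1.0, 1.0],
    "d3": [0.0, 1.0],
    "d4": [0.0, 3.0],
}
q_opt = optimal_query(collection, {"d1", "d2"})
print(q_opt)  # [1.0, -1.5]
```

The first term, which occurs only in relevant documents, gets a positive weight; the second, which is more frequent in non-relevant documents, gets a negative one.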
Comparison of Methods
Why is Feedback Not Widely Used?
Pseudo Feedback
Pseudo Feedback Results
Local vs. Global Automatic Analysis
Thesaurus
A thesaurus provides information on
synonyms and semantically related
words and phrases.
Example:
physician
syn: ||croaker, doc, doctor, MD, medical,
mediciner, medico, ||sawbones
rel: medic, general practitioner, surgeon,
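A thesaurus entry like the one above can drive a simple form of query expansion. The sketch below is an assumption-laden illustration: the thesaurus is a plain dictionary holding a subset of the synonyms listed for "physician", and the hypothetical function `expand_query` appends the synonyms of each query term while skipping duplicates.

```python
# Toy thesaurus; only a subset of the synonyms from the example entry is used.
THESAURUS = {
    "physician": ["doctor", "MD", "medico"],
}


def expand_query(terms, thesaurus):
    """Return the query terms followed by their synonyms, without duplicates."""
    expanded = []
    seen = set()
    for term in terms:
        # Look up synonyms case-insensitively; unknown terms have none.
        for candidate in [term] + thesaurus.get(term.lower(), []):
            key = candidate.lower()
            if key not in seen:
                seen.add(key)
                expanded.append(candidate)
    return expanded


expanded = expand_query(["physician", "salary"], THESAURUS)
print(expanded)  # ['physician', 'doctor', 'MD', 'medico', 'salary']
```

Terms with no thesaurus entry, such as "salary" here, pass through unchanged.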
Thesaurus-based Query Expansion
Statistical Thesaurus
Query Expansion Based on a Statistical Thesaurus
Automatic Local Analysis
Association Matrix
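\[
\begin{array}{c|ccccc}
       & w_1    & w_2    & w_3    & \cdots & w_n    \\ \hline
w_1    & c_{11} & c_{12} & c_{13} & \cdots & c_{1n} \\
w_2    & c_{21} &        &        &        &        \\
w_3    & c_{31} &        &        &        &        \\
\vdots & \vdots &        &        &        &        \\
w_n    & c_{n1} &        &        &        &
\end{array}
\]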
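The slide does not define the entries c_ij. A common choice, assumed here, is the co-occurrence correlation c_ij = Σ_k f_ik · f_jk, where f_ik is the frequency of term w_i in document k. The sketch below builds such a matrix for a few made-up tokenized documents. The function name `association_matrix` is hypothetical.

```python
from collections import Counter


def association_matrix(docs):
    """Build a term-term matrix with c_ij = sum over documents of f_ik * f_jk.

    docs: list of tokenized documents (lists of terms)
    Returns the sorted vocabulary and the matrix as a list of rows.
    """
    vocab = sorted({term for doc in docs for term in doc})
    counts = [Counter(doc) for doc in docs]  # per-document term frequencies
    matrix = [
        [sum(c[wi] * c[wj] for c in counts) for wj in vocab]
        for wi in vocab
    ]
    return vocab, matrix


# Made-up documents standing in for the document set under analysis.
docs = [
    ["apple", "computer", "apple"],
    ["apple", "fruit"],
    ["computer", "software"],
]
vocab, matrix = association_matrix(docs)
print(vocab)  # ['apple', 'computer', 'fruit', 'software']
for row in matrix:
    print(row)
# [5, 2, 1, 0]
# [2, 2, 0, 1]
# [1, 0, 1, 0]
# [0, 1, 0, 1]
```

The matrix is symmetric, and terms with a high c_ij for a query term are natural candidates for expanding the query.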
Problems with Local Analysis
Automatic Global Analysis
Global vs. Local Analysis
Query Expansion Conclusions
Conclusion