Thursday, February 20, 2014

Unit 7 Reading Note (2/24)

IIR Chapter 9

Query Refinement

--Local: Rocchio algorithm (relevance feedback: the optimal query is the vector difference between the centroids of the relevant and nonrelevant documents); Probabilistic relevance feedback (Naive Bayes probabilistic model); when to work (make query close to document; relevant documents are clustered); Pseudo relevance feedback (assume top k are relevant w/o interaction); indirect relevance feedback (DirectHit).

--Global: vocabulary tools; query expansion (by thesaurus (automatic generated by exploiting word concurrence and grammatical analysis)).

No comments:

Post a Comment