TY - JOUR
T1 - Using Semantic Context to Rank the Results of Keyword Search
AU - Butavicius, Marcus A.
AU - Parsons, Kathryn M.
AU - McCormac, Agata
AU - Dennis, Simon J.
AU - Ceglar, Aaron
AU - Weber, Derek
AU - Ferguson, Lael
AU - Treharne, Kenneth
AU - Leibbrandt, Richard
AU - Powers, David
PY - 2019
Y1 - 2019
N2 - In an empirical user study, we assessed two approaches to ranking the results from a keyword search using semantic contextual match based on Latent Semantic Analysis. These techniques involved searches initiated from words found in a seed document within a corpus. The first approach used the sentence around the search query in the document as context while the second used the entire document. With a corpus of 20,000 documents and a small proportion of relevant documents (<0.1%), both techniques outperformed a conventional keyword search on a recall-based information retrieval (IR) task. These context-based techniques were associated with a reduction in the number of searches conducted, an increase in users’ precision and, to a lesser extent, an increase in recall. This improvement was strongest when the ranking was based on document, rather than sentence, context. Individuals were more effective on the IR task when the lists returned by the techniques were ranked better. User performance on the task also correlated with achievement on a generalized IQ test but not on a linguistic ability test.
AB - In an empirical user study, we assessed two approaches to ranking the results from a keyword search using semantic contextual match based on Latent Semantic Analysis. These techniques involved searches initiated from words found in a seed document within a corpus. The first approach used the sentence around the search query in the document as context while the second used the entire document. With a corpus of 20,000 documents and a small proportion of relevant documents (<0.1%), both techniques outperformed a conventional keyword search on a recall-based information retrieval (IR) task. These context-based techniques were associated with a reduction in the number of searches conducted, an increase in users’ precision and, to a lesser extent, an increase in recall. This improvement was strongest when the ranking was based on document, rather than sentence, context. Individuals were more effective on the IR task when the lists returned by the techniques were ranked better. User performance on the task also correlated with achievement on a generalized IQ test but not on a linguistic ability test.
KW - Semantic context
KW - Keyword searching
KW - Latent Semantic Analysis
UR - http://www.scopus.com/inward/record.url?scp=85049661356&partnerID=8YFLogxK
U2 - 10.1080/10447318.2018.1485263
DO - 10.1080/10447318.2018.1485263
M3 - Article
AN - SCOPUS:85049661356
SN - 1044-7318
VL - 35
SP - 725
EP - 741
JO - INTERNATIONAL JOURNAL OF HUMAN-COMPUTER INTERACTION
JF - INTERNATIONAL JOURNAL OF HUMAN-COMPUTER INTERACTION
IS - 9
ER -