TY - JOUR
T1 - A Comparative Evaluation of Full-text, Concept-based, and Context-sensitive Search
AU - Moskovitch, Robert
AU - Martins, Susana B.
AU - Behiri, Eytan
AU - Weiss, Aviram
AU - Shahar, Yuval
N1 - Funding Information:
This research was supported in part by NIH award No. LM-06806. We thank Mr. Tu and Dr. Peleg, from Stanford BioMedical Informatics, for useful discussions regarding the need for supporting the use of multiple guideline ontologies. Drs. Shiffman and Karras from Yale University and the University of Seattle assisted us in using the GEM ontology. Drs. Goldstein, Basso, Kaizer, and Advani, from Stanford University and the Veterans Administration, Palo Alto Health Care System, and Dr. Lunenfeld from the Soroka Medical Center, were extremely helpful in assessing the various interfaces and search. We want to thank the physicians from E&C-Medical Intelligence Ltd., who did this extremely time-consuming work in the creation of the test collection, Drs. Ben-Yoseph, Sidikman, and Zacsman. We thank The Ben Gurion University Information Systems Department undergraduate students A. Litmanovitz, O. Bohanna, and C. Sasson, who worked on the IR and HCI evaluation tools. We thank Drs. Shapira, Kuflik, and Goren-Bar from Ben Gurion University and Haifa University for useful discussions at the initial steps of the development of Vaidurya. Finally, we would like to thank the anonymous reviewers for their significant and helpful comments.
PY - 2007/3/1
Y1 - 2007/3/1
N2 - Objectives: Study comparatively (1) concept-based search, using documents pre-indexed by a conceptual hierarchy; (2) context-sensitive search, using structured, labeled documents; and (3) traditional full-text search. Hypotheses were: (1) more contexts lead to better retrieval accuracy; and (2) adding concept-based search to the other searches would improve upon their baseline performances. Design: Use our Vaidurya architecture, for search and retrieval evaluation, of structured documents classified by a conceptual hierarchy, on a clinical guidelines test collection. Measurements: Precision computed at different levels of recall to assess the contribution of the retrieval methods. Comparisons of precisions done with recall set at 0.5, using t-tests. Results: Performance increased monotonically with the number of query context elements. Adding context-sensitive elements, mean improvement was 11.1% at recall 0.5. With three contexts, mean query precision was 42% ± 17% (95% confidence interval [CI], 31% to 53%); with two contexts, 32% ± 13% (95% CI, 27% to 38%); and one context, 20% ± 9% (95% CI, 15% to 24%). Adding context-based queries to full-text queries monotonically improved precision beyond the 0.4 level of recall. Mean improvement was 4.5% at recall 0.5. Adding concept-based search to full-text search improved precision to 19.4% at recall 0.5. Conclusions: The study demonstrated usefulness of concept-based and context-sensitive queries for enhancing the precision of retrieval from a digital library of semi-structured clinical guideline documents. Concept-based searches outperformed free-text queries, especially when baseline precision was low. In general, the more ontological elements used in the query, the greater the resulting precision.
AB - Objectives: Study comparatively (1) concept-based search, using documents pre-indexed by a conceptual hierarchy; (2) context-sensitive search, using structured, labeled documents; and (3) traditional full-text search. Hypotheses were: (1) more contexts lead to better retrieval accuracy; and (2) adding concept-based search to the other searches would improve upon their baseline performances. Design: Use our Vaidurya architecture, for search and retrieval evaluation, of structured documents classified by a conceptual hierarchy, on a clinical guidelines test collection. Measurements: Precision computed at different levels of recall to assess the contribution of the retrieval methods. Comparisons of precisions done with recall set at 0.5, using t-tests. Results: Performance increased monotonically with the number of query context elements. Adding context-sensitive elements, mean improvement was 11.1% at recall 0.5. With three contexts, mean query precision was 42% ± 17% (95% confidence interval [CI], 31% to 53%); with two contexts, 32% ± 13% (95% CI, 27% to 38%); and one context, 20% ± 9% (95% CI, 15% to 24%). Adding context-based queries to full-text queries monotonically improved precision beyond the 0.4 level of recall. Mean improvement was 4.5% at recall 0.5. Adding concept-based search to full-text search improved precision to 19.4% at recall 0.5. Conclusions: The study demonstrated usefulness of concept-based and context-sensitive queries for enhancing the precision of retrieval from a digital library of semi-structured clinical guideline documents. Concept-based searches outperformed free-text queries, especially when baseline precision was low. In general, the more ontological elements used in the query, the greater the resulting precision.
UR - http://www.scopus.com/inward/record.url?scp=33847196843&partnerID=8YFLogxK
U2 - 10.1197/jamia.M1953
DO - 10.1197/jamia.M1953
M3 - Article
AN - SCOPUS:33847196843
SN - 1067-5027
VL - 14
SP - 164
EP - 174
JO - Journal of the American Medical Informatics Association
JF - Journal of the American Medical Informatics Association
IS - 2
ER -