A Study on Query Refinement by Online Relevance Feedback in an Information Filtering System

Optimizing Refined Queries in an Information Filtering System Using Online User Feedback

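  • 최광 (Educational Informatization Business Team, Orom Information Co., Ltd.) ;
  • 정영미 (Department of Library and Information Science, Yonsei University)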
  • Published : 2003.12.30

Abstract

In this study, an information filtering system was implemented and a series of relevance feedback experiments was conducted with it. For relevance feedback, the original queries were searched against the database and the results were reviewed by the researchers. Based on users' online relevance judgments, two sets of 17 refined queries each were generated using two methods, the 'co-occurrence exclusion method' and the 'lower frequencies exclusion method.' The refined queries were built from the original queries together with the descriptors and category codes that appeared in either the relevant or the irrelevant document sets. Users' relevance judgments on the search results of the refined queries were then compared with their judgments on the results of the original queries.

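The purpose of this study is to find a way to optimize retrieval performance in an information filtering system that delivers large volumes of current information, by automatically generating refined queries from user feedback and running the search again. Users entered online relevance judgments on the documents that the information filtering system retrieved with their initial queries. Based on this feedback, 17 refined queries were generated by each of two methods, the 'co-occurrence exclusion method' and the 'lower frequencies exclusion method,' and the results of the second search were compared with the initial search results. For each method, 17 Boolean query patterns were prepared in advance, and the refined queries were produced by combining the initial query with descriptors and category codes. The optimal refined query expression was then derived from a relevance evaluation of the second search results.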
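The general idea of filling a Boolean pattern with the initial query, descriptors, and category codes can be sketched as follows. This is only an illustration under stated assumptions: the abstract does not give the 17 patterns or the exact selection rules, so the two selection functions below are one possible reading of the method names, and every function name, the pattern string, and the sample data are hypothetical.

```python
from collections import Counter


def term_counts(docs):
    """Count the number of documents in which each term appears."""
    counts = Counter()
    for terms in docs:
        counts.update(set(terms))
    return counts


def cooccurrence_exclusion(relevant, irrelevant):
    """Assumed rule: keep terms seen in relevant documents
    that never appear in any irrelevant document."""
    rel = term_counts(relevant)
    irr = term_counts(irrelevant)
    return sorted(t for t in rel if t not in irr)


def lower_frequency_exclusion(relevant, min_count=2):
    """Assumed rule: keep terms that appear in at least
    min_count relevant documents."""
    rel = term_counts(relevant)
    return sorted(t for t, n in rel.items() if n >= min_count)


def or_group(terms):
    """Join terms into a parenthesized OR group."""
    return "(" + " OR ".join(terms) + ")"


def refine(original, descriptors, categories, pattern):
    """Fill a Boolean pattern; Q, D, and C stand for the original query,
    the descriptor group, and the category-code group."""
    return pattern.format(
        Q="(" + original + ")",
        D=or_group(descriptors),
        C=or_group(categories),
    )


# Hypothetical feedback data: descriptors of judged documents.
rel_desc = [
    ["filtering", "feedback"],
    ["filtering", "boolean"],
    ["boolean", "feedback", "filtering"],
]
irr_desc = [["boolean", "hardware"]]

kept_a = cooccurrence_exclusion(rel_desc, irr_desc)
kept_b = lower_frequency_exclusion(rel_desc, min_count=3)

# One hypothetical pattern; the study used 17 per method.
query = refine("user feedback", kept_a, ["C12"], "{Q} AND ({D} OR {C})")
print(query)
```

With the sample data, the first rule keeps `feedback` and `filtering`, while the second keeps only `filtering`; the two methods can therefore yield different refined queries from the same feedback.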
