Improving Performance of Web Search using The User Preference in Query Word Senses

질의어 의미별 사용자 선호도를 이용한 웹 검색의 성능 향상

  • 김형일 (동국대학교 컴퓨터공학과) ;
  • 김준태 (동국대학교 컴퓨터공학과)
  • Published : 2004.08.01

Abstract

In this paper, we propose a Web page weighting scheme using the user preference in each sense of query word to improve the performance of Web search. Generally search engines assign weights to a web page by using relevancy only, which is obtained by comparing the query word and the words in a web page. In the information retrieval from huge data such as the Web, simple word comparison cannot distinguish important documents because there exist too many documents with similar relevancy In this paper we implement a WordNet-based user interface that helps to distinguish different senses of query word, and constructed a search engine in which the implicit evaluations by multiple users are reflected in ranking by accumulating the number of clicks. In accumulating click counts, they are stored separately according to senses, so that more accurate search is possible. The experimental results with several keywords show that the precision of proposed system is improved compared to conventional search engines.

본 논문에서는 웹 검색의 성능 향상을 위해 질의어 의미별 사용자 선호도를 이용한 웹 페이지의 가중치 부여 방식을 제안한다. 일반적으로 검색엔진들은 검색 질의어와 웹 페이지의 어휘 비교에 의한 관련도 측정만을 사용하여 웹 페이지의 가중치를 부여한다. 웹과 같이 방대한 자료를 대상으로 검색을 할 경우 유사한 관련도를 가진 검색 결과가 매우 많으므로 어휘 비교만으로는 중요한 웹 페이지를 선별하기 어렵다. 본 논문에서는 질의어의 의미를 구분하도록 워드넷(WordNet)을 이용한 사용자 인터페이스를 구축하고, 사용자의 클릭 수를 각 웹 페이지의 가중치에 누적함으로써 다수 사용자의 검색 행위에 의한 묵시적 평가가 웹 페이지의 검색 순위에 반영되는 검색 시스템을 구현하였다. 클릭수의 누적에 있어서 질의 어 의미별로 가중치를 구분하여 저장함으로써 일반적인 검색엔진보다 정확한 검색이 되었으며, 웹 페이지의 범주별 가중치와 질의어의 의미별 사용자 선호도를 이용함으로써 검색 시스템의 성능을 향상시킬 수 있다는 것을 20개의 어휘에 관련된 41개의 의미들을 대상으로 실험한 결과로 확인하였다.

Keywords

References

  1. D. Dreilinger and A. E. Howe, 'An information gathering agent for querying web search engines,' Computer Science Technical report, CS-96-111, Colorado State University, 1996
  2. D. Dreilinger and A. E. Howe, 'Experiences with selecting search engines using metasearch,' ACM Transactions on Information Systems, Vol.15, 1997
  3. E. J. Glover, S. Lawrence, M. D. Gordon, W. P. Birmingham, and C. L. Giles, 'Web Search - Your Way', Communications of the ACM, vol. 44, No.12, 2001 https://doi.org/10.1145/501317.501319
  4. N. J. Belkin, D. Kelly, G. Kim, J. Y. Kim, H. J. Lee, G. Muresan, M. C. Tang, X. J. Yuan and C. Cool, 'Query length in interactive information retrieval,' SIGIR,' pp. 205-212, 2003
  5. S. Lawrence, 'Context in Web Search,' IEEE Data Engineering Bulletin, Vol.23, pp.25-32, 2000
  6. X. Shen and C. X. Zhai, 'Exploiting query history for document ranking in interactive information retrieval,' SIGIR 2003, pp. 377-378, 2003
  7. D. Moldovan and R. Mihalcea, 'A WordNet-Based Interface to Internet Search Engines,' Proceedings of FLAIRS-98, 1998
  8. E. Voorhees, 'Using WordNet to disambiguate word senses for text retrieval,' Proceedings of the 16th ACM-SIGIR Conference, 1993
  9. M. Sanderson, 'Word sense disambiguation and information retrieval,' Proceedings of SIGIR-94, 1994
  10. X. Li, S. Szpakowicz and S. Matwin, 'A Word-Net-based Algorithm for Word Sense Disambiguation,' The 1995 International Joint Conferences on Artificial Intelligence, 1995
  11. G. A. Miller, 'WordNet : An On-Line Lexical Database,' International Journal of Lexicography, 1990 https://doi.org/10.1093/ijl/3.4.235
  12. G. A. Miller, 'Nouns in WordNet: A Lexical Inheritance System,' Communications of the ACM, Volume 38, Issue 11, 1995
  13. S. Scott and S. Matwin, 'Text Classification Using WordNet Hypernyms,' Coling-ACL '98Workshop, 1998
  14. A. S. Chakravarthy and K. B. Haase, 'NetSerf : Using Semantic Knowledge to Find Internet Information Archives,' Proceeding of the ACM SIGIR Conference, 1995 https://doi.org/10.1145/215206.215326
  15. D. Moldovan and R. Mihalcea, 'Using WordNet and Lexical Operators to improve Internet Searches,' IEEE Internet Computing, Vol.4, No.1, 2000 https://doi.org/10.1109/4236.815847
  16. M. Balabanovic, 'An Adaptive Web Page Recommendation Service,' Proceedings of the First International Conference on Autonomous Agents, 1997
  17. L. Chen and K. Sycara, 'WebMate : A Personal Agent for Browsing and Searching,' Proceedings of the 2nd Internarional Conference on Autonomous Agents, 1998 https://doi.org/10.1145/280765.280789
  18. E. J. Glover, S. Lawrence,W. P. Birmingham and C. L. Giles, 'Architecture of a Metasearch Engine That Supports User Information Needs,' CIKM, pp. 210-216, 1999 https://doi.org/10.1145/319950.319980
  19. E. J. Glover and W. P. Birmingham, 'Using decision theory to order documents,' In Digital Libraries 98, Pittsburgh, PA, 1998 https://doi.org/10.1145/276675.276732
  20. E. Agichtein, S. Lawrence and L. Gravano, 'Learning search engine specific query transformations for question answering,' In Tenth International World Wide Web Conference, Hong Kong, 2001 https://doi.org/10.1145/371920.371976
  21. J. M. Kleinberg, 'Authoritative sources in a hyperlinked environment,' The Journal of the ACM, Volume 46, Issue 5, 1999