Web mining is the application of data mining techniques to discover interesting patterns from the Web. Web usage mining is the process of extracting useful information from server logs i.e users history. While discovering interesting patterns in multi agents the efficiency is decreased. In our paper presents hybrid algorithm is designed for the improvement of the efficiency of keywords based search engine. The model divides mining task into several parallel agents which coordinately work together, and the mining efficiency is improved greatly. The hybrid algorithm is Evolved from HITS, algorithm. Hybrid algorithm removes Link Farm pages in the expansion of root set, makes anchor text similarity calculation when crawling link page, and chooses pages by a brief conceptual analysis of page content. With the overcoming of the shortcomings of only text analysis or link analysis, Hybrid enhances the search engine in understanding the user interest and crawling more Web pages to meet the needs of the users.
Bharati, K. F.
"Hybrid Algorithm for Improving Efficiency of Keywords based Search Engine,"
International Journal of Computer Science and Informatics: Vol. 1
, Article 9.
Available at: https://www.interscience.in/ijcsi/vol1/iss4/9