Focused crawling aims to search only the relevant subset of the WWW for a specific topic of user interest; leading to the necessity to decide about the relevancy of a document to the topic of interest; especially when the user is not perfect in specifying the exact context of the topic. This paper provides a novel framework of a context based distributed focused crawler that maintains an index of web documents pertaining to the context of keywords resulting in storage of more related documents.
Gupta, Pooja; Sharma, Ashok; Gupta, J. P.; and Bhatia, Komal
"A Novel Framework for Context Based Distributed Focused Crawler (CBDFC),"
International Journal of Computer and Communication Technology: Vol. 1
, Article 4.
Available at: https://www.interscience.in/ijcct/vol1/iss1/4