Abstract

Abstract

Extensive pages along with their URLs are taken as samples, a query which can best summarize the page itself is constructed and sent to the search engine, the samples’ URLs are compared with the returned URLs, if there is a match between them or their content, consider the query as a lexical signature query or strong query. Assume that the page and its URL in surface web are supposed to be found by general search engine leads to the search engine’s quality measurement. By sending the strong query to different search engines, the qualities are derived. It will be a good source for the measurement only if the query extraction and processing targeted on web pages are well designed and implemented. This process is called find a lexical signature query for a given web page.

 

Keywords: lexical signature, query, search engine, Google, Yahoo, HTML tags, term frequency, document frequency, graph-based ranking algorithm, word rank, sentence rank.

你可能感兴趣的:(Abstract)