The main goals of focused web crawler are to get more web pages which are correlative with a certain topic and prepare data for users querying.
聚焦网络爬虫并不追求大的覆盖,而将目标定为抓取与某一特定主题内容相关的网页,为面向主题的用户查询准备数据资源。
In this paper, through the sort of the emergency event case website and improvement of the crawl algorithm, we get our emergency focused web crawler.
论文通过对突发事件案例网站进行分类,改进爬行器算法,实现面向突发事件案例的爬行器。
Traditional focused crawler is targeting web pages that are relevant to some specific topics. But some applications, such as web directory, are providing users with relevant websites.
传统的聚焦爬虫抓取的目标是与某一特定主题内容相关的网页,而在有些应用中,如网络目录,更多的是给用户提供主题相关网站。
Focused crawler is a subject-oriented information retrieval system. It can meet the users' need and retrieve information that is relevant to some specific subjects from the web automatically.
聚焦爬虫是一种面向主题的信息搜集系统,可以根据用户需要从互联网上自动搜集到主题相关信息,在主题搜索引擎、站点结构分析等方面取得越来越广泛的应用。
Focused crawler is a subject-oriented information retrieval system. It can meet the users' need and retrieve information that is relevant to some specific subjects from the web automatically.
聚焦爬虫是一种面向主题的信息搜集系统,可以根据用户需要从互联网上自动搜集到主题相关信息,在主题搜索引擎、站点结构分析等方面取得越来越广泛的应用。
应用推荐