本体 搜索这里指通过聚焦爬虫(Focused Web Crawler)在Web 上跟踪链接、搜寻并下载由语义Web 语言,例 如RDFS 和OWL,编写的本体文档及相关数据(如链接所在的Web 页面等)的过程。
基于8个网页-相关网页
chemistry focused web crawler 化学主题网络爬虫
The main goals of focused web crawler are to get more web pages which are correlative with a certain topic and prepare data for users querying.
聚焦网络爬虫并不追求大的覆盖,而将目标定为抓取与某一特定主题内容相关的网页,为面向主题的用户查询准备数据资源。
In this paper, through the sort of the emergency event case website and improvement of the crawl algorithm, we get our emergency focused web crawler.
论文通过对突发事件案例网站进行分类,改进爬行器算法,实现面向突发事件案例的爬行器。
Traditional focused crawler is targeting web pages that are relevant to some specific topics. But some applications, such as web directory, are providing users with relevant websites.
传统的聚焦爬虫抓取的目标是与某一特定主题内容相关的网页,而在有些应用中,如网络目录,更多的是给用户提供主题相关网站。
应用推荐