WebCollector 2.x官网和镜像:java
官网:https://github.com/CrawlScript/WebCollectorgit
镜像:http://git.oschina.net/webcollector/WebCollectorgithub
WebCollector 2.x教程:web
WebCollector 2.x tutorial 2 (BreadthCrawler中文教程)ajax
WebCollector 2.x 新闻网页正文自动提取算法算法
WebCollector 2.x 抽取器 (Extractor和MultiExtractorCrawler)cookie
WebCollector爬取JS生成数据spa
WebCollector爬取搜狗搜索(分页).net
WebCollector爬取JSON数据orm
使用SoupLang脚本同时管理多个页面爬取 SoupLang脚本
用WebCollector 2.x爬取新浪微博(无需手动获取cookie)
WebCollector 2.x教程(镜像):
WebCollector 2.x tutorial 2 (BreadthCrawler中文教程)
WebCollector 2.x 新闻网页正文自动提取算法
WebCollector 2.x 抽取器 (Extractor和MultiExtractorCrawler)
WebCollector爬取JS生成数据
WebCollector爬取搜狗搜索(分页)
WebCollector爬取JSON数据