python模仿scrapy框架实现一个通用爬虫

时间 2021-01-16

原文原文链接

不适用框架，写一个功能相对较全面的爬虫，包含监控网站更新、代理ip、限流器、提取链接、重试下载等基本功能代码如下：主函数： def run(self): while not self.crawler_queue.empty(): url_str = self.crawler_queue.get() print("url_str is ::::::{}".