cheerio 服务器端的jquery

cheerio

https://cheerio.js.org/html

Fast, flexible, and lean implementation of core jQuery designed specifically for the server.

 

Features

❤ Familiar syntax: Cheerio implements a subset of core jQuery. Cheerio removes all the DOM inconsistencies and browser cruft from the jQuery library, revealing its truly gorgeous API.前端

ϟ Blazingly fast: Cheerio works with a very simple, consistent DOM model. As a result parsing, manipulating, and rendering are incredibly efficient.git

❁ Incredibly flexible: Cheerio wraps around @FB55’s forgiving htmlparser2. Cheerio can parse nearly any HTML or XML document.github

例子



const cheerio = require('cheerio'); const $ = cheerio.load('<ul id="fruits">...</ul>');
$('.apple', '#fruits').text() //=> Apple $('ul .pear').attr('class') //=> pear $('li[class=orange]').html() //=> Orange 

 

用处

能够用做服务器端的网页爬虫, 解析前端的静态页面。web

由于通常页面的主页都使用静态页面, 来提升SEO, 和首屏的页面相应度。使用这种工具正好能够应对, 静态页面解析, 提取有用数据。网页爬虫

 

还能够处理静态页面, 给静态页面添加一些页面元素,或者脚本, 在代理的角色中,对往来网页作修改。服务器

 

其它

Cheerio is not a web browser

Cheerio parses markup and provides an API for traversing/manipulating the resulting data structure. It does not interpret the result as a web browser does. Specifically, it does not produce a visual rendering, apply CSS, load external resources, or execute JavaScript. If your use case requires any of this functionality, you should consider projects like PhantomJS or JSDom.app

相关文章
相关标签/搜索