Take the Turing Community (ituring.com.cn) weekly specials page as an example: https://www.ituring.com.cn/tag/36527
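This page is updated every Monday with three half-price e-books. Start by analyzing the page's XPath to find the pattern.
Copy the XPath of each book's title link and paste them side by side. The three books' XPaths are: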
//*[@id="tag-book"]/div/ul/li[1]/div[2]/h4/a
//*[@id="tag-book"]/div/ul/li[2]/div[2]/h4/a
//*[@id="tag-book"]/div/ul/li[3]/div[2]/h4/a
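Only the index inside the middle li[] differs.
So in the options, the XPath can use li[*] to match all three cases. (Strictly, li[*] selects every li that has at least one child element, which is true of every li here; a plain li would also work.) The options can be written like this: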
{
  "expected_update_period_in_days": "2",
  "url": "https://www.ituring.com.cn/tag/36527",
  "type": "html",
  "mode": "on_change",
  "extract": {
    "url": {
      "xpath": "//*[@id=\"tag-book\"]/div/ul/li[*]/div[2]/h4/a",
      "value": "@href"
    },
    "title": {
      "xpath": "//*[@id=\"tag-book\"]/div/ul/li[*]/div[2]/h4/a",
      "value": "@title"
    }
  }
}
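To sanity-check the idea that one generalized XPath covers all three indexed ones, here is a minimal Python sketch. It runs against a hypothetical, heavily simplified copy of the listing markup (the real page has more attributes and nesting), and it uses the standard library's ElementTree, which only supports a subset of XPath: the leading `//` becomes `.//`, and because ElementTree does not accept the `[*]` predicate, the sketch uses a plain `li` step instead. The book titles and hrefs are made up.

```python
import xml.etree.ElementTree as ET

# Hypothetical stand-in for the listing markup; not the real page source.
SAMPLE = """
<html><body>
<div id="tag-book"><div><ul>
  <li><div>cover</div><div><h4><a href="/book/1" title="Book A">Book A</a></h4></div></li>
  <li><div>cover</div><div><h4><a href="/book/2" title="Book B">Book B</a></h4></div></li>
  <li><div>cover</div><div><h4><a href="/book/3" title="Book C">Book C</a></h4></div></li>
</ul></div></div>
</body></html>
"""

root = ET.fromstring(SAMPLE)


def select(path):
    """Return (title, href) pairs for every <a> element matched by path."""
    return [(a.get("title"), a.get("href")) for a in root.findall(path)]


# The three per-book XPaths, differing only in the li index.
indexed = []
for i in (1, 2, 3):
    indexed += select(f".//*[@id='tag-book']/div/ul/li[{i}]/div[2]/h4/a")

# The generalized path: dropping the index matches every li in document order.
generalized = select(".//*[@id='tag-book']/div/ul/li/div[2]/h4/a")

print(generalized)
print(indexed == generalized)
```

On this sample the single generalized path returns the same links, in the same order, as the three indexed paths combined, which is the behavior the options above rely on.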
Push the results with a Slack Agent.
Options:
{
  "webhook_url": "https://hooks.slack.com/services/xxxx/xxxxxxxx",
  "channel": "#book",
  "username": "Huginn",
  "message": "{{title}} https://www.ituring.com.cn{{url}}",
  "icon": ""
}
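The message field is a template: {{title}} and {{url}} are filled in from each incoming event, and the site's domain is prepended because the extracted href is presumably site-relative. The Python sketch below only imitates that substitution with a regular expression to show what the final Slack text looks like; it is not Huginn's actual template engine, and the `render` helper and the sample event are hypothetical.

```python
import re

MESSAGE = "{{title}} https://www.ituring.com.cn{{url}}"


def render(template, event):
    """Replace each {{key}} with the matching event field (empty string if missing)."""
    return re.sub(
        r"\{\{\s*(\w+)\s*\}\}",
        lambda m: str(event.get(m.group(1), "")),
        template,
    )


# Hypothetical event, shaped like the fields named in the "extract" options.
event = {"title": "Example Book", "url": "/book/1234"}

text = render(MESSAGE, event)
print(text)
```

If the extracted href were already an absolute URL, the hard-coded domain prefix in the message would have to be dropped.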