找到全部 <img src=....>
图像的连接:html
xpath = './/img/@src'
img_urls = html.xpath(xpath)
from lxml import etree
etree 下的 HTML 对象,其构造函数接受 requests.request 的返回值对象:python
url = ...
user_agent = ...
headers = {'User-Agent' : user_agent}
req = requests.request(url=url, headers=headers)
html = etree.HTML(req.text)
xpath定位中starts-with、contains和text()的用法函数