常见的UA值,使用的时候能够直接复制粘贴,也能够用浏览器访问的时候抓包css
1.Android Mozilla/5.0 (Linux; Android 4.1.1; Nexus 7 Build/JRO03D) AppleWebKit/535.19 (KHTML, like Gecko) Chrome/18.0.1025.166 Safari/535.19 Mozilla/5.0 (Linux; U; Android 4.0.4; en-gb; GT-I9300 Build/IMM76D) AppleWebKit/534.30 (KHTML, like Gecko) Version/4.0 Mobile Safari/534.30 Mozilla/5.0 (Linux; U; Android 2.2; en-gb; GT-P1000 Build/FROYO) AppleWebKit/533.1 (KHTML, like Gecko) Version/4.0 Mobile Safari/533.1 2.Firefox Mozilla/5.0 (Windows NT 6.2; WOW64; rv:21.0) Gecko/20100101 Firefox/21.0 Mozilla/5.0 (Android; Mobile; rv:14.0) Gecko/14.0 Firefox/14.0 3.Google Chrome Mozilla/5.0 (Windows NT 6.2; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/27.0.1453.94 Safari/537.36 Mozilla/5.0 (Linux; Android 4.0.4; Galaxy Nexus Build/IMM76B) AppleWebKit/535.19 (KHTML, like Gecko) Chrome/18.0.1025.133 Mobile Safari/535.19 4.iOS Mozilla/5.0 (iPad; CPU OS 5_0 like Mac OS X) AppleWebKit/534.46 (KHTML, like Gecko) Version/5.1 Mobile/9A334 Safari/7534.48.3 Mozilla/5.0 (iPod; U; CPU like Mac OS X; en) AppleWebKit/420.1 (KHTML, like Gecko) Version/3.0 Mobile/3A101a Safari/419.3
用来处理复杂请求html
# 生成 cookie的管理器 cookie_handler = request.HTTPCookieProcessor(cookie) # 建立http请求管理器 http_handler = request.HTTPHandler() # 生成https管理器 https_handler = request.HTTPSHandler()
proxies = { "http":"address of proxy", "https": "address of proxy" } rsp = requests.request("get", "http:xxxxxx", proxies=proxies)
代理验证前端
#可能须要使用HTTP basic Auth, 能够这样 # 格式为 用户名:密码@代理地址:端口地址 proxy = { "http": "china:123456@192.168.1.123:4444"} rsp = requests.get("http://baidu.com", proxies=proxy)
若是遇到web客户端验证,须要添加auth=(用户名,密码)python
autu=("test1", "123456")#受权信息 rsp = requests.get("http://www.baidu.com", auth=auth)
requests能够自动处理cookie信息git
rsp = requests.get("http://xxxxxxxxxxx") # 若是对方服务器给传送过来cookie信息,则能够经过反馈的cookie属性获得 # 返回一个cookiejar实例 cookiejar = rsp.cookies #能够讲cookiejar转换成字典 cookiedict = requests.utils.dict_from_cookiejar(cookiejar)
能让咱们跨请求时保持某些参数,好比在同一个session实例发出的 全部请求之间保持cookiegithub
# 建立session对象,能够保持cookie值 ss = requests.session() headers = {"User-Agetn":"xxxxxxxxxxxxxxxxxx"} data = {"name":"xxxxxxxxxxx"} # 此时,由建立的session管理请求,负责发出请求, ss.post("http://www.baidu.com", data=data, headers=headers) rsp = ss.get("xxxxxxxxxxxx")
若是不须要验证ssl证书,则设置成False表示关闭web
rsp = requests.get("https://www.baidu.com", verify=False) # 若是用verify=True访问12306,会报错,由于他证书有问题