httplib2---python下的http请求终结者(转)

时间 2019-11-08

标签 httplib2 httplib python http 请求终结者栏目 Python 繁體版

原文原文链接

httplib2---python下的http请求终结者(转)html

httplib2功能介绍：http://code.google.com/p/httplib2/python

httplib2实例页面：http://code.google.com/p/httplib2/w/listweb

httplib2问题提交：http://code.google.com/p/httplib2/issues/list缓存

好吧，我以为官方的样例仍是比较全的，这里就直接贴一下吧。服务器

Simple Retrievalcookie

import httplib2 app

h = httplib2.Http(".cache") dom

resp, content = h.request("http://example.org/", "GET") socket

Authenticationgoogle

import httplib2

h = httplib2.Http(".cache")

h.add_credentials('name', 'password')

resp, content = h.request("https://example.org/chap/2", ##ssl + base认证

"PUT", body="This is text",

headers={'content-type':'text/plain'} )

Cache-Control

import httplib2

h = httplib2.Http(".cache")

resp, content = h.request("http://bitworking.org/") #请求被缓存，下次还会用这个缓存而不去发送新的请求，缓存生效时间有web配置决定

...

resp, content = h.request("http://bitworking.org/",

headers={'cache-control':'no-cache'}) ##设置不用缓存，当次将不用缓存，而是直接发一个新的请求

Forms

>>> from httplib2 import Http

>>> from urllib import urlencode

>>> h = Http()

>>> data = dict(name="Joe", comment="A test comment")

>>> resp, content = h.request("http://bitworking.org/news/223/Meet-Ares", "POST", urlencode(data))

>>> resp

{'status': '200', 'transfer-encoding': 'chunked', 'vary': 'Accept-Encoding,User-Agent',

'server': 'Apache', 'connection': 'close', 'date': 'Tue, 31 Jul 2007 15:29:52 GMT',

'content-type': 'text/html'}

Cookies

#!/usr/bin/env python

import urllib

import httplib2

http = httplib2.Http()

url = 'http://www.example.com/login'

body = {'USERNAME': 'foo', 'PASSWORD': 'bar'}

headers = {'Content-type': 'application/x-www-form-urlencoded'}

response, content = http.request(url, 'POST', headers=headers, body=urllib.urlencode(body))

headers = {'Cookie': response['set-cookie']} ###将得到cookie设置到请求头中，以备下次请求使用

url = 'http://www.example.com/home'

response, content = http.request(url, 'GET', headers=headers) ##本次请求就不用带用户名，密码了

Proxies

import httplib2

import socks ##须要第三方模块

httplib2.debuglevel=4

h = httplib2.Http(proxy_info = httplib2.ProxyInfo(socks.PROXY_TYPE_HTTP, 'localhost', 8000))

r,c = h.request("http://bitworking.org/news/")

======================================================================================

下面是我本身对模块功能的尝试：

Http对象的构造方法：

__init__(self, cache=None, timeout=None, proxy_info=None, ca_certs=None, disable_ssl_certificate_validation=False)

proxy_info 的值是一个 ProxyInfo instance.

| 'cache'：

存放cache的位置，要么为字符串，要么为支持文件缓存接口的对象

| timeout：

超时时间，默认时会取python对socket连接超时的值

| ca_certs：

一个用于ssl服务器认证用的包涵了主CA认证的文件路径，默认会使用httplib2绑定的证书

| disable_ssl_certificate_validation：

肯定是否进行ssl认证

| add_certificate(self, key, cert, domain)

| 添加一个ssl认证key和文件

| add_credentials(self, name, password, domain='')

| 添加一个用户名，密码信息

| clear_credentials(self)

| 删除掉全部的用户名，密码信息，貌似仍是能够存多个用户名和密码

Http.request(self, uri, method='GET', body=None, headers=None, redirections=5, connection_type=None)

说明：

执行单次的http请求

uri：

一个以'http' 或 'https'开头的资源定位符字串，必须是一个绝对的地址

method：

支持全部的http请求方式。如： GET, POST, DELETE, etc..

body：

请求的附件数据，一个通过urllib.urlencode编码的字符串

headers：

请求头信息，一个字典对象

redirections：

最大的自动连续的重定向次数默认为5

(response, content)元组，response是一个httplib2.Response对象，content就是包含网页源码的字符串

httplib2.Response对象

其实就是一个包含全部头信息的字典，由于它自己就是集成自字典对象的