HttpClient链接池抛出大量ConnectionPoolTimeoutException

时间 2019-11-13

标签 httpclient 链接抛出大量 connectionpooltimeoutexception 栏目系统网络繁體版

原文原文链接

今天解决了一个HttpClient的异常，汗啊，一个HttpClient使用稍有不慎都会是毁灭级别的啊。java

这里有以前由于route配置不当致使服务器异常的一个处理：http://blog.csdn.net/shootyou/article/details/6415248linux

里面的HttpConnectionManager实现就是我在这里使用的实现。apache

问题表现：tomcat

tomcat后台日志发现大量异常服务器

[plain] view plain copyapp

print ?tcp

org.apache.http.conn.ConnectionPoolTimeoutException: Timeout waiting for connection

时间一长tomcat就没法继续处理其余请求，从假死变成真死了。url

linux运行：spa

[plain] view plain copy.net

print ?

netstat -n | awk '/^tcp/ {++S[$NF]} END {for(a in S) print a, S[a]}'

发现CLOSE_WAIT的数量始终在400以上，一直没降过。

问题分析：

一开始我对个人HttpClient使用过程深信不疑，我不认为异常是来自这里。

因此我开始从TCP的链接状态入手，猜想可能致使异常的缘由。之前常常遇到TIME_WAIT数过大致使的服务器异常，很容易解决，修改下sysctl就ok了。可是此次是CLOSE_WAIT，是彻底不一样的概念了。

关于TIME_WAIT和CLOSE_WAIT的区别和异常处理我会单独起一篇文章详细说说个人理解。

简单来讲CLOSE_WAIT数目过大是因为被动关闭链接处理不当致使的。

我说一个场景，服务器A会去请求服务器B上面的apache获取文件资源，正常状况下，若是请求成功，那么在抓取完资源后服务器A会主动发出关闭链接的请求，这个时候就是主动关闭链接，链接状态咱们能够看到是TIME_WAIT。若是一旦发生异常呢？假设请求的资源服务器B上并不存在，那么这个时候就会由服务器B发出关闭链接的请求，服务器A就是被动的关闭了链接，若是服务器A被动关闭链接以后本身并无释放链接，那就会形成CLOSE_WAIT的状态了。

因此很明显，问题仍是处在程序里头。

先看看个人HttpConnectionManager实现：

[java] view plain copy

print ?

public class HttpConnectionManager {
private static HttpParams httpParams;
private static ClientConnectionManager connectionManager;
/**
* 最大链接数
*/
public final static int MAX_TOTAL_CONNECTIONS = 800;
/**
* 获取链接的最大等待时间
*/
public final static int WAIT_TIMEOUT = 60000;
/**
* 每一个路由最大链接数
*/
public final static int MAX_ROUTE_CONNECTIONS = 400;
/**
* 链接超时时间
*/
public final static int CONNECT_TIMEOUT = 10000;
/**
* 读取超时时间
*/
public final static int READ_TIMEOUT = 10000;
static {
httpParams = new BasicHttpParams();
// 设置最大链接数
ConnManagerParams.setMaxTotalConnections(httpParams, MAX_TOTAL_CONNECTIONS);
// 设置获取链接的最大等待时间
ConnManagerParams.setTimeout(httpParams, WAIT_TIMEOUT);
// 设置每一个路由最大链接数
ConnPerRouteBean connPerRoute = new ConnPerRouteBean(MAX_ROUTE_CONNECTIONS);
ConnManagerParams.setMaxConnectionsPerRoute(httpParams,connPerRoute);
// 设置链接超时时间
HttpConnectionParams.setConnectionTimeout(httpParams, CONNECT_TIMEOUT);
// 设置读取超时时间
HttpConnectionParams.setSoTimeout(httpParams, READ_TIMEOUT);
SchemeRegistry registry = new SchemeRegistry();
registry.register(new Scheme("http", PlainSocketFactory.getSocketFactory(), 80));
registry.register(new Scheme("https", SSLSocketFactory.getSocketFactory(), 443));
connectionManager = new ThreadSafeClientConnManager(httpParams, registry);
}
public static HttpClient getHttpClient() {
return new DefaultHttpClient(connectionManager, httpParams);
}
}

看到没MAX_ROUTE_CONNECTIONS 正好是400，跟CLOSE_WAIT很是接近啊，难道是巧合？继续往下看。

而后看看调用它的代码是什么样的：

[java] view plain copy

print ?

public static String readNet (String urlPath)
{
StringBuffer sb = new StringBuffer ();
HttpClient client = null;
InputStream in = null;
InputStreamReader isr = null;
try
{
client = HttpConnectionManager.getHttpClient();
HttpGet get = new HttpGet();
get.setURI(new URI(urlPath));
HttpResponse response = client.execute(get);
if (response.getStatusLine ().getStatusCode () != 200) {
return null;
}
HttpEntity entity =response.getEntity();
if( entity != null ){
in = entity.getContent();
.....
}
return sb.toString ();
}
catch (Exception e)
{
e.printStackTrace ();
return null;
}
finally
{
if (isr != null){
try
{
isr.close ();
}
catch (IOException e)
{
e.printStackTrace ();
}
}
if (in != null){
try
{
<span style="color:#ff0000;">in.close ();</span>
}
catch (IOException e)
{
e.printStackTrace ();
}
}
}
}

很简单，就是个远程读取中文页面的方法。值得注意的是这一段代码是后来某某同窗加上去的，看上去没啥问题，是用于非200状态的异常处理：

[java] view plain copy

print ?

if (response.getStatusLine ().getStatusCode () != 200) {
return null;
}

代码自己没有问题，可是问题是放错了位置。若是这么写的话就没问题：

[java] view plain copy

print ?

client = HttpConnectionManager.getHttpClient();
HttpGet get = new HttpGet();
get.setURI(new URI(urlPath));
HttpResponse response = client.execute(get);
HttpEntity entity =response.getEntity();
if( entity != null ){
in = entity.getContent();
..........
}
if (response.getStatusLine ().getStatusCode () != 200) {
return null;
}
return sb.toString ();

看出毛病了吧。在这篇入门（HttpClient4.X 升级入门 + http链接池使用）里头我提到了HttpClient4使用咱们经常使用的InputStream.close()来确认链接关闭，前面那种写法InputStream in 根本就不会被赋值，意味着一旦出现非200的链接，这个链接将永远僵死在链接池里头，太恐怖了。。。因此咱们看到CLOST_WAIT数目为400，由于对一个路由的链接已经彻底被僵死链接占满了。。。

其实上面那段代码还有一个没处理好的地方，异常处理不够严谨，因此最后我把代码改为了这样：

[java] view plain copy

print ?

public static String readNet (String urlPath)
{
StringBuffer sb = new StringBuffer ();
HttpClient client = null;
InputStream in = null;
InputStreamReader isr = null;
HttpGet get = new HttpGet();
try
{
client = HttpConnectionManager.getHttpClient();
get.setURI(new URI(urlPath));
HttpResponse response = client.execute(get);
if (response.getStatusLine ().getStatusCode () != 200) {
get.abort();
return null;
}
HttpEntity entity =response.getEntity();
if( entity != null ){
in = entity.getContent();
......
}
return sb.toString ();
}
catch (Exception e)
{
get.abort();
e.printStackTrace ();
return null;
}
finally
{
if (isr != null){
try
{
isr.close ();
}
catch (IOException e)
{
e.printStackTrace ();
}
}
if (in != null){
try
{
in.close ();
}
catch (IOException e)
{
e.printStackTrace ();
}
}
}
}

显示调用HttpGet的abort，这样就会直接停止此次链接，咱们在遇到异常的时候应该显示调用，由于谁能保证异常是在InputStream in赋值以后才抛出的呢。

好了，分析完毕，明天准备总结下CLOSE_WAIT和TIME_WAIT的区别。