透彻分析和解决一切javaWeb项目乱码问题

时间 2020-08-08

标签透彻分析解决一切 javaweb 项目乱码问题栏目 Java 繁體版

原文原文链接

前言

乱码是咱们在程序开发中常常碰到且让人头疼的一件事，尤为是咱们在作javaweb开发，若是咱们没有清楚乱码产生的原理，碰到乱码问题了就容易摸不着头脑，无从下手。html

乱码主要出如今两部分，以下：java

第一，浏览器经过表单提交到后台，若是表单内容有中文，那么后台收到的数据可能会出现乱码。web

第二，后端服务器须要返回给浏览器数据，若是数据中带有中文，那么浏览器上可能会显示乱码。spring

接下来咱们逐一分析乱码产生的缘由，以及如何解决乱码问题。macos

1、后端收到浏览器提交的中文乱码

这里又分为get请求和post请求。后端

get请求

get请求，请求参数中带有中文，后台接收会出现乱码，缘由是tomcat默认编码是“ISO-8859-1”，因此tomcat会使用“ISO-8859-1”对中文进行编码，该编码不支持中文，因此后台接收到就乱码了。解决方式有两种。数组

param = new String(param.getBytes("ISO-8859-1"),"utf-8");

修改tomcat编码为"utf-8"，不建议使用这种方式。浏览器

post请求

post请求，出现乱码的缘由同get请求，解决方式比较简单，以下：tomcat

request.setCharacterEncoding("utf-8");

设置请求参数的编码格式为“utf-8”，这样就不会有问题了。服务器

2、后端返回中文给浏览器发生乱码

后端返回数据给浏览器，通常也有两种形式，一种是response.getOutputStream()，一种是response.getWriter()。

二者区别以及使用规则

getOutputStream()就是获得了OutputStream，用来向客户端（浏览器）输出任何数据，若是输出的是字符，会被转换成二进制输出，若是字符中出现中文，那么会出现“java.io.CharConversionException:Not an ISO 8859-1 character:”异常
getWriter()是对outputStream进行了包装,用来输出字符用的。

所以，调用requonse.getWriter()方法时可实现文本字符串数据输出，调用response.getOutputStream()方法可现实字节流数据的输出。因此，若是要输出图片等二进制数据时，须要使用response.getOutputStream。

注意，getOutputStream()和getWriter()不能同时使用，不然会抛出”getWriter() has already been called for this response“异常。

区别讲完了，下面咱们主要仍是经过实践分析下乱码产生的原理。

response.getOutputStream().print()

返回英文数据就不说了，没什么问题，看下返回中文是什么效果；

@RequestMapping("/helloworld.do")
public void helloworld(HttpServletRequest request, HttpServletResponse response) throws IOException {
    String str = "中国加油，武汉加油";
    response.getOutputStream().print(str);
}

结果以下：

分析：

OutPutStream是输出二进制数据的，因此须要对字符串改为二进制输出，Tomcat使用的是"ISO8859-1"编码对其进行转换，而中文对”ISO859-1“不支持，因此就抛异常了。

response.getOutputStream.write()

一样的，咱们再来看下输出中文会怎么样。

@RequestMapping("/helloworld.do")
public void helloworld(HttpServletRequest request, HttpServletResponse response) throws IOException {
    String str = "中国加油，武汉加油";
    response.getOutputStream().write(str.getBytes());
}

页面输出结果以下：

涓浗鍔犳补锛屾姹夊姞娌�

分析：

在java中,String的getBytes()方法是获得一个操做系统默认的编码格式的字节数组，我电脑的系统是macos，默认编码格式是utf-8，返回给浏览器是utf-8编码格式的字节数组，可是浏览器默认是"gbk"编码解析，因此就乱码了。

既然这样，那咱们换成“gb2312”编码（gb2312编码是gbk编码的一种）试试呢？

@RequestMapping("/helloworld.do")
public void helloworld(HttpServletRequest request, HttpServletResponse response) throws IOException {
    String str = "中国加油，武汉加油";
    response.getOutputStream().write(str.getBytes());
}

页面输出：

中国加油，武汉加油

原理咱们弄清楚了，可是在项目开发中，咱们须要编码统一，最经常使用的就是中文字符编码"UTF-8"，但是按照咱们的理解，若是咱们直接response.getOutputStream().write(str.getBytes("utf-8"));确定会乱码，咱们须要用某种方式，告诉浏览器，你要用我指定的“utf-8”编码接受我返回的中文。response.setContentType("text/html;charset=UTF-8")这样就完事了，看看效果吧。

@RequestMapping("/helloworld.do")
public void helloworld(HttpServletRequest request, HttpServletResponse response) throws IOException {
    String str = "中国加油，武汉加油";
    response.setContentType("text/html;charset=utf-8");
    response.getOutputStream().write(str.getBytes("utf-8"));
}

页面输出：

中国加油，武汉加油

response.getWriter()

前面已经总结过了，response.getWriter()跟response.getOutputStream()不同，outputStream是输出二进制的，writer是输出字符串的。response.getWriter()输出也有两种方法，一种是print()，一种是write()，其实二者在处理乱码这一块没有什么区别，就不分开讲述了。

示例：

@RequestMapping("/helloworld.do")
public void helloworld(HttpServletRequest request, HttpServletResponse response) throws IOException {
    String str = "中国加油，武汉加油";
    response.getWriter().print(str);
}

页面输出：

?????????

分析：

一样的，Tomcat默认的编码是ISO 8859-1，当咱们输出中文数据的时候，Tomcat会依据ISO 8859-1码表给咱们的数据编码，中文不支持这个码表呀，因此出现了乱码。

这个时候**response.setContentType("text/html;charset=UTF-8")**又派上用场了。

@RequestMapping("/helloworld.do")
public void helloworld(HttpServletRequest request, HttpServletResponse response) throws IOException {
    String str = "中国加油，武汉加油";
    response.setContentType("text/html;charset=utf-8");
    response.getWriter().print(str);
}

页面输出：

中国加油，武汉加油

在这里，response.setContentType("text/html;charset=UTF-8")作了两件事，response.setCharacterEncoding("UTF-8");和response.setHeader("Content-Type", "text/html;charset=UTF-8");具体就是，第一，输出中文”中国加油，武汉加油“的时候，对中文进行”utf-8“编码；第二，告诉浏览器，你也要用"utf-8"来显示我返回的中文。

最后

对于springMVC项目，如何解决乱码问题呢？项目中通常会在web.xml中配置编码过滤器。配置以下：

<filter>
    <filter-name>encodingFilter</filter-name>
    <filter-class>org.springframework.web.filter.CharacterEncodingFilter</filter-class>
    <init-param>
      <param-name>encoding</param-name>
      <param-value>UTF-8</param-value>
    </init-param>
    <init-param>
      <param-name>forceEncoding</param-name>
      <param-value>true</param-value>
    </init-param>
  </filter>
  <filter-mapping>
    <filter-name>encodingFilter</filter-name>
    <url-pattern>/*</url-pattern>
  </filter-mapping>

这样能保证请求的参数按照指定的编码格式进行编码，简单翻看下过滤器源码以下：

@Override
	protected void doFilterInternal(
			HttpServletRequest request, HttpServletResponse response, FilterChain filterChain)
			throws ServletException, IOException {

		if (this.encoding != null && (this.forceEncoding || request.getCharacterEncoding() == null)) {
			request.setCharacterEncoding(this.encoding);
			if (this.forceEncoding) {
				response.setCharacterEncoding(this.encoding);
			}
		}
		filterChain.doFilter(request, response);
	}

代码中有两处重要的地方值得注意，分别是request.setCharacterEncoding(this.encoding);和response.setCharacterEncoding(this.encoding);前者表示咱们对请求过来的参数使用指定的"utf-8"进行编码，后者即是，返回给浏览器时，后端返回字符的编码是“utf-8”。

好了，通过以上分析是否是乱码也没有那么可怕了。只要明白其中的原因，解决起来就是一行代码或者几行配置的事儿了，若是你们以为有帮助，不妨点赞支持一下？

若是你们以为我写的不错、清晰易懂的话，能够关注个人公众号“灰太狼学爪哇”，不按期分享原创技术文章，与君共勉。

原文出处：https://www.cnblogs.com/xiaoming0601/p/12304418.html