详细文章:html
http://www.cnblogs.com/yuanchenqi/articles/5956943.htmlpython
http://www.diveintopython3.net/strings.htmlide
|
上图仅适用于py2code
先说python2htm
再说python3blog
#-*-coding:utf-8-*- import sys print(sys.getdefaultencoding()) #ascii msg = "你好" msg_gbk = msg.decode("utf-8").encode("gbk") msg_gbk_to_utf8 = msg_gbk.decode("gbk").encode("utf-8") print(msg) print(msg_gbk) print(msg_gbk_to_utf8)
import sys print(sys.getdefaultencoding()) #utf-8 msg="你好" msg_gbk = msg.encode("gbk") #默认就是Unicode,不用再decode msg_gbk_to_unicode = msg_gbk.decode("gbk") msg_gbk_to_utf8 = msg_gbk.decode("gbk").encode("utf-8") print(msg) #你好 print(msg_gbk) #b'\xc4\xe3\xba\xc3' print(msg_gbk_to_unicode) #你好 print(msg_gbk_to_utf8) #b'\xe4\xbd\xa0\xe5\xa5\xbd'
编码应用比较多的场景应该是爬虫了,互联网上不少网站用的编码格式很杂,虽然总体趋向都变成utf-8,但如今仍是很杂,因此爬网页时就须要你进行各类编码的转换,不过生活正在变美好,期待一个不须要转码的世界。