问题:matlab脚本与函数文件的中文注释显示乱码.
环境:matlab R2016a.Windows 10 home.
解决方案:
step1
检查locale值
matlab命令行键入命令
feature(‘locale’) html
>> feature('locale') ans = 包含如下字段的 struct: ctype: 'zh_CN.UTF-8' collate: 'zh_CN.UTF-8' time: 'zh_CN.UTF-8' numeric: 'en_US_POSIX.UTF-8' monetary: 'zh_CN.UTF-8' messages: 'zh_CN.UTF-8' encoding: 'UTF-8' terminalEncoding: 'UTF-8' jvmEncoding: 'UTF-8' status: 'MathWorks locale management system initialized.' warning: ''
注意:
System locale设置
控制面板Region,Administrative选项卡,Language for non-Unicode设置为Chinese(Simplified,China).
step3
确认效果.
打开m文件,添加中文注释,保存并从新打开确认中文显示正常.
注意:
若是中文字符显示?是乱码问题,若是显示方框则是字体显示问题(好比使用西方字体显示中文字体).可在matlab预设->字体选择中文字体.matlab默认设置为Monospaced,若是系统没有支持等宽字体的中文字体,中文字符会显示方框.能够下载并安装宋体-18030,该字体支持等宽字体.
补充
若是想以utf-8编码,而不是GBK编码的话,请编辑matlab的locale数据库文件lcdata.xml(matlab bin目录下).
删除
< encoding name=”GBK”>
< encoding_alias name=”936”>
< /encoding>数据库
并将
< encoding name=”UTF-8”>
< encoding_alias name=”utf8”/>
< /encoding>
改成< encoding name=”UTF-8”>
< encoding_alias name=”utf8”/>
< encoding_alias name=”GBK”/>
< /encoding>
重启matlab以后,即以utf-8编码.
注意:
常见的编码格式
• ascII编码,别名ANSI,使用7个bit.
• utf-8,可变长度,兼容ascII,中文字符占16个bit
• utf-16,左右字符一概占16个bit
• GB2312,简体中文
• GB5,繁体中文
lcdata.xml修改locale tag不起做用,我猜想matlab硬编码了locale列表,而不是查询lcdata.xml.其内部的locale格式
locale — Format: language_territory.codeset
For example, for the U.S. English locale setting en_US.US-ASCII, en means that the display language is English. US indicates that time and date displays use U.S. conventions. US-ASCII is the coded character set (codeset) used to display text.jvm
From https://www.mathworks.com/help/matlab/matlab_env/how-the-matlab-process-uses-locale-settings.html
若是你的系统和用户locale值均为zh_CN,则matlab查询硬编码存储的locale列表定位到zh_CN.GBK,并使用GBK在lcdata.xml查询codeset.函数
参考文献
• ISO-3166 Country Codes
• ISO-639 Language Codes
• Internationalization(matlab帮助)
• How do I get my MATLAB editor to read UTF-8 characters? UTF-8 characters in blank squares in editors, but in the command window and workspace works fine.字体