Working with Excel in Python

Date: 2019-11-08


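I often need to work with Excel files, so I am summarizing the relevant material here for reference.

Working with Excel from Python comes down to two things: reading and writing.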

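Introspecting a Book

The xlrd.Book object returned by open_workbook contains all the information about the workbook and can be used to retrieve the individual sheets in it.

The nsheets attribute is an integer holding the number of sheets in the workbook. Combined with the sheet_by_index method, it is the most common way to retrieve individual sheets.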

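Starting with reading

The sheet_names method returns a list of unicode strings with the names of all the sheets in the workbook. Individual sheets can be retrieved by these names with the sheet_by_name method.

The sheets method returns a list of all the sheets in the workbook, which can be iterated over.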

from xlrd import open_workbook

book = open_workbook('simple.xls')

print(book.nsheets)

# Access each sheet by its index
for sheet_index in range(book.nsheets):
    print(book.sheet_by_index(sheet_index))

# Access each sheet by its name
print(book.sheet_names())
for sheet_name in book.sheet_names():
    print(book.sheet_by_name(sheet_name))

# Iterate over the sheet objects directly
for sheet in book.sheets():
    print(sheet)

The xlrd.Book object has other attributes relating to the workbook's content, but they are rarely needed:

codepage
countries
user_name

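If you think you may need these attributes, see the xlrd documentation.

The xlrd.sheet.Sheet objects returned by the methods described above contain all the information about a worksheet and its contents.

The name attribute is the worksheet's name as a unicode string.

The nrows and ncols attributes hold the number of rows and the number of columns in the worksheet, respectively.

Together with the cell method, these attributes make it possible to display a worksheet's contents by iterating over its rows and columns.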
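A minimal sketch of that iteration follows. The helper names sheet_lines and print_workbook are made up for illustration and are not part of xlrd; the sketch only relies on the name, nrows and ncols attributes and the cell(row, col) method described in this article.

```python
def sheet_lines(sheet):
    """Return the sheet name followed by one tab-separated line per row.

    `sheet` is expected to behave like an xlrd.sheet.Sheet: it needs the
    `name`, `nrows` and `ncols` attributes and a `cell(row, col)` method
    whose result has a `.value` attribute.
    """
    lines = [sheet.name]
    for row_index in range(sheet.nrows):
        values = []
        for col_index in range(sheet.ncols):
            values.append(str(sheet.cell(row_index, col_index).value))
        lines.append("\t".join(values))
    return lines


def print_workbook(path):
    """Open `path` with xlrd and print every sheet (requires xlrd)."""
    from xlrd import open_workbook  # imported lazily; third-party package

    book = open_workbook(path)
    for sheet in book.sheets():
        for line in sheet_lines(sheet):
            print(line)
```

Calling print_workbook('simple.xls') would print each sheet's name and then its rows, assuming the file exists and xlrd is installed.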

Unicode

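All text attributes produced by xlrd are either unicode objects or, in rare cases, ASCII strings.

Every piece of text entered in Microsoft Excel is stored in one of the following encodings:

Latin1, if the text fits
UTF_16_LE, if the text does not fit in Latin1
In older files, an encoding specified by an MS codepage. xlrd maps these to Python encodings, and the result is still unicode objects.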
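The "Latin1 if it fits, otherwise UTF_16_LE" rule can be illustrated with the standard library alone. The function excel_style_encode below is a hypothetical model of that rule, not code taken from xlrd or Excel.

```python
def excel_style_encode(text):
    """Encode `text` as Latin1 when possible, otherwise as UTF-16-LE.

    Returns a (codec_name, encoded_bytes) pair. This only models the rule
    described in the text; it is not how xlrd itself is implemented.
    """
    try:
        return "latin1", text.encode("latin1")
    except UnicodeEncodeError:
        return "utf_16_le", text.encode("utf_16_le")


for sample in (u"caf\u00e9", u"\u5de5\u4f5c\u7c3f"):
    codec, raw = excel_style_encode(sample)
    # Decoding with the same codec recovers the original text
    assert raw.decode(codec) == sample
    print(codec, len(raw))
```

The first sample fits in Latin1 and takes one byte per character; the second does not, so it falls back to two bytes per character.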

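In rare cases, other software writes Excel files with the wrong codepage or with none at all. In that case you may need to specify the correct encoding when calling open_workbook.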

from xlrd import open_workbook
# open_workbook takes the encoding through the encoding_override parameter
book = open_workbook('dodgy.xls', encoding_override='cp1252')

xlrd

http://pypi.python.org/pypi/xlrd

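Import
import xlrd

Open the Excel file
file = xlrd.open_workbook('demo.xls')

List the names of the sheets in the file
file.sheet_names()

Get the first worksheet, either by position, by index, or by name
sheet = file.sheets()[0]
sheet = file.sheet_by_index(0)
sheet = file.sheet_by_name(u'Sheet1')

Get the number of rows and columns
nrows = sheet.nrows
ncols = sheet.ncols

Loop over the rows, printing each row's values as a list
for rownum in range(sheet.nrows):
    print(sheet.row_values(rownum))

Get the values of a whole row or a whole column (as a list)
sheet.row_values(i)
sheet.col_values(i)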
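One common use of row_values is to treat the first row as a header and turn the remaining rows into dictionaries. The sketch below assumes that layout; rows_as_dicts is a hypothetical helper name, and it only needs the nrows attribute and the row_values method shown above.

```python
def rows_as_dicts(sheet):
    """Turn a sheet with a header row into a list of dictionaries.

    `sheet` only needs `nrows` and `row_values(rowx)`, as provided by
    xlrd.sheet.Sheet. The first row is assumed to hold the column names.
    """
    if sheet.nrows == 0:
        return []
    header = sheet.row_values(0)
    records = []
    for rownum in range(1, sheet.nrows):
        # Pair each column name with the value in the same position
        records.append(dict(zip(header, sheet.row_values(rownum))))
    return records
```

With a sheet obtained as above, rows_as_dicts(sheet) would return one dictionary per data row, keyed by the header cells.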

Cells (accessed by zero-based row index, then column index)
cell_A1 = sheet.cell(0,0).value
cell_C4 = sheet.cell(3,2).value

Using the row and column accessors separately
cell_A1 = sheet.row(0)[0].value
cell_A2 = sheet.col(0)[1].value
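Because xlrd indexes cells as (row, column) starting from zero, while Excel references such as C4 put the column letter first and count from one, it is easy to mix the two up. The following sketch converts between them; cell_ref_to_indices is a hypothetical helper written for this article, not an xlrd function.

```python
import re

# One or more column letters followed by a 1-based row number
_CELL_REF = re.compile(r"^([A-Za-z]+)([1-9][0-9]*)$")


def cell_ref_to_indices(ref):
    """Convert an A1-style reference to zero-based (row, col) indices."""
    match = _CELL_REF.match(ref)
    if match is None:
        raise ValueError("not an A1-style reference: %r" % (ref,))
    letters, digits = match.groups()
    col = 0
    for ch in letters.upper():
        # Column letters form a base-26 number with digits A=1 .. Z=26
        col = col * 26 + (ord(ch) - ord("A") + 1)
    return int(digits) - 1, col - 1


print(cell_ref_to_indices("A1"))  # (0, 0)
print(cell_ref_to_indices("C4"))  # (3, 2)
```

The result can be unpacked straight into a cell lookup, for example sheet.cell(*cell_ref_to_indices("C4")).value.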

xlwt

http://pypi.python.org/pypi/xlwt

Import xlwt

import xlwt

Create a new Excel file

file = xlwt.Workbook() # note that Workbook starts with a capital letter

Add a new sheet

sheet = file.add_sheet('sheet name')

Write data with sheet.write(row, col, value)

sheet.write(0,0,'test')

Writing to the same cell more than once raises an error:
# Exception: Attempt to overwrite cell:
# sheetname=u'sheet 1' rowx=0 colx=0

To avoid this, pass cell_overwrite_ok=True when adding the sheet

sheet = file.add_sheet('sheet name',cell_overwrite_ok=True)

Save the file

file.save('demo.xls')
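Putting the steps above together, a whole table can be written with two nested loops. This is a sketch rather than a definitive implementation: write_rows and save_table are hypothetical helper names, and the only xlwt calls used are Workbook, add_sheet, write and save, all shown above.

```python
def write_rows(sheet, rows, first_row=0):
    """Write a list of rows to `sheet`, one cell per value.

    `sheet` only needs a `write(row, col, value)` method, like the object
    returned by add_sheet. Returns the number of cells written.
    """
    count = 0
    for row_offset, row in enumerate(rows):
        for col_index, value in enumerate(row):
            sheet.write(first_row + row_offset, col_index, value)
            count += 1
    return count


def save_table(path, rows, sheet_name="Sheet1"):
    """Create a workbook with one sheet holding `rows` (requires xlwt)."""
    import xlwt  # imported lazily; third-party package

    book = xlwt.Workbook()
    sheet = book.add_sheet(sheet_name)
    write_rows(sheet, rows)
    book.save(path)
```

For example, save_table('demo.xls', [['name', 'qty'], ['apple', 3]]) would write a header row and one data row, assuming xlwt is installed.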

In addition, cells can be styled

style = xlwt.XFStyle() # create the style

font = xlwt.Font() # create a font for the style

font.name = 'Times New Roman'

font.bold = True

style.font = font # assign the font to the style

sheet.write(0, 0, 'some bold Times text', style) # write using the style
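xlwt also offers the easyxf shortcut, which builds a style from a description string instead of separate XFStyle and Font objects. The sketch below assumes the "font: name ..., bold on" string syntax from the xlwt README; font_spec and make_style are hypothetical helper names.

```python
def font_spec(name, bold=False):
    """Build an easyxf-style font description string."""
    spec = "font: name %s" % name
    if bold:
        spec += ", bold on"
    return spec


def make_style(name, bold=False):
    """Return an xlwt style for the given font (requires xlwt)."""
    import xlwt  # imported lazily; third-party package

    return xlwt.easyxf(font_spec(name, bold))
```

The returned style can be passed as the fourth argument to sheet.write, in the same way as the XFStyle object above.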

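xlwt lets you set formatting on individual cells or on whole rows. It can also add hyperlinks and formulas. The examples shipped with the source code are worth reading:

dates.py, which shows how to set different data formats

hyperlinks.py, which shows how to create hyperlinks (hint: you need to use a formula)

merged.py, which shows how to merge cells

row_styles.py, which shows how to apply a style to a whole row of cells.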
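As the hint for hyperlinks.py says, a hyperlink is written as a formula. The sketch below builds the text of a HYPERLINK formula and wraps it in xlwt.Formula. The semicolon argument separator follows my recollection of the hyperlinks.py example and should be checked against your xlwt version; both helper names are hypothetical.

```python
def hyperlink_formula_text(url, label):
    """Return the text of a HYPERLINK formula for `url` shown as `label`."""
    if '"' in url or '"' in label:
        # Quote escaping is deliberately left out of this sketch
        raise ValueError("double quotes are not handled in this sketch")
    return 'HYPERLINK("%s";"%s")' % (url, label)


def write_hyperlink(sheet, row, col, url, label):
    """Write a hyperlink cell to an xlwt sheet (requires xlwt)."""
    import xlwt  # imported lazily; third-party package

    sheet.write(row, col, xlwt.Formula(hyperlink_formula_text(url, label)))
```

For example, write_hyperlink(sheet, 0, 0, 'http://pypi.python.org/pypi/xlwt', 'xlwt') would place a clickable link in the first cell.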

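Working with large Excel files

If you are working with particularly large Excel files, there are two xlrd features you should know about:

Passing on_demand=True to open_workbook causes worksheets to be loaded into memory only when they are accessed.
The xlrd.Book object has an unload_sheet method that unloads a worksheet from memory, specified by either sheet index or sheet name.

The following example shows how a large workbook can be iterated over so that only sheets matching a certain pattern are inspected, and each of them is unloaded from memory once it has been read.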

from xlrd import open_workbook

book = open_workbook('simple.xls', on_demand=True)

for name in book.sheet_names():
    # Only sheets whose name ends with '2' are loaded
    if name.endswith('2'):
        sheet = book.sheet_by_name(name)
        print(sheet.cell_value(0, 0))
        # Release the sheet before moving on to the next one
        book.unload_sheet(name)
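The same pattern can be wrapped in a function so that a sheet is unloaded even if reading it fails. This is a sketch under the assumption that the book was opened with on_demand=True; first_cells is a hypothetical helper name, and it uses only sheet_names, sheet_by_name, cell_value and unload_sheet.

```python
def first_cells(book, wanted):
    """Collect cell (0, 0) from every sheet whose name satisfies `wanted`.

    `book` needs sheet_names(), sheet_by_name(name) and unload_sheet(name),
    as provided by xlrd.Book. Returns a {sheet_name: value} dictionary.
    Empty sheets are not handled here; reading cell (0, 0) from one raises.
    """
    values = {}
    for name in book.sheet_names():
        if not wanted(name):
            continue
        sheet = book.sheet_by_name(name)
        try:
            values[name] = sheet.cell_value(0, 0)
        finally:
            # Unload the sheet whether or not the read succeeded
            book.unload_sheet(name)
    return values
```

For the example above, the call would be first_cells(book, lambda name: name.endswith('2')).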

Reference: http://blog.sina.com.cn/s/blog_63f0cfb20100o617.html