pandas Exercises (3): Data Grouping

Date: 2019-12-11

Tags: pandas, exercises, data grouping


Exploring alcohol consumption data

The data for this exercise is available on GitHub.

Step 1 - Import the pandas library

import pandas as pd

Step 2 - Dataset

path3 = "./data/drinks.csv"      # drinks.csv

Step 3 - Name the DataFrame drinks

drinks = pd.read_csv(path3)
drinks.head()
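
One thing worth checking right after loading: by default, pandas parses the literal string NA as a missing value. If the continent column uses NA as the code for North America (an assumption about the file's contents, as are the column layout and the values in the sample below), that continent would silently drop out of every groupby result. A minimal sketch of the effect and one possible workaround:

```python
import io

import pandas as pd

# Hypothetical three-row sample; column names other than the ones used in
# this exercise, and all values, are illustrative assumptions.
SAMPLE_CSV = """country,beer_servings,spirit_servings,wine_servings,continent
Canada,240,122,100,NA
France,127,151,370,EU
Japan,77,202,16,AS
"""

# Default parsing: the string "NA" becomes NaN.
default_read = pd.read_csv(io.StringIO(SAMPLE_CSV))

# keep_default_na=False keeps "NA" as an ordinary string.
literal_read = pd.read_csv(io.StringIO(SAMPLE_CSV), keep_default_na=False)

print(default_read["continent"].isna().sum())
print(literal_read["continent"].tolist())
```

Note that keep_default_na=False disables the default missing-value markers for every column, so it only fits files that have no genuinely empty cells.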


Step 4 - Which continent consumes more beer on average?

beeravg = drinks.groupby('continent').beer_servings.mean()
beeravg.sort_values(ascending=False)
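
To see the group-then-rank pattern in isolation, here is a small sketch on a made-up DataFrame (the values are illustrative, not taken from drinks.csv). It also uses idxmax() to pull out the top group's label directly instead of reading it off the sorted output:

```python
import pandas as pd

# Hypothetical data; values are illustrative only.
sample = pd.DataFrame({
    "continent": ["EU", "EU", "AS", "AS", "AF"],
    "beer_servings": [300, 200, 50, 70, 40],
})

# Mean beer servings per continent.
beer_avg = sample.groupby("continent")["beer_servings"].mean()

# Sort from highest to lowest average.
ranked = beer_avg.sort_values(ascending=False)

# Label of the group with the highest average.
top_continent = beer_avg.idxmax()

print(ranked)
print(top_continent)
```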


Step 5 - Print the descriptive statistics of wine consumption (wine_servings) for each continent

drinks.groupby('continent').wine_servings.describe()
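
As a quick illustration of what describe() returns on a grouped column, the sketch below runs it on a made-up sample (values are assumptions for illustration). The result has one row per group and one column per statistic:

```python
import pandas as pd

# Hypothetical data; values are illustrative only.
sample = pd.DataFrame({
    "continent": ["EU", "EU", "EU", "AS", "AS"],
    "wine_servings": [100, 200, 300, 10, 30],
})

# One row per continent, one column per summary statistic.
stats = sample.groupby("continent")["wine_servings"].describe()

print(stats.columns.tolist())
print(stats.loc["EU", ["count", "mean", "50%"]])
```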


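Step 6 - Print the mean consumption of each type of alcohol for each continent

# numeric_only=True skips text columns, which recent pandas versions refuse to average
drinks.groupby('continent').mean(numeric_only=True)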
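
When the DataFrame contains a text column next to the numeric ones, there are two common ways to average only the numbers per group. The sketch below compares them on a made-up sample (the country column and all values are assumptions for illustration):

```python
import pandas as pd

# Hypothetical data with one text column besides the grouping key.
sample = pd.DataFrame({
    "country": ["A", "B", "C", "D"],
    "continent": ["EU", "EU", "AS", "AS"],
    "beer_servings": [100, 200, 10, 30],
    "wine_servings": [50, 150, 0, 20],
})

# Option 1: let pandas skip non-numeric columns.
means_flag = sample.groupby("continent").mean(numeric_only=True)

# Option 2: select the columns to average explicitly.
means_cols = sample.groupby("continent")[["beer_servings", "wine_servings"]].mean()

print(means_flag)
print(means_cols)
```

Selecting columns explicitly is more verbose, but it makes the output independent of whatever extra columns the file happens to contain.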


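Step 7 - Print the median consumption of each type of alcohol for each continent

# numeric_only=True skips text columns, which have no median
drinks.groupby('continent').median(numeric_only=True)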
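
The median is worth computing alongside the mean because it is far less sensitive to a single extreme value. A small sketch on made-up numbers (illustrative only), where one large observation pulls the mean of a group well above its median:

```python
import pandas as pd

# Hypothetical data; the EU group contains one outlier (300).
sample = pd.DataFrame({
    "continent": ["EU", "EU", "EU", "AS", "AS", "AS"],
    "wine_servings": [10, 20, 300, 1, 2, 3],
})

# Mean and median side by side for each continent.
summary = sample.groupby("continent")["wine_servings"].agg(["mean", "median"])

print(summary)
```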


Step 8 - Print the mean, maximum, and minimum spirit consumption for each continent

drinks.groupby('continent').spirit_servings.agg(['mean', 'min', 'max'])
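
Passing a list of function names to agg() yields columns called mean, min, and max. If more descriptive column names are wanted, named aggregation is one alternative. A sketch on a made-up sample (the values and the output column names spirit_mean, spirit_min, and spirit_max are assumptions chosen for illustration):

```python
import pandas as pd

# Hypothetical data; values are illustrative only.
sample = pd.DataFrame({
    "continent": ["EU", "EU", "AS", "AS"],
    "spirit_servings": [100, 200, 10, 50],
})

# Named aggregation: output_column=(input_column, function).
spirit_stats = sample.groupby("continent").agg(
    spirit_mean=("spirit_servings", "mean"),
    spirit_min=("spirit_servings", "min"),
    spirit_max=("spirit_servings", "max"),
)

print(spirit_stats)
```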


References:

1. http://pandas.pydata.org/pandas-docs/stable/cookbook.html#cookbook

2. https://www.analyticsvidhya.com/blog/2016/01/12-pandas-techniques-python-data-manipulation/

3. https://github.com/guipsamora/pandas_exercises
