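Hive classic example: for each user, find the highest monthly visit total up to and including the current month, and the cumulative visit total up to and including the current month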

1. Data preparation

The input data has three comma-separated fields per line: user name, month, visit count.

A,2015-01,5
A,2015-01,15
B,2015-01,5
A,2015-01,8
B,2015-01,25
A,2015-01,5
A,2015-02,4
A,2015-02,6
B,2015-02,10
B,2015-02,5
A,2015-03,16
A,2015-03,22
B,2015-03,23
B,2015-03,10
B,2015-03,1

2. Requirement analysis

(1) Create the table

create table requirement(
    name string,
    month string,
    num int
)
row format delimited fields terminated by ',';

(2) Load the data

load data local inpath '/usr/mydir/data/requirement.txt' into table requirement;

(3) Verify that the data loaded correctly

select * from requirement;

(4) Query the data

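-- Inner query: total visits per user per month.
-- Outer query: running maximum and running sum of those monthly totals, per user.
SELECT
    name,
    month,
    month_sum,
    MAX(month_sum) OVER (PARTITION BY name ORDER BY month) AS max_visits,
    SUM(month_sum) OVER (PARTITION BY name ORDER BY month) AS total_visits
FROM (
    SELECT name, month, SUM(num) AS month_sum
    FROM requirement
    GROUP BY name, month
) t1;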
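
The query returns running values because of Hive's default window frame. When a window has ORDER BY but no explicit frame clause, the frame defaults to RANGE BETWEEN UNBOUNDED PRECEDING AND CURRENT ROW, so each row aggregates everything from the user's first month up to the current one. The Python sketch below is an illustration only, not part of the Hive job. It mimics that behavior for user A, using monthly totals worked out by hand from the sample data.

```python
from itertools import accumulate

# Monthly totals for user A, derived from the sample data:
# 2015-01: 5+15+8+5 = 33, 2015-02: 4+6 = 10, 2015-03: 16+22 = 38
month_sum = [33, 10, 38]

# Each position aggregates all values from the first month through the current one,
# which is what the default frame does for MAX(...) OVER and SUM(...) OVER.
running_max = list(accumulate(month_sum, max))
running_sum = list(accumulate(month_sum))

print(running_max)  # [33, 33, 38]
print(running_sum)  # [33, 43, 81]
```

February's own total is only 10, but its running maximum stays at 33 because January is still inside the frame.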

3. Result
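
As a cross-check that does not need a Hive cluster, the expected rows can be recomputed from the sample data with a short Python sketch. The names RAW, monthly_totals and running_stats are hypothetical helpers introduced here for illustration. The code mirrors the query's two steps (group by user and month, then accumulate per user) and is not a replacement for it.

```python
from collections import defaultdict

# Sample data from section 1: user name, month, visit count.
RAW = """A,2015-01,5
A,2015-01,15
B,2015-01,5
A,2015-01,8
B,2015-01,25
A,2015-01,5
A,2015-02,4
A,2015-02,6
B,2015-02,10
B,2015-02,5
A,2015-03,16
A,2015-03,22
B,2015-03,23
B,2015-03,10
B,2015-03,1
"""

def monthly_totals(raw):
    """Step 1: sum visits per (name, month), like the inner GROUP BY."""
    totals = defaultdict(int)
    for line in raw.strip().splitlines():
        name, month, num = line.split(",")
        totals[(name, month)] += int(num)
    return totals

def running_stats(raw):
    """Step 2: per user, in month order, track the running max and running sum."""
    rows, max_so_far, sum_so_far = [], {}, {}
    for (name, month), month_sum in sorted(monthly_totals(raw).items()):
        max_so_far[name] = max(max_so_far.get(name, month_sum), month_sum)
        sum_so_far[name] = sum_so_far.get(name, 0) + month_sum
        rows.append((name, month, month_sum, max_so_far[name], sum_so_far[name]))
    return rows

# Columns: name, month, monthly total, running maximum, running total.
for row in running_stats(RAW):
    print(*row, sep="\t")
```

For the sample data this prints:

A	2015-01	33	33	33
A	2015-02	10	33	43
A	2015-03	38	38	81
B	2015-01	30	30	30
B	2015-02	15	30	45
B	2015-03	34	34	79

Hive should return the same six rows, although their order is not guaranteed unless the outer query adds its own ORDER BY.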
