一、hive的分组和组内排序---语法spa
语法:
row_number() over (partition by 字段a order by 计算项b desc ) rankcode
- rank是排序的别名
- partition by:相似hive的建表,分区的意思;
- order by :排序,默认是升序,加desc降序;
- 这里按字段a分区,对计算项b进行降序排序
二、hive的分组和组内排序 --- 实例
要取top10品牌,各品牌的top10渠道,各品牌的top10渠道中各渠道的top10档期blog
一、取top10品牌排序
select “品牌” , sum/count/其余() as num from "table_name" order by num desc limit 10;ci
二、取top10品牌下各品牌的top10渠道it
select a.* from (select "品牌","渠道",sum/count() as num, row_number () over (partition by "品牌" order by num desc) rank from “table_name” where 品牌限制条件 group by “品牌”,“渠道” ) a having a.rank <= 10;io
三、 取top10品牌下各品牌的top10渠道中各渠道的top10档期table
select a.* from (select "品牌","渠道","档期",sum/count/其余() as num row_number() over (partition by "档期" order by num desc) rank from "table_name" where 品牌限制条件 group by “品牌”,“渠道) a Having a.rank <= 10;class
SELECT v.visitor_phone,v.city,v.bigarea, row_number()over (partition by visitor_phone order by visitor_phone desc) rank from visitor v WHERE 1=1 and visitor_name in('蒋凤','周金魁') group BY v.visitor_phone,v.city
---------------------------------
18222666666 北京 华北区 1
18222666666 天津 华北区 2
13402777777 北京 华北区 1
13402777777 成都 中西部 2
取编号为1的select
SELECT a.* from( SELECT v.visitor_phone,v.city,v.bigarea, row_number()over (partition by visitor_phone order by visitor_phone desc) rank from visitor v WHERE 1=1 and visitor_name in('蒋凤','周金魁') group BY v.visitor_phone,v.city )a where a.rank=1;-------------------------------------18222666666 北京 华北区 113402777777 北京 华北区 1