Using GROUP BY with GROUPING SETS, WITH CUBE, and WITH ROLLUP in Hive SQL or Spark SQL

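GROUPING SETS: aggregates over several dimension combinations in a single query. The result is equivalent to running a separate GROUP BY for each combination and joining the result sets with UNION ALL; columns that are not part of a given combination come back as NULL.
Data preparation:

Table creation statement: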
create table tmp.gb(
a string,
b string,
c int
)row format delimited fields terminated by '\t' stored as textfile;
Sample data (columns a, b, c; tab-separated in the data file):
1 1 1
2 1 2
2 2 2
2 2 3
2 1 2
1 2 2
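
The post does not show how the sample rows get into the table. One way to do it, as a minimal sketch, is an INSERT ... VALUES statement. This assumes the tmp database and the tmp.gb table above already exist and that your engine supports INSERT ... VALUES (Spark SQL does, and so do recent Hive versions); loading a tab-separated file with LOAD DATA would work equally well.

```sql
-- Assumption: tmp.gb was created with the statement above.
-- a and b are strings, c is an int, matching the table definition.
insert into table tmp.gb values
  ('1', '1', 1),
  ('2', '1', 2),
  ('2', '2', 2),
  ('2', '2', 3),
  ('2', '1', 2),
  ('1', '2', 2);

-- Quick check that all six rows were loaded.
select count(*) from tmp.gb;
```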

Usage examples:

First combination:
select a,b,sum(c) from tmp.gb group by a,b grouping sets(a);
1 NULL 3
2 NULL 9
Second combination:
select a,b,sum(c) from tmp.gb group by a,b grouping sets(b);
NULL 1 5
NULL 2 7
Third combination (two separate grouping sets, a and b):
select a,b,sum(c) from tmp.gb group by a,b grouping sets(a,b);
NULL 1 5
NULL 2 7
1 NULL 3
2 NULL 9
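
To make the UNION ALL equivalence concrete, the third combination can be rewritten by hand as two ordinary GROUP BY queries. This is only a sketch of the equivalence, not how the engine executes GROUPING SETS internally; the explicit cast on NULL is there so both branches of the union have matching string columns.

```sql
-- Hand-written equivalent of: group by a,b grouping sets(a,b)
-- Branch 1: aggregate by a only, b is filled with NULL.
select a, cast(null as string) as b, sum(c) as total
from tmp.gb
group by a
union all
-- Branch 2: aggregate by b only, a is filled with NULL.
select cast(null as string) as a, b, sum(c) as total
from tmp.gb
group by b;
```

This should return the same four rows as the grouping sets(a,b) query above, though the row order is not guaranteed. The GROUPING SETS form only needs to mention the table once, which is the main reason to prefer it.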
Fourth combination (one grouping set containing both a and b):
select a,b,sum(c) from tmp.gb group by a,b grouping sets((a,b));
1 1 1
1 2 2
2 1 4
2 2 5
Fifth combination:
select a,b,sum(c) from tmp.gb group by a,b
