hive函数集合

1:CAST (expression AS data_type)

CAST函数用于将某种数据类型的表达式显式转换为另一种数据类型。CAST()函数的参数是一个表达式,它包括用AS关键字分隔的源值和目标数据类型。可以转换的类型是有限制的。

这个类型可以是以下值其中的一个:二进制,同带binary前缀的效果 : BINARY    

字符型,可带参数 : CHAR()     

日期 : DATE     

时间: TIME     

日期时间型 : DATETIME     

浮点数 : DECIMAL      

整数 : SIGNED     

无符号整数 : UNSIGNED


2:binary

Mysql默认查询是不分大小写的,可以在SQL语句中加入 binary来区分大小写;BINARY不是函数,是类型转换运算符,它用来强制它后面的字符串为一个二进制字符串,可以理解为在字符串比较的时候区分大小写

例如 :select * from sys_user where binary loginno = `qqq`


3:explode

explode就是将sql或者hive一行中复杂的array或者map结构拆分成多行。lateral view(侧视图)的意义是配合explode(或者其他的UDTF),一个语句生成把单行数据拆解成多行后的数据结果集。形成的虚拟表tb1 相当于再和原表联结 形成了新表interests1直接使用explode 一次只能生成一个字段 要生成多列就是使用lateral view(侧视图)

drop table explode_lateral_view;

create table explode_lateral_view

(`area` string,

`goods_id` string,

`sale_info` string)

ROW FORMAT DELIMITED

FIELDS TERMINATED BY '|'

STORED AS textfile;

导入数据:

INSERT INTO management_cc.explode_lateral_view VALUES('a:shandong,b:beijing,c:hebei','1,2,3,4,5,6,7,8,9','{"source":"7fresh","monthSales":4900,"userCount":1900,"score":"9.9"},{"source":"jd","monthSales":2090,"userCount":78981,"score":"9.8"},{"source":"jdmart","monthSales":6987,"userCount":1600,"score":"9.0"}]');

表内数据如下

select explode(split(goods_id,',')) as goods_id from explode_lateral_view;

select explode(split(area,',')) as area from explode_lateral_view;

select explode(split(regexp_replace(regexp_replace(sale_info,'\\[\\{',''),'}]',''),'},\\{')) as  sale_info from explode_lateral_view;

然后我们想用get_json_object来获取key为monthSales的数据:

select get_json_object(explode(split(regexp_replace(regexp_replace(sale_info,'\\[\\{',''),'}]',''),'},\\{')),'$.monthSales') as  sale_info from explode_lateral_view;

然后挂了FAILED: SemanticException [Error 10081]: UDTF's are not supported outside the SELECT clause, nor nested in expressions

UDTF explode不能写在别的函数内

如果你这么写,想查两个字段,select explode(split(area,',')) as area,good_id from explode_lateral_view;

会报错FAILED: SemanticException 1:40 Only a single expression in the SELECT clause is supported with UDTF's. Error encountered near token 'good_id'

使用UDTF的时候,只支持一个字段,这时候就需要LATERAL VIEW出场了

select goods_id2,sale_info from explode_lateral_view LATERAL VIEW explode(split(goods_id,','))goods as goods_id2;

select goods_id2,sale_info,area2

from explode_lateral_view

LATERAL VIEW explode(split(goods_id,','))goods as goods_id2

LATERAL VIEW explode(split(area,','))area as area2;



select

get_json_object(concat('{',sale_info_r,'}'),'$.source') as source,

get_json_object(concat('{',sale_info_r,'}'),'$.monthSales') as monthSales,

get_json_object(concat('{',sale_info_r,'}'),'$.userCount') as userCount,

get_json_object(concat('{',sale_info_r,'}'),'$.score') as score

from explode_lateral_view

LATERAL VIEW explode(split(

regexp_replace(

regexp_replace(sale_info,'\\{',''),

'}]',

''

),'},'))sale_info as sale_info_r;


你可能感兴趣的:(hive函数集合)