HIVE解析JSON数组

HIVE解析JSON数组

数据示例:
[{“payAmount”:“375000”,“payChannelCode”:“BOC”},{“payAmount”:“376000”,“payChannelCode”:“AOC”}]

1.get_json_object函数提取json数组里面特定字段值

1.1 get_json_object可以提取json数组指标位置的值(跟数组一样)

select
get_json_object('[{"payAmount":"375000","payChannelCode":"BOC"},{"payAmount":"376000","payChannelCode":"AOC"}]','$[0]')

结果
{“payAmount”:“375000”,“payChannelCode”:“BOC”}

1.2 get_json_object数组可以用*来表示所有值(提取的结果是一个数组形式的字符串)

select
get_json_object('[{"payAmount":"375000","payChannelCode":"BOC"},{"payAmount":"376000","payChannelCode":"AOC"}]','$[*].payChannelCode')

结果
[“BOC”,“AOC”]

1.3 用字符串替换函数剔除掉"[]等无用的字符(注意转译符)

select
regexp_replace(
 get_json_object('[{"payAmount":"375000","payChannelCode":"BOC"},{"payAmount":"376000","payChannelCode":"AOC"}]','$[*].payChannelCode'),
 '\\[|\\]|\"','')

结果
BOC,AOC

2.使用regexp_replace替换掉[]同时将数组之间的分割符’,‘改为’;’
2.1 剔换掉[]

select
regexp_replace('[{"payAmount":"375000","payChannelCode":"BOC"},{"payAmount":"376000"}]','\\[|\\]','')

结果
{“payAmount”:“375000”,“payChannelCode”:“BOC”},{“payAmount”:“376000”}
2.2 将数组},{替换成};{这样就可以用split函数切割成数组(替换的字符不能在数组值内出现)

select
regexp_replace(regexp_replace('[{"payAmount":"375000","payChannelCode":"BOC"},{"payAmount":"376000"}]','\\[|\\]',''),'\\}\,\\{','\\}\;\\{')

结果
{“payAmount”:“375000”,“payChannelCode”:“BOC”};{“payAmount”:“376000”}

3.使用regexp_replace替换掉[{和}]和",然后用分割符为},{来切分字符串
3.1 替换掉[{和}]和"

select
 regexp_replace('[{"payAmount":"375000","payChannelCode":"BOC"},{"payAmount":"376000"}]','\\[\\{|\\}\\]|\"','')

结果
payAmount:375000,payChannelCode:BOC},{payAmount:376000

3.2 用分割符为},{来切分字符串

select
split(
 regexp_replace('[{"payAmount":"375000","payChannelCode":"BOC"},{"payAmount":"376000"}]','\\[\\{|\\}\\]|\"',''),
 '\\}\,\\{')

结果
[“payAmount:375000,payChannelCode:BOC”,“payAmount:376000”]
不过列转行后要用str_to_map将数组值转化为一个map,这样方便提取数组内的值

你可能感兴趣的:(hive,json,hive,字符串)