数仓项目实战05:ADS数据应用层

5. ADS应用数据层

5.1 流量类指标_平台统计表【ADS】

建库:

create database ads_nshop;
use ads_nshop;

建表:

CREATE external TABLE
IF NOT EXISTS ads_nshop.ads_nshop_platform_flow_stat (
customer_gender TINYINT COMMENT '性别:1男 0女',
age_range string COMMENT '年龄段',
customer_natives string COMMENT '所在地区',
visit_avg_duration INT COMMENT '人均页面访问时长',
visit_avg_counts INT COMMENT '人均页面访问数'
) partitioned BY (bdp_day string) stored AS parquet location '/data/nshop/ads/operation/ads_nshop_platform_flow_stat/'

1.从ods_02_customer 表统计用户信息
2.从dwd_nshop_actlog_pdtview 用户浏览信息表查出用户浏览时长和浏览用户
3.从dws_nshop_ulog_view 用户浏览表查出访问次数和访问用户
4.用总的停留时长除以去重后的用户数量就是平均访问时长,用总访问次数除以去重后的用户数就是平均访问次数。
统计sql如下:

insert overwrite table ads_nshop.ads_nshop_platform_flow_stat partition(bdp_day='20200618')
select 
c.customer_gender,
c.customer_age_range,
c.customer_natives,
sum(pv.duration)/count(dis

你可能感兴趣的:(数仓)