##[PDF]Atlas和Ranger进行数据治理

【PDF大放送】Spark&Hadoop Summit精选分享PDF合集-博客-云栖社区-阿里云 https://yq.aliyun.com/articles/72207?spm=5176.100239.blogcont71098.13.Kt7Srt

【Hadoop Summit Tokyo 2016】

//
Apache Atlas: Introduction
Metadata Repository
• Flexible type system to capture schema/metadata of multiple components
• Out-of-box models for Hive, HDFS, Storm, Falcon, Sqoop
Data Lineage/Provenance
• Captures data lineage across components
Classification
• Use tags to classify the data – like PII, PHI, PCI, EXPIRES_ON
• Support for attributes in tags – like expiry_date
Search
• Search using classifications, attributes
• Advanced search using DSL; convenient full-text search
Integrations
• With Apache Hive, Apache Storm, Apache Falcon, Apache Sqoop for metadata and lineage
• With Apache Ranger for classification based security
APIs to add support for more components

//
Apache Atlas: Lineage


##[PDF]Atlas和Ranger进行数据治理_第1张图片
Paste_Image.png

//


##[PDF]Atlas和Ranger进行数据治理_第2张图片
Paste_Image.png

【Hadoop Summit Tokyo 2016】企业数据分类和治理

##[PDF]Atlas和Ranger进行数据治理_第3张图片
Paste_Image.png
##[PDF]Atlas和Ranger进行数据治理_第4张图片
Paste_Image.png
##[PDF]Atlas和Ranger进行数据治理_第5张图片
Paste_Image.png
##[PDF]Atlas和Ranger进行数据治理_第6张图片
Paste_Image.png
##[PDF]Atlas和Ranger进行数据治理_第7张图片
Paste_Image.png
##[PDF]Atlas和Ranger进行数据治理_第8张图片
Paste_Image.png

你可能感兴趣的:(##[PDF]Atlas和Ranger进行数据治理)