As the volume and variety of data increases, the challenges of good data governance are likely to become more difficult. Digital transformation technologies have resulted in new data sources. How do users know what data is available? How do administrators manage data when they might not know what type of data exists and where it’s stored? Does the data contain sensitive or personal information?
Writing metadata descriptions for data sources is often a wasted effort. Client applications typically ignore descriptions that are stored in the data source. Creating documentation for data sources is difficult because you must keep documentation in sync with data sources. Users also might not trust documentation that they think is out of date.
Without the ability to track data from end to end, you must spend time tracing problems created by data pipelines that other teams own. If you make changes to your datasets, you can accidentally affect related reports that are business or mission critical.
Azure Purview is a data governance tool offering a unified, cloud-based data discovery, management, and protection approach. It helps organizations manage and govern their on-premises, multi-cloud, and software as a service (SaaS) data. The primary purpose of Azure Purview is to provide a comprehensive understanding of an organization’s data landscape through data discovery, classification, and lineage tracking.
Since its release in December 2020, Azure Purview has been evolving with new features and integrations to enhance its data governance capabilities.
Microsoft Purview is designed to address these issues and help enterprises get the most value from their existing information assets. Its catalog makes data sources easy to discover and understand by the users who manage the data.
I am going to introduce the Azure Purview from scratch using the load map.
- Day 1: Azure Purview brief Introduction
- Day 2: Quick start, what is inside Azure Purview
- Day 3: How Microsoft Purview works – Data Source, Rule Sets, and Classification
- Day 4: Registering ADLS Gen2 and Scan in Purview
- Day 5: Registering Azure SQL Database and Scan in Purview
- Day 6: Registering Azure Synapse Analytics workspaces and scan in Microsoft Purview
- Day 7: Permission and Roles, Business Glossary in Purview
- Day 8: Data Lineage – Extract SQL ADF and Synapse Pipeline Lineage
- Day 9: Managed attributes in Data Map
Next step: Day 2: Quick start, what is inside Azure Purview
Please do not hesitate to contact me if you have any questions at William . chen @ mainri.ca
(remove all space from the email account 😊)