109 строки
6.7 KiB
Markdown
109 строки
6.7 KiB
Markdown
|
---
|
||
|
type: post
|
||
|
title: "Tip 332 - How to get started with Azure Purview"
|
||
|
excerpt: "Learn how to get started with Azure Purview"
|
||
|
tags: [Databases]
|
||
|
share: true
|
||
|
date: 2021-9-8 05:00:00
|
||
|
---
|
||
|
|
||
|
::: tip
|
||
|
|
||
|
:fire: Make sure you [star the repo](https://github.com/microsoft/azuretipsandtricks) to keep up to date with new tips and tricks.
|
||
|
|
||
|
:bulb: Learn more : [Azure Purview Overview](https://docs.microsoft.com/azure/purview/overview/?WT.mc_id=docs-azuredevtips-azureappsdev).
|
||
|
|
||
|
:tv: Watch the video : [How to get started with Azure Purview](https://youtu.be/Kteh9cXkHIE?WT.mc_id=youtube-azuredevtips-azureappsdev).
|
||
|
|
||
|
:::
|
||
|
|
||
|
### How to get started with Azure Purview
|
||
|
|
||
|
#### Keep track of your data landscape
|
||
|
It is hard to keep track of all your data. You want to know where data is, and if it contains sensitive information. And you want to know where you data is coming from and how it is being transformed and by whom. [Azure Purview](https://docs.microsoft.com/azure/purview/overview/?WT.mc_id=docs-azuredevtips-azureappsdev) enables you to manage your data, wherever it is stored, including on-premises and in other clouds. Purview can automatically discover data and classify it, so that you know what is in it.
|
||
|
|
||
|
In this post, we'll get started with [Azure Purview](https://docs.microsoft.com/azure/purview/overview/?WT.mc_id=docs-azuredevtips-azureappsdev) by creating a new Azure Purview account, and coupling and scanning a data source.
|
||
|
|
||
|
#### Prerequisites
|
||
|
If you want to follow along, you'll need the following:
|
||
|
* An Azure subscription (If you don't have an Azure subscription, create a [free account](https://azure.microsoft.com/free/?WT.mc_id=azure-azuredevtips-azureappsdev) before you begin)
|
||
|
* An [Azure SQL Database](https://azure.microsoft.com/services/sql-database/?WT.mc_id=azure-azuredevtips-azureappsdev) in Azure. You can follow [this quickstart](https://docs.microsoft.com/azure/azure-sql/database/single-database-create-quickstart?WT.mc_id=azure-azuredevtips-azureappsdev) to create one
|
||
|
|
||
|
#### Create and use an Azure Purview account
|
||
|
Let's start by creating an Azure Purview account in the Azure portal.
|
||
|
|
||
|
1. Go to the [Azure portal](https://portal.azure.com/?WT.mc_id=azure-azuredevtips-azureappsdev)
|
||
|
2. Click the **Create a resource** button (the plus-sign in the top left corner)
|
||
|
3. Search for **purview**, select the "Azure Purview" result and click **Create**. This shows the **Create Purview Account** blade
|
||
|
1. Select a **Resource Group**
|
||
|
2. Give the Purview account a **Name**
|
||
|
3. Pick a **location**
|
||
|
4. Select **Next: Networking**
|
||
|
|
||
|
<img :src="$withBase('/files/116create0.png')">
|
||
|
|
||
|
(Create an Azure Purview account in the Azure portal)
|
||
|
|
||
|
5. You can connect to Azure Purview with a [Private endpoint](https://docs.microsoft.com/azure/private-link/private-endpoint-overview?WT.mc_id=azure-azuredevtips-azureappsdev) to make it more secure. Leave it set to **All networks** and select **Next: Configuration**
|
||
|
6. In the configuration blade, you can manage the platform size and catalog capabilities of the Purview account. Leave the **Platform size** to **4 capacity units** and select **Review + Create** and then **Create**
|
||
|
|
||
|
<img :src="$withBase('/files/116create1.png')" width="75%">
|
||
|
|
||
|
(Configure Platform size of the Purview account)
|
||
|
|
||
|
When the Purview account is created, navigate to it in the Azure portal. From there, you can configure access to the account and networking settings. And to use Purview, you need to open **Purview Studio**. You can open that from the **Overview blade** of the Purview account.
|
||
|
|
||
|
<img :src="$withBase('/files/116purviewportal.png')" width="75%">
|
||
|
|
||
|
(Azure Purview Studio)
|
||
|
|
||
|
You can use the Purview account to track and manage your data, even when it is on-premises or in other clouds. To get started, you need to register a data source.
|
||
|
|
||
|
1. In Purview Studio, select the **Sources menu** from the left menu. This shows the data sources that are registered with Purview
|
||
|
2. Let's register the first data source. Click **Register** to get started
|
||
|
|
||
|
<img :src="$withBase('/files/116register.png')" width="75%">
|
||
|
|
||
|
(Register a data source in Purview)
|
||
|
|
||
|
3. You can register all sorts of data sources in Purview. **Select Azure SQL Database** and click **Continue**
|
||
|
1. Select your **Azure subscription**
|
||
|
2. Pick the **Azure SQL Database server** that contains the database you want to register
|
||
|
3. Click **Register**
|
||
|
4. Now that the data source is registered, you can start scanning its data. In the menu of the data source, click **New scan**
|
||
|
|
||
|
<img :src="$withBase('/files/116purviewportal2.png')" width="90%">
|
||
|
|
||
|
(Create a new data scan in Purview Studio)
|
||
|
|
||
|
1. Fill in a name for the scan
|
||
|
2. Select the **database** that you want to scan
|
||
|
3. Leave the **Credential setting** as it is. And click **See more** to see the name of the Purview identity that needs to access the database
|
||
|
4. We need to grant Purview access to the database. Copy the **Managed Identity Name** of the Purview credential
|
||
|
5. Go to the Azure portal and navigate to the **Azure SQL Database Server**
|
||
|
6. In there, navigate to the **Active Directory admin setting**
|
||
|
7. Click **Set admin**
|
||
|
8. Paste the **Managed Identity Name** in the search bar
|
||
|
9. Click **Select**
|
||
|
10. And click **Save** to grant the Purview identity access to the database server
|
||
|
|
||
|
<img :src="$withBase('/files/116grantaccess.png')">
|
||
|
|
||
|
(grant access to Purview)
|
||
|
|
||
|
11. Go back to Purview Studio and click **Test connection** in the Create Scan blade. This should result in a **"Connection successful"** message
|
||
|
12. Click **Continue**
|
||
|
13. Select the table(s) of the database to scan and click **Continue**
|
||
|
14. Leave the rule set as it is. You can take a look at its details to see what this rule set will scan for. The default AzureSqlDatabase rule set scans for many things, including credit card numbers and dates of birth. Click **Continue**
|
||
|
15. You can scan this data source on a schedule. For instance, every week. We'll do it once. Select **Once** and click **Continue**
|
||
|
16. Click **Save and Run** to start the scan
|
||
|
|
||
|
When you click on the **View details** link of the data source, you can see if the scan is complete. When it is, you'll also see the results of the scan. It could be that it classified some data, like credit card numbers. Now, you can analyze this information further in an analytics engine like [Microsoft Power BI](https://powerbi.microsoft.com/?WT.mc_id=microsoft-azuredevtips-azureappsdev) and see insights in Purview Studio.
|
||
|
|
||
|
<img :src="$withBase('/files/116result.png')">
|
||
|
|
||
|
(Scan results in Purview Studio)
|
||
|
|
||
|
#### Conclusion
|
||
|
[Azure Purview](https://docs.microsoft.com/azure/purview/overview/?WT.mc_id=docs-azuredevtips-azureappsdev) is your main entry point for managing and analyzing all of your data, regardless of where it is stored. Go and check it out!
|