Data Sharing in Microsoft Purview

by Mar 9, 2023

In today’s world, data is the key to success for businesses. The more data a business has, the better it can make decisions and stay ahead of its competitors. However, data is not always easy to come by, and many businesses struggle with finding and accessing the data they need. This is where Microsoft Purview comes in.

Benefits

There are several benefits for data sharing in Microsoft Purview:

  • Safe time and resources from the business
  • Share data, Businesses can control who has access to their data
  • Secure, Businesses can control who has access to their data

Data sharing scenarios

Microsoft Purview Data Sharing can help with various data sharing scenarios, including:

  • Collaborate with external business partners while maintaining data security in your own environment.
  • Outsource data transformation and processing to third party ISVs or data aggregators by sharing raw data and receiving normalized data and analytics results back.
  • Automate sharing of big data (for example: IoT data, scientific data, satellite and surveillance images or videos, financial market data) in near real time and without data duplication.
  • Share data between different departments within the organization.

In place Data Sharing/Receiving in Microsoft Purview

Currently in Preview

Requirements:

Or do we need to say current limitations:

  • Supported Azure Regions: Canada Central, Canada East, UK South, UK West, Australia East, Japan East, Korea South, and South Africa North
  • Performance: Standard
  • Redundancy Options:  LRS, GRS, RA-GRS
  • Storage Accounts: ADLS Gen2 or Blob Storage accounts
  • Source and Target storage account must be in the same region, this can be different from your Purview Account

Before we can start we need to register the AllowDataSharing feature on the subscription.

AllowDataSharing

Attention

Only storage accounts registered after registration will work. If you did the registration after the storage account creation you will receive the following error message upon creation of the data share:

Failed to attach

Create Share

To create a Data Share you must have the Microsoft Purview Collection Role, Data Share Contributors, assigned.

The first time you will start using Data Sharing and you’re a Guest user,  your account must first be associated with the Azure Active Directory.

Verification Guest user

You will receive a email with code, copy/paste the code before you continue.

Verify emailaddress

To enable a data Share, select the Azure Storage or Azure Data Lake Storage (ADLS) Gen 2 data asset you would like to share data from.

Select ADLS or Storage account

Click on Data Share.

Create a New Share. Specify a name and a description of share contents (optional). Then select Continue.

Create new share

Search for the assets you want to Share and specify the Share name.

Data assets selection

Add the Recipient, in this situation I’ve selected a user in a different tenant but you can also select in the same tenant, same subscription or different subscription.

Recipients

Add the Recipient, in this situation I’ve selected a user in a different tenant and defined an expiration date of the share, the share will be terminated on this date.

Click on Create and Share, adding more users can be easily done afterwards.

The Share is created, The recipients of your share will receive an invitation and they can view the pending share in their Microsoft Purview account.

In Purview you will have an overview of all Shares you created:

Share overview

Receive Share

Now we have setup the Sent Share, we’re ready to receive the data. In this situation, it will be a different Purview account in a different Tenant.

All invites which have been shared and not have attached can be found in the Share Invites tab.

Share overview

A notification will also be send, that a new Invite has been received.

Share Invite

Click on Receive Share to attached to the correct Storage Account, make sure that the AllowDataSharing feature on the Azure Subscription has been registered, otherwise you will receive the message below.

Attache Share with error

Select the Storage account where you want to receive data or create a new storage.

Attach Share

  • Received share name: Leave as is or change it as you like it.
  • Path: New or existing container in Storage Account
  • Folder: The Folder where you want to receive the data

Attach the target to continue. When the storage account is attached you will see that on the Received Share overview.

Attached state

You can now access the shared data in your storage account.

In the Purview account where we create the data Share we can also see that the data is attached.

Overview of attached data shares

Great to know, in the receive share the data is read-only, updated data in the sent share will be synced in real-time to the receive share.

In my next blog I will explain, how Microsoft Purview Data Sharing Lineage will work, just as a quick teaser. Have look in you data Assets, you will now find a new data asset:

Azure Active Directory Asset Purview

Conclusion

Data sharing is a crucial component of modern business, and Microsoft Purview makes it easy and secure. By sharing data within and across organizations, businesses can improve collaboration, save time and resources, and stay ahead of their competitors. If you’re interested in learning more about Microsoft Purview and its data sharing capabilities, be sure to check it out!

If you want to know more on Data Share Lineage in Microsoft Purview you read my blog on that topic.

Like always, if you have any questions leave them in the comments.

Documentation Links as reference:

How to share data in Microsoft Purview

How to receive shared data in Microsoft Purview

Microsoft Purview Data Sharing FAQ

Feel free to leave a comment

0 Comments

Submit a Comment

Your email address will not be published. Required fields are marked *

two × four =

This site uses Akismet to reduce spam. Learn how your comment data is processed.

How to setup Code Repository in Azure Data Factory

Why activate a Git Configuration? The main reasons are: Source Control: Ensures that all your changes are saved and traceable, but also that you can easily go back to a previous version in case of a bug. Continuous Integration and Continuous Delivery (CI/CD): Allows...

Azure Synapse Analytics Power BI Integration

Creating a Linked Service for Power BI Open your Synapse Studio and select the Management Hub. Add a new Linked Service If you haven't connect to Power BI before, you will see the screen above. If you want to add another Power BI Linked Service(Workspace). Search for...

Azure Data Factory Let’s get started

Creating an Azure Data Factory Instance, let's get started Many blogs nowadays are about which functionalities we can use within Azure Data Factory. But how do we create an Azure Data Factory instance in Azure for the first time and what should you take into account? ...

Data Sharing Lineage in Microsoft Purview

In my previous blog post, I wrote how you can share data within your organization or across organizations. Now it's time to have a look how the lineage will look like. In this article I will explain the Microsoft Purview Data Sharing Lineage and not the Lineage for...

How to create a Azure Synapse Analytics Workspace

Creating your Azure Synapse Analytics Workspace In the article below I would like to take you through,  how you can configure an Azure Synapse Workspace and not the already existing Azure Synapse Analytics SQL Pool(formerly Azure SQL DW): In de Azure Portal search for...

Provision users and groups from AAD to Azure Databricks (part 4)

Assign Users and groups to Azure Databricks Workspace In the previous blog, you created the metastore in your Azure Databricks account to assign an Azure Databricks Workspace. In this blog, you will learn how to assign Users and Groups to an Azure Databricks Workspace...

Azure Data Factory and Azure Synapse Analytics Naming Conventions

Naming Conventions More and more projects are using Azure Data Factory and Azure Synapse Analytics, the more important it is to apply a correct and standard naming convention. When using standard naming conventions you create recognizable results across different...

SSMS 18.xx: Creating your Azure Data Factory SSIS IR directly in SSMS

Creating your Azure Data Factory(ADF) SSIS IR in SSMS Since  version 18.0 we could see our Integration Catalog on Azure Instances directly. Yesterday I wrote an article how to Schedule your SSIS Packages in ADF, during writing that article I found out that you can...

Azure Synapse Pause and Resume SQL Pool

Pause or Resume your Dedicated SQL Pool in Azure Synapse Analytics Azure Synapse Analytics went GA in beginning of December 2020, with Azure Synapse we can now also create a Dedicated SQL Pool(formerly Azure SQL DW). Please read this document to learn what a Dedicated...

Create an Azure Synapse Analytics Apache Spark Pool

Adding a new Apache Spark Pool There are 2 options to create an Apache Spark Pool.Go to your Azure Synapse Analytics Workspace in de Azure Portal and add a new Apache Spark Pool. Or go to the Management Tab in your Azure Synapse Analytics Workspace and add a new...