Azure Data Factory
Azure Data Factory is a cloud-based data integration and orchestration service by Microsoft Azure. See all your ETLs in decube and monitor the status of your jobs.
Supported Capabilities
Metadata Extraction
✅
Metadata Types Collected
Schema, Virtual Table, Virtual Column, Data Job, Data Run, Data Task
Data Profiling
❌
Data Preview
❌
Data Quality
❌
Configurable Collection
❌
External Table
❌
View Table
❌
Stored Procedure
❌
Supported Lineage Mapping
Azure Data Factory can map lineage relationships to upstream and downstream objects from the following connectors:
Upstream Connectors: postgresql, mysql, synapse, azure_server, databricks, sql_server, redshift, bigquery, adls
Downstream Connectors: postgresql, mysql, synapse, azure_server, databricks, sql_server, redshift, bigquery, adls
Connection Requirements
From our Azure account, we will need the following information:
Tenant ID
Client ID
Client Secret
Subscription ID
Resource Group Name
Factory Name
Data Source Name

How to connect
On the Azure Home Page, go to
Azure Active Directory
.

Go to
App registrations
.

Click on
New registration
.

Click
Register
.

Save the
Application (client) ID
andDirectory (tenant) ID
.Click
Add a certificate or secret
.Go to
Client secrets
and click+ New client secret
.

Click
Add
.

Copy and save the
Value
.

Go to Data Factories and click the factory you wanna add.

Go to
Access Control (IAM)
.

Click on
+ Add
and selectAdd role assignment
.

Select
Data Factory Contributor
.

Go to
Members
tab and Click on+ Select members
.

Search and select the service principal that was created in the previous step. Click on
Select
.

Go to
Review + assign
tab and ClickReview + assign

Go to Data Factories and select the factory you wanna add, copy the
Name
andResource group
.

Copy the
Subscription ID
.

Fill all the required fields in the connection form, and click on Test this connection once connection is successful, give your database a name and connect the data source.
Last updated