Azure Data Factory
Azure Data Factory is a cloud-based data integration and orchestration service by Microsoft Azure. See all your ETLs in decube and monitor the status of your jobs.
Supported Capabilities
Metadata Extraction: ✅
Metadata Types Collected: Schema, Virtual Table, Virtual Column, Data Job, Data Run, Data Task
Data Profiling: ❌
Data Preview: ❌
Data Quality: ❌
Configurable Collection: ❌
External Table: ❌
View Table: ❌
Stored Procedure: ❌
Supported Lineage Mapping
Azure Data Factory can map lineage relationships to upstream and downstream objects from the following connectors:
Upstream Connectors: postgresql, mysql, synapse, azure_server, databricks, sql_server, redshift, bigquery, adls
Downstream Connectors: postgresql, mysql, synapse, azure_server, databricks, sql_server, redshift, bigquery, adls
Connection Requirements
From your Azure account, we will need the following information:
Tenant ID
Client ID
Client Secret
Subscription ID
Resource Group Name
Factory Name
Data Source Name
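
Before pasting these values into the connection form, it can help to sanity-check them locally. The following is a minimal sketch, not part of decube: the `AdfConnection` class and its field names are hypothetical, and the only assumption about Azure is that Tenant ID, Client ID, and Subscription ID are GUIDs.

```python
import re
from dataclasses import dataclass, fields
from typing import List

# Azure tenant, client, and subscription IDs are GUIDs (8-4-4-4-12 hex digits).
GUID_RE = re.compile(
    r"^[0-9a-fA-F]{8}-[0-9a-fA-F]{4}-[0-9a-fA-F]{4}-[0-9a-fA-F]{4}-[0-9a-fA-F]{12}$"
)


@dataclass(frozen=True)
class AdfConnection:
    """Hypothetical container for the seven connection-form values."""

    tenant_id: str
    client_id: str
    client_secret: str
    subscription_id: str
    resource_group_name: str
    factory_name: str
    data_source_name: str

    def problems(self) -> List[str]:
        """Return readable problems; an empty list means the values look plausible."""
        found = []
        # Every field is required, so flag any that is blank.
        for f in fields(self):
            if not getattr(self, f.name).strip():
                found.append(f"{f.name} is empty")
        # The three ID fields must additionally be GUID-shaped.
        for name in ("tenant_id", "client_id", "subscription_id"):
            value = getattr(self, name).strip()
            if value and not GUID_RE.match(value):
                found.append(f"{name} is not a GUID")
        return found
```

Calling `problems()` on a filled-in `AdfConnection` returns a list of issues to fix. It only checks the shape of the values, not whether Azure accepts them.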

How to connect
On the Azure Home Page, go to Azure Active Directory (now named Microsoft Entra ID).

Go to App registrations.

Click on New registration.

Click Register.

Save the Application (client) ID and Directory (tenant) ID.

Click Add a certificate or secret.

Go to Client secrets and click + New client secret.

Click Add.

Copy and save the secret Value. This is the Client Secret for the connection form.

Go to Data Factories and click the factory you want to add.

Go to Access Control (IAM).

Click on + Add and select Add role assignment.

Select Data Factory Contributor.

Go to the Members tab and click on + Select members.

Search for and select the service principal (the app registration created in the earlier steps), then click on Select.

Go to the Review + assign tab and click Review + assign.

Go to Data Factories, select the factory you want to add, and copy its Name and Resource group.

Copy the Subscription ID.

Fill in all the required fields in the connection form and click on Test this connection. Once the connection is successful, give your data source a name and connect the data source.
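
If the connection test fails, the collected values can be checked independently of decube. The sketch below is an assumption about how such a check could look, not decube's own implementation. It requests a token from the Microsoft identity platform with the client-credentials flow, then reads the factory through the Azure Resource Manager REST API. The function names are hypothetical, and `2018-06-01` is assumed to be the Data Factory API version in use.

```python
import json
import urllib.parse
import urllib.request

ARM_SCOPE = "https://management.azure.com/.default"
API_VERSION = "2018-06-01"  # assumed Data Factory REST API version


def token_request(tenant_id, client_id, client_secret):
    """Build (but do not send) the OAuth 2.0 client-credentials token request."""
    url = f"https://login.microsoftonline.com/{tenant_id}/oauth2/v2.0/token"
    body = urllib.parse.urlencode({
        "grant_type": "client_credentials",
        "client_id": client_id,
        "client_secret": client_secret,
        "scope": ARM_SCOPE,
    }).encode("ascii")
    return urllib.request.Request(url, data=body, method="POST")


def factory_url(subscription_id, resource_group, factory_name):
    """Build the Resource Manager URL that identifies one data factory."""
    def quote(part):
        return urllib.parse.quote(part, safe="")

    return (
        "https://management.azure.com"
        f"/subscriptions/{quote(subscription_id)}"
        f"/resourceGroups/{quote(resource_group)}"
        f"/providers/Microsoft.DataFactory/factories/{quote(factory_name)}"
        f"?api-version={API_VERSION}"
    )


def check_access(tenant_id, client_id, client_secret,
                 subscription_id, resource_group, factory_name):
    """Send both requests; urllib raises HTTPError if either one is rejected."""
    with urllib.request.urlopen(
        token_request(tenant_id, client_id, client_secret)
    ) as resp:
        token = json.load(resp)["access_token"]
    req = urllib.request.Request(
        factory_url(subscription_id, resource_group, factory_name),
        headers={"Authorization": f"Bearer {token}"},
    )
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)["name"]  # the factory name as Azure reports it
```

An error on the token request usually points at the Tenant ID, Client ID, or Client Secret. An error on the factory request points at the role assignment or at the Subscription ID, Resource Group Name, or Factory Name.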