Azure Function for Metadata
Decube can leverage a customer-hosted Azure Function to retrieve file schemas, minimizing file egress from the customer's environment to Decube's compute. While this approach is generally applicable to the SaaS deployment model, it can also be beneficial for self-hosted solutions where the Data Plane resides in a different region than the Storage Container.
Prerequisite
Create an Azure Function with Custom Handler as the Runtime Stack
Networking between Decube Data Plane (either SaaS or customer own) to the Azure Function
Creating Azure Function
Go to your Azure Portal and navigate to Function App and click on `Create`

Choose either Consumption (recommended), Functions Premium or App Service

Fill in the appropriate field
Function App Name - a unique name for this function
Runtime stack - Custom Handler
Region - Closest to your ADLS storage account region
Operating System - Linux

Under Networking, choose Enable Public Access - On
See Azure documentation if alternate networking is required

Under Deployment, disable Continous Deployment
Storage, Monitoring, Tags are to be set up according to customer need
Create the Azure Function
Deployment
Visit https://github.com/DecubeIO/adls-azure-function to start
Clone the repo to a local machine
Run
func azure functionapp publish $functionAppName --custom
$functionAppName
is based on the Azure Function name used in Creating Azure Function
Using the Azure Function

When creating/modifying an ADLS source, enable
Use remote Azure Function
Fill in the Azure Function URL in this format:
https://myfunction.azurewebsites.net
Fill in the Azure Function Key. This can be found in your Portal Azure > Go to created function above > Functions > App Keys > Either choose an existing host key or create a new host key
Last updated