Azure Blob Storage Ingress Setup Guide

Step-by-step of how to ingest data from Azure Blob Storage into your Datastreamer pipeline.

A step-by-step guide to configuring your pipeline with Azure Blob Storage Ingress with credentials and Jobs.

Prerequisites

To use Azure Blob Storage within your Pipelines, you will need to have the following:

  1. Datastreamer Portal account with access to add a new component.
  2. Azure Shared Access Token (SAS). Here is a guide on how to create a SAS.
  3. The container exists.
  4. Ensure the provided SAS has permission to view, list and write objects.

Setup Instructions

Step 1: Open the target Pipeline within Portal

You can create a new Pipeline or open an existing Pipeline within the Portal.

Step 2: Add a Google Cloud Storage Ingress component from the selection menu

Click on the Ingress (+) button and select the component Azure Blob Storage Ingress on the side panel menu.

Step 3: Configure the component

Next, click on the Configure button, and a list of properties will be displayed on the side panel.

The configuration is divided into 2 parts: the Component Properties and the Jobs. Component Properties must be configured before deploying the pipeline. Jobs, on the other hand, can be configured before or after the pipeline is deployed.

Component Properties Form

New Job Form

Step 4: Deploy your Pipeline

Upon deployment, this Pipeline is fully ready to consume and process data. Deploying generally takes 10-30 seconds for full rollout and diagnostics.

More details on Pipeline deployment is available here: Docs: Pipeline Deployoment