Instagram

The Instagram source collects posts, reels, and public profile content from Instagram as part of a Data Stream.

Note: Datastreamer selects from available providers automatically based on your Job configuration. You do not need to manage provider accounts or credentials to use this source.

Configuring a Job

When creating a Job for the Instagram source, you define what content to collect. Common configuration options include:

Keywords / hashtags: terms or hashtags to search for
Profile targets: specific Instagram accounts to monitor
Date range: the time window to collect data from
Content type: posts, reels, or both
Document limit: maximum number of documents per Job run

Refer to the Job creation documentation for full configuration details: Creating Jobs
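The options listed above can be sketched as a small local structure that is checked before a Job is created. This is a minimal illustration only: the class name `InstagramJobConfig`, every field name, and the validation rules are assumptions made for this example, not the Datastreamer Job schema. Use the Job creation documentation for the actual field names.

```python
from dataclasses import dataclass, field
from datetime import date
from typing import List, Optional

# Content types named on this page; the string values are assumed spellings.
ALLOWED_CONTENT_TYPES = {"posts", "reels"}


@dataclass
class InstagramJobConfig:
    """Hypothetical container for the options above (not the real Job schema)."""

    keywords: List[str] = field(default_factory=list)         # keywords or hashtags
    profile_targets: List[str] = field(default_factory=list)  # accounts to monitor
    start_date: Optional[date] = None                         # start of date range
    end_date: Optional[date] = None                           # end of date range
    content_types: List[str] = field(default_factory=lambda: ["posts", "reels"])
    document_limit: Optional[int] = None                      # max documents per run

    def validate(self) -> List[str]:
        """Return a list of human-readable problems; an empty list means none found."""
        problems = []
        # Assumed rule: a Job needs something to search for.
        if not self.keywords and not self.profile_targets:
            problems.append("provide at least one keyword, hashtag, or profile target")
        if self.start_date and self.end_date and self.start_date > self.end_date:
            problems.append("start_date is after end_date")
        unknown = set(self.content_types) - ALLOWED_CONTENT_TYPES
        if unknown:
            problems.append(f"unknown content types: {sorted(unknown)}")
        if self.document_limit is not None and self.document_limit <= 0:
            problems.append("document_limit must be a positive integer")
        return problems


# Example: a hashtag search limited to reels, capped at 500 documents per run.
config = InstagramJobConfig(
    keywords=["#coffee"],
    start_date=date(2024, 1, 1),
    end_date=date(2024, 1, 31),
    content_types=["reels"],
    document_limit=500,
)
print(config.validate())
```

Checking the date range and document limit locally covers two of the common causes of empty or unexpectedly large Job runs described under Troubleshooting.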

What is Collected

Each document returned from the Instagram source represents a post or reel. Fields are mapped to the Datastreamer unified schema and include content text, media references, author metadata, engagement metrics (likes, comments), and post date.

Platform-specific fields are available under the instagram schema namespace. See the Schema Reference for field details.
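As a rough illustration of working with a returned document, the sketch below reads the kinds of fields described above from a plain dictionary. The key names (`content`, `media`, `author`, `engagement`, `post_date`, `instagram`) and the sample values are placeholders assumed for this example; the real field names are defined in the Schema Reference.

```python
from typing import Any, Dict


def summarize_document(doc: Dict[str, Any]) -> Dict[str, Any]:
    """Build a short summary from one document, tolerating missing fields."""
    engagement = doc.get("engagement") or {}
    likes = engagement.get("likes") or 0
    comments = engagement.get("comments") or 0
    return {
        "author": (doc.get("author") or {}).get("username"),
        "posted_at": doc.get("post_date"),
        "text_preview": (doc.get("content") or "")[:80],
        "media_count": len(doc.get("media") or []),
        "total_engagement": likes + comments,
        # Platform-specific fields live under the instagram namespace.
        "platform_fields": sorted((doc.get("instagram") or {}).keys()),
    }


# Hypothetical document with placeholder values, for illustration only.
sample = {
    "content": "Morning pour-over #coffee",
    "media": [{"type": "image", "url": "https://example.com/1.jpg"}],
    "author": {"username": "example_account"},
    "engagement": {"likes": 12, "comments": 3},
    "post_date": "2024-01-15T08:30:00Z",
    "instagram": {"is_reel": False},
}
print(summarize_document(sample))
```

Using `.get()` with fallbacks keeps the summary from failing when a document lacks an optional field such as engagement metrics.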

Troubleshooting

Job fails or returns no data

Check that the query, hashtag, or account target is valid and publicly accessible
Verify the date range contains data
Review Job logs for specific errors

Unexpectedly high document counts

Add more specific filters to narrow the query
Set a document limit on the Job to cap volume per run
Review the Data Volume Units (DVU) pricing page to understand how document counts affect costs

Provider switch noted in Job logs

This is expected behavior. If a provider is unavailable, the Job is routed to an alternative automatically. No action is required.

Related

Sources Overview
Creating Jobs
Data Volume Units
Direct Integrations: use these if you need to work with a specific provider directly

Updated about 1 month ago
