Instagram

The Instagram source collects posts, reels, and public profile content from Instagram as part of a Data Stream.


📘

Datastreamer selects from available providers automatically based on your Job configuration. You do not need to manage provider accounts or credentials to use this source.

Configuring a Job

When creating a Job for the Instagram source, you define what content to collect. Common configuration options include:

  • Keywords / hashtags: terms or hashtags to search for
  • Profile targets: specific Instagram accounts to monitor
  • Date range: the time window to collect data from
  • Content type: posts, reels, or both
  • Document limit: maximum number of documents per Job run

Refer to the Job creation documentation for full configuration details: Creating Jobs


What is Collected

Each document returned from the Instagram source represents a post or reel. Fields are mapped to the Datastreamer unified schema and include content text, media references, author metadata, engagement metrics (likes, comments), and post date.

Platform-specific fields are available under the instagram schema namespace. See the Schema Reference for field details.


Troubleshooting

Job fails or returns no data

  • Check that the query, hashtag, or account target is valid and publicly accessible
  • Verify the date range contains data
  • Review Job logs for specific errors

Unexpectedly high document counts

  • Add more specific filters to narrow the query
  • Set a document limit on the Job to cap volume per run
  • Review the DVU pricing page to understand how document counts affect costs

Provider switch noted in Job logs

  • This is expected behavior. If a provider is unavailable, the Job is routed to an alternative automatically. No action is required.

Related