The Instagram source collects posts, reels, and public profile content from Instagram as part of a Data Stream.
Datastreamer selects from available providers automatically based on your Job configuration. You do not need to manage provider accounts or credentials to use this source.
Configuring a Job
When creating a Job for the Instagram source, you define what content to collect. Common configuration options include:
- Keywords / hashtags: terms or hashtags to search for
- Profile targets: specific Instagram accounts to monitor
- Date range: the time window to collect data from
- Content type: posts, reels, or both
- Document limit: maximum number of documents per Job run
Refer to the Job creation documentation for full configuration details: Creating Jobs
What is Collected
Each document returned from the Instagram source represents a post or reel. Fields are mapped to the Datastreamer unified schema and include content text, media references, author metadata, engagement metrics (likes, comments), and post date.
Platform-specific fields are available under the instagram schema namespace. See the Schema Reference for field details.
Troubleshooting
Job fails or returns no data
- Check that the query, hashtag, or account target is valid and publicly accessible
- Verify the date range contains data
- Review Job logs for specific errors
Unexpectedly high document counts
- Add more specific filters to narrow the query
- Set a document limit on the Job to cap volume per run
- Review the DVU pricing page to understand how document counts affect costs
Provider switch noted in Job logs
- This is expected behavior. If a provider is unavailable, the Job is routed to an alternative automatically. No action is required.
Related
- Sources Overview
- Creating Jobs
- Data Volume Units
- Direct Integrations: use if you need a specific provider directly
