YouTube
The YouTube source collects videos and comments from YouTube as part of a Data Stream.
Datastreamer selects from available providers automatically based on your Job configuration. You do not need to manage provider accounts or credentials to use this source.
Configuring a Job
When creating a Job for the YouTube source, you define what content to collect. Common configuration options include:
- Keywords / query: terms to search for across YouTube
- Channel targets: specific channels to monitor
- Date range: the time window to collect data from
- Content type: videos, comments, or both
- Document limit: maximum number of documents per Job run
Refer to the Job creation documentation for full configuration details: Creating Jobs
What is Collected
Each document returned from the YouTube source represents a video or comment. Fields are mapped to the Datastreamer unified schema and include content title and description, author metadata, engagement metrics (views, likes, comments), and publish date.
For audio and video transcription from YouTube content, see Direct Integration for enrichments to apply.
Troubleshooting
Job fails or returns no data
- Check that the query or channel target is valid and publicly accessible
- Verify the date range contains data
- Review Job logs for specific errors
Unexpectedly high document counts
- Scope the query to specific channels rather than broad keywords
- Set a document limit on the Job
- Review the DVU pricing page
Provider switch noted in Job logs
- This is expected behavior. If a provider is unavailable, the Job is routed to an alternative automatically. No action is required.
Related
- Sources Overview
- Creating Jobs
- Social Voice
- Data Volume Units
- Direct Integrations: use if you need a specific provider directly
