YouTube

The YouTube source collects videos and comments from YouTube as part of a Data Stream.


📘

Datastreamer selects from available providers automatically based on your Job configuration. You do not need to manage provider accounts or credentials to use this source.

Configuring a Job

When creating a Job for the YouTube source, you define what content to collect. Common configuration options include:

  • Keywords / query: terms to search for across YouTube
  • Channel targets: specific channels to monitor
  • Date range: the time window to collect data from
  • Content type: videos, comments, or both
  • Document limit: maximum number of documents per Job run

Refer to the Job creation documentation for full configuration details: Creating Jobs


What is Collected

Each document returned from the YouTube source represents a video or comment. Fields are mapped to the Datastreamer unified schema and include content title and description, author metadata, engagement metrics (views, likes, comments), and publish date.

For audio and video transcription from YouTube content, see Direct Integration for enrichments to apply.


Troubleshooting

Job fails or returns no data

  • Check that the query or channel target is valid and publicly accessible
  • Verify the date range contains data
  • Review Job logs for specific errors

Unexpectedly high document counts

  • Scope the query to specific channels rather than broad keywords
  • Set a document limit on the Job
  • Review the DVU pricing page

Provider switch noted in Job logs

  • This is expected behavior. If a provider is unavailable, the Job is routed to an alternative automatically. No action is required.

Related