Jump to Content
View Interactive Demo ↗
Book a call ↗
Go to Datastreamer.io ↗
Documentation
Recipes & Templates
Product Updates
View Interactive Demo ↗
Book a call ↗
Go to Datastreamer.io ↗
Documentation
Documentation
Recipes & Templates
Product Updates
IPTC Media Topic Categorization Taxonomy
All
Pages
Start typing to search…
Getting Started
👋 Welcome to Datastreamer!
🚀 Platform Overview
🎯 Platform FAQs
What are Data Volume Units?
Platform Glossary
Creating Your First Pipeline
Creating Your First Pipeline
Creating Your Account
Quick Start Pipeline Templates
Sentiment, Location, and Language Enriched Pipeline Template
Multi-source Entity Recognition Pipeline Template
Pipeline Management
What is a Dynamic Pipeline?
Pipeline Deployment
Managing Pipelines
Pipeline Versioning
Pausing, Stopping, and Deleting
Pipeline Import and Export
Tracking & Monitoring
Pipeline Document Inspector Component
Volume Health Monitoring & Alerting
Pipeline Metrics & Analytics
Component Log Viewer
Failed Items Viewer
Quick Diagnostics
DATA CONNECTORS
Connector Automation (Jobs)
Creating Jobs (Portal, API)
Managing Jobs
Stopping a Job (Portal)
Listing Jobs via Jobs API
Get Job Details via Jobs API
Cancelling Jobs via Jobs API
Deleting Jobs via Jobs API
Updating Jobs (Portal, API)
Advanced Job Search
Job Failure Handling
Jobs Creation with AI/Agents
Best Practices for Data Collection Jobs
Jobs DVU Count API
Connector Automation (AI Tools)
MCP Server Setup Guide
Job Creation Agent
Automated Data Sources
About Auto Sources
Twitter/X Setup (Auto Sources)
Instagram Setup (Auto Sources)
Threads Setup (Auto Sources)
Facebook Setup (Auto Sources)
Reddit Setup (Auto Sources)
YouTube Setup (Auto Sources)
TikTok Setup (Auto Sources)
Data Sources
Apify
Apify Setup Guide
Troubleshooting & FAQ
Brightdata
Brightdata Account Setup
Brightdata Amazon Products
Brightdata CNN News
Brightdata Crunchbase Business
Brightdata Ebay Products
Brightdata Etsy Products
Brightdata G2 Reviews
Brightdata Github Code
Brightdata Glassdoor Jobs
Brightdata Google Shopping
Brightdata Indeed Jobs
Brightdata Instagram Posts
Brightdata Pinterest Posts
Brightdata Reddit Posts
Brightdata Shein Products
Brightdata Target Products
Brightdata Trustradius Reviews
Brightdata Walmart Products
Brightdata Yahoo Finance Business
Brightdata Youtube Posts
Bluesky Live Feed
DarkOwl Search
Data365
Data365 Facebook Keywords Latest
Data365 Instagram Profile Feed Posts
Data365 Instagram Profile Search
Data365 Tiktok Keywords
Data365 X (Twitter) Keywords
Data365 Facebook Posts Search
Opoint News
Socialgist
Socialgist Blogs
Socialgist Blog Links
Socialgist Boards
Socialgist Boards Compliance
Socialgist News
Socialgist News Compliance
Socialgist Quora
Socialgist Reddit
Socialgist Reddit Links
Socialgist Reviews
Socialgist Tencent Weibo
Socialgist Tiktok
Socialgist Videos
Socialgist VK
WebSightLine
WebSightLine Augmented Instagram
WebSightLine Instagram
WebSightLine Threads
Private Data Sources
Databases
Datastreamer File Storage Ingress
Datastreamer Searchable Storage Ingress
Event Streaming
Google Pub/Sub Ingress
Direct Data Upload
Cloud Storage
Cloud Storage Ingress Configuration
Google Cloud Storage Ingress
Google Cloud Storage Ingress Setup Guide
Amazon S3 Ingress
Amazon S3 Ingress Setup Guide
Azure Blob Storage Ingress
Azure Blob Storage Ingress Setup Guide
Cloudflare R2 Storage Ingress
Cloudflare R2 Setup Guide
SFTP
TRANSFORMATIONS
Unify Transformer
Unified Data Dictionary
JSON Schema Transformer
Custom Transformations
OPERATIONS
Operations & Enrichments Overview
Custom Functions
Routing & Filtering
Concat
JSON Router
Splitters
Lucene Document Filter
Document Batcher
Document Deduplication
AI Operations
Google Translate
Gemini Translate (Large Language Model)
Hard News
Violence Detection
Intent Classification
Sentiment Classification
AI Sentiment Classifier
AI Emotion Classification
Product Sentiment Classifier
Sentiment Classification (Long content )
Sentiment Classification (short content)
Category Classification
AI Category Classifier
Market Interest Categorization Taxonomy
IPTC Media Topic Categorization Taxonomy
Entity Recognition
Named Entity Recognition
AI Entity Recognition Classifier
AI Brand Recognition
Intent Classification
ESG Classification
ESG Classifier
AI ESG Classifier
Location Classification
Location Inference Models
Dominant Location Classifier
Open AI Completion
Private AI PII Redaction
Content Similarity Clustering
Influence Classification
NLP Classifiers
Language Detection (Datastreamer)
Language Detection (Google Service)
File Operations
PDF Table Extraction to Unified Schema
PDF to JSON Text Extraction
WebSightLine File Fetcher
WebSightLine Profile Fetcher
DESTINATION CONNECTORS
What is "Pipeline Egress"?
Databases
Snowflake
Snowflake Setup Guide
Databricks
Databricks File Egress
Databricks SQL Egress
Searchable Storage by Datastreamer
Adding & Using Searchable Storage
Managing your Searchable Storage
Searchable Storage APIs
Search API
Aggregations with Search API
Count API
ETL Platforms
Fivetran
Fivetran Setup Guide
Webhook
Firehose
Firehose
Cloud Storage
Amazon S3 Storage Egress Connector
Google Cloud Storage Egress
Azure Blob Storage Egress
Cloudflare R2 Storage Egress
SFTP Storage Egress
Event Streaming
Google Pub/Sub Egress Connector
Datastreamer File Storage Egress
Elasticsearch Egress
Big Query
Big Query Writer
Big Query Fixed Schema Writer
Social Voice
Social Voice Keywords Extraction
Social Voice Toxicity Detection
Social Voice IAB Categories
Social Voice Direction of Focus
Social Voice Political Leaning
Social Voice Tonality Analysis
Social Voice Personality Analysis
Social Voice Entity Detection
Social Voice Transcription
Social Voice Audio Quality Analysis
Social Voice Music Detection
Social Voice Transcription and Translation
Billing & Cost Management
How Datastreamer is priced
Committed Usage Discounts (Commits)
Billing Management
Billing Dashboard
Detailed Billing Views (Using Tags)
Pricing Calculator
Budget Alerts
Tips: Optimizing Your Datastreamer Usage
Add-On Bundles
What are Bundles?
NLP Classifiers (Datastreamer-provided)
AI Classifiers (Datastreamer-provided)
Location Inference Bundle (Datastreamer-provided)
Bright Data Specialty Sources
Bright Data High Result Source Bundle
Security & Compliance
Datastreamer Architecture Overview
Pipeline Regional Deployment
Platform Transparency Overview
Platform Security Overview
Compliance-Sensitive Usage
Other
Tips & Tricks
How To Enrich Your Own Data (Bring Your Own Data)
Blacklist Filtering
Estimating external data volumes
Google Sheet Integration
📘
Executive "Launch" Guide
Powered by
Loading
Loading…