Jump to Content
View Interactive Demo ↗
Book a call ↗
Go to Datastreamer.io ↗
Documentation
Recipes & Templates
Product Updates
View Interactive Demo ↗
Book a call ↗
Go to Datastreamer.io ↗
Documentation
Moon (Dark Mode)
Sun (Light Mode)
Documentation
Recipes & Templates
Product Updates
Databases
Search
All
Pages
Start typing to search…
Getting Started
👋 Welcome to Datastreamer!
🚀 Quickstart Guide
📘 Platform Glossary
🎯 Platform FAQs
Creating Your First Pipeline
Getting Access to Portal
CORE CONCEPTS
Core Concepts
Datastreamer Architecture Overview
What are Data Volume Units?
USING Datastreamer
Pipeline Deployment
Managing Pipelines
Pipeline Versioning
Pausing, Stopping, and Deleting
Pipeline Metrics & Analytics
Tracking & Monitoring
Pipeline Document Inspector Component
Volume Health Monitoring & Alerting
Connector Automation (Jobs)
Creating Jobs (Portal, API)
Managing Jobs
Stopping a Job (Portal)
Listing Jobs via Jobs API
Get Job Details via Jobs API
Updating Jobs (Portal)
Advanced Job Search
Job Failure Handling
DATA CONNECTORS
Data Sources
Brightdata
Bluesky Live Feed
DarkOwl Search
Data365
Data365 Facebook Keywords Latest
Data365 Instagram Profile Feed Posts
Data365 Instagram Profile Search
Data365 Tiktok Keywords
Data365 X (Twitter) Keywords
Data365 Facebook Posts Search
Opoint News
Socialgist
Socialgist Blogs
Socialgist Blog Links
Socialgist Boards
Socialgist Boards Compliance
Socialgist News
Socialgist News Compliance
Socialgist Quora
Socialgist Reddit
Socialgist Reddit Links
Socialgist Reviews
Socialgist Tencent Weibo
Socialgist Tiktok
Socialgist Videos
Socialgist VK
WebSightLine
WebSightLine Augmented Instagram
WebSightLine Instagram
WebSightLine Threads
Databases
Cloud Storage Ingress
Datastreamer Searchable Storage Ingress
Google Cloud Storage Ingress
Event Streaming
Google Pub/Sub Ingress
Direct Data Upload
TRANSFORMATIONS
Unify Transformer
Datastreamer Unify Schema
JSON Schema Transformer
Custom Transformations
OPERATIONS
Operations & Enrichments Overview
Custom Functions
Routing & Filtering
Concat
JSON Document Router
Splitters
AI Operations
Google Translate
Gemini Translate (Large Language Model)
Hard News
Violence Detection
Intent Classification
Sentiment Classification
AI Sentiment Classifier
AI Emotion Classification
Product Sentiment Classifier
Sentiment Classification (Long content )
Sentiment Classification (short content)
Category Classification
AI Category Classifier
Market Interest Categorization Taxonomy
IPTC Media Topic Categorization Taxonomy
Entity Recognition
Named Entity Recognition
AI Entity Recognition Classifier
AI Brand Recognition
Intent Classification
ESG Classification
ESG Classifier
AI ESG Classifier
Location Classification
Location Inference Models
Dominant Location Classifier
Open AI Completion
Open AI Image Generation
Private AI PII Redaction
Content Similarity Clustering
NLP Classifiers
Language Detection (Datastreamer)
Language Detection (Google Service)
File Operations
PDF Table Extraction to Unified Schema
PDF to JSON Text Extraction
File Fetching
DESTINATION CONNECTORS
What is "Pipeline Egress"?
Databases
Searchable Storage by Datastreamer
Adding & Using Searchable Storage
Managing your Searchable Storage
Searchable Storage APIs
Aggregation APIs
Search API
Count API
ETL Platforms
Fivetran
Fivetran Setup Guide
Webhook
Firehose
Firehose
Cloud Storage
Amazon S3 Storage Egress Connector
Event Streaming
Google Pub/Sub Egress Connector
Billing & Cost Management
Usage-Based Pricing
Billing Dashboard
Pricing Calculator
Security & Compliance
Pipeline Regional Deployment
Platform Transparency Overview
Platform Security Overview
Compliance-Sensitive Usage
Other
Add-On Bundles
What are Bundles?
NLP Classifiers (Datastreamer-provided)
AI Classifiers (Datastreamer-provided)
Location Inference Bundle (Datastreamer-provided)
Bright Data Specialty Sources
Bright Data High Result Source Bundle
Tips & Tricks
How To Enrich Your Own Data (Bring Your Own Data)
Blacklist Filtering
Estimating external data volumes
Powered by
Databases
Updated about 20 hours ago
What is "Pipeline Egress"?
Searchable Storage by Datastreamer