Datashake Social Media

The Datashake Social Media Integration lets you search and scrape social media content across multiple platforms using keyword searches, content scraping, and content source scraping. It handles the interaction, polling, and result download, and then ingests the returned data into your pipeline.

Datashake's complete endpoint documentation is available on the Datashake website.

Supported Platforms

Search Types

All Datashake platforms support three search types:

Search Type | Description
Keyword Search | Search for posts containing specific keywords using boolean operators (AND, OR, NOT, parentheses, and double quotes)
Content (Single Post) | Scrape a specific post with its comments
Content Source (Profile/Account) | Scrape multiple posts from a profile, channel, or account

Examples

Keyword Search

Use boolean operators to refine your search:

  • "cat OR dog" - Posts about cats or dogs
  • "tesla AND (model 3 OR model y) NOT elon" - Tesla posts about specific models, excluding Musk-related content
  • "python AND (programming OR coding) NOT (snake OR reptile)" - Python programming posts, excluding snakes
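Queries like the ones above can also be composed programmatically. The following is a minimal sketch, not part of the integration: the helper names (quote_term, any_of, build_query, is_balanced) are hypothetical, and it assumes that double quotes mark a multi-word phrase to be matched as a unit.

```python
def quote_term(term: str) -> str:
    """Wrap multi-word phrases in double quotes (assumed phrase syntax)."""
    term = term.strip()
    return f'"{term}"' if " " in term else term


def any_of(*terms: str) -> str:
    """Join terms with OR inside a single pair of parentheses."""
    return "(" + " OR ".join(quote_term(t) for t in terms) + ")"


def build_query(required, excluded=()):
    """AND the required parts together, then append one NOT clause per excluded part."""
    query = " AND ".join(required)
    for part in excluded:
        query += f" NOT {part}"
    return query


def is_balanced(query: str) -> bool:
    """Check that parentheses are balanced and double quotes are paired."""
    depth = 0
    in_quotes = False
    for ch in query:
        if ch == '"':
            in_quotes = not in_quotes
        elif not in_quotes:
            if ch == "(":
                depth += 1
            elif ch == ")":
                depth -= 1
                if depth < 0:  # closing parenthesis with no opener
                    return False
    return depth == 0 and not in_quotes


# Rebuilds the third example query from the list above.
query = build_query(
    required=["python", any_of("programming", "coding")],
    excluded=[any_of("snake", "reptile")],
)
print(query)  # python AND (programming OR coding) NOT (snake OR reptile)
```

Running is_balanced on a query before submitting it is a cheap way to catch a missing parenthesis or quote early.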

Content Scrape

Scrape a specific post to collect its data and its comments:

  • https://www.reddit.com/r/technology/comments/example_post/ - Reddit post
  • https://x.com/user/status/123456789 - Tweet
  • https://www.youtube.com/watch?v=example - YouTube video

Content Source Scrape

Scrape all posts from a profile, channel, or account:

  • https://www.reddit.com/r/technology/ - Reddit subreddit
  • https://x.com/elonmusk - X (Twitter) profile
  • https://www.youtube.com/@examplechannel - YouTube channel
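The difference between a Content URL and a Content Source URL is visible in the URL shapes listed above. The sketch below is an illustrative heuristic only, covering just the three platforms shown in these examples; the PLATFORMS mapping, the classify_url function, and the "content" / "content_source" labels are hypothetical names, not identifiers used by the integration or by Datashake.

```python
from urllib.parse import urlparse

# Hypothetical host-to-platform mapping, limited to the examples above.
PLATFORMS = {
    "reddit.com": "reddit",
    "x.com": "x",
    "youtube.com": "youtube",
}


def classify_url(url: str):
    """Return (platform, search_type) for a URL, or raise ValueError."""
    parsed = urlparse(url)
    host = parsed.hostname or ""
    if host.startswith("www."):
        host = host[4:]
    platform = PLATFORMS.get(host)
    if platform is None:
        raise ValueError(f"unrecognized host: {host!r}")

    # Non-empty path segments, e.g. ["r", "technology", "comments", "example_post"].
    segments = [s for s in parsed.path.split("/") if s]
    if platform == "reddit":
        is_post = "comments" in segments
    elif platform == "x":
        is_post = "status" in segments
    else:  # youtube: single videos use the /watch path
        is_post = segments[:1] == ["watch"]
    return platform, "content" if is_post else "content_source"


print(classify_url("https://x.com/user/status/123456789"))
print(classify_url("https://www.youtube.com/@examplechannel"))
```

A check like this can help route a URL to the right search type before a scrape is configured, though real URLs come in more shapes than this heuristic handles.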

Mode

Mode | Description
Archive | Searches only the internal archive (faster)
On Demand | Performs a real-time search (may take longer)
⚠️ Note: For X (Twitter), LinkedIn, and Bluesky keyword searches, the mode is always set to Archive.
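The mode rule in the note can be expressed as a small lookup. This is a sketch under stated assumptions: the constants, the platform and search-type strings, and the effective_mode function are hypothetical names chosen for illustration, and only the Archive-only rule itself comes from the note.

```python
ARCHIVE = "archive"
ON_DEMAND = "on_demand"

# Platforms whose keyword searches always run in Archive mode (per the note).
ARCHIVE_ONLY_KEYWORD_PLATFORMS = {"x", "linkedin", "bluesky"}


def effective_mode(platform: str, search_type: str, requested: str) -> str:
    """Return the mode that would actually apply to a search."""
    if requested not in (ARCHIVE, ON_DEMAND):
        raise ValueError(f"unknown mode: {requested!r}")
    if search_type == "keyword" and platform.lower() in ARCHIVE_ONLY_KEYWORD_PLATFORMS:
        return ARCHIVE  # the requested mode is overridden for these platforms
    return requested


print(effective_mode("bluesky", "keyword", ON_DEMAND))  # archive
print(effective_mode("reddit", "keyword", ON_DEMAND))   # on_demand
```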