[https://github.com/imperatrona/twitter-scraper] Scrape the Twitter frontend API without authentication with Golang.

Alexander Sheiko 01ac1b672c Add optional delay between API requests		2021-07-16 13:52:22 +03:00
.github/workflows	Create codeql-analysis.yml	2020-10-02 10:07:08 +03:00
.gitignore	add scrap tweets for any search query feature	2020-05-14 14:59:33 +02:00
api.go	Add optional delay between API requests	2021-07-16 13:52:22 +03:00
api_test.go	Code optimization	2021-01-28 11:12:20 +02:00
go.mod	Total refactoring	2020-12-11 20:58:49 +02:00
go.sum	Total refactoring	2020-12-11 20:58:49 +02:00
LICENSE	Add MIT license	2020-02-11 14:40:05 +02:00
profile.go	Move cacheIDs	2021-04-23 10:41:22 +03:00
profile_test.go	Fix profile test	2021-07-02 14:55:49 +03:00
README.md	Add optional delay between API requests	2021-07-16 13:52:22 +03:00
scraper.go	Add optional delay between API requests	2021-07-16 13:52:22 +03:00
search.go	Extend timeline object	2021-07-16 11:08:43 +03:00
search_test.go	Add SearchProfiles	2021-04-22 21:38:49 +03:00
timeline.go	Add InReplyToStatus, QuotedStatus and RetweetedStatus parsing in Tweet	2021-07-16 12:21:41 +03:00
trends.go	Add Scraper object	2020-12-12 23:33:57 +02:00
trends_test.go	Fix error msg	2021-04-22 20:35:33 +03:00
tweets.go	Extend timeline object	2021-07-16 11:08:43 +03:00
tweets_test.go	Fix test	2021-07-16 12:39:11 +03:00
types.go	Remove unused Retweet type	2021-07-16 12:59:16 +03:00
util.go	Extend timeline object	2021-07-16 11:08:43 +03:00

Twitter Scraper

Twitter's API is annoying to work with and has lots of limitations. Luckily, their frontend (JavaScript) has its own API, which I reverse-engineered. No API rate limits. No tokens needed. No restrictions. Extremely fast.

You can use this library to get the text of any user's Tweets trivially.

Installation

go get -u github.com/n0madic/twitter-scraper

Usage

Get user tweets

package main

import (
    "context"
    "fmt"
    twitterscraper "github.com/n0madic/twitter-scraper"
)

func main() {
    scraper := twitterscraper.New()
    for tweet := range scraper.GetTweets(context.Background(), "Twitter", 50) {
        if tweet.Error != nil {
            panic(tweet.Error)
        }
        fmt.Println(tweet.Text)
    }
}

The last argument is the maximum number of tweets to return (50 in this example). The timeline itself appears to be capped at about 3200 tweets.
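GetTweets takes a context.Context and delivers results over a channel, so cancelling the context is presumably how a caller stops a scrape early. The sketch below does not call the library. The `result` type, `produce` and `firstN` are hypothetical stand-ins that only mimic the pattern (a channel of values carrying an Error field), on the assumption that the producer honors cancellation.

```go
package main

import (
	"context"
	"fmt"
)

// result is a hypothetical stand-in for the library's tweet result:
// either Text is set or Error is non-nil.
type result struct {
	Text  string
	Error error
}

// produce is a hypothetical producer. It emits up to max results on a
// channel and stops as soon as ctx is cancelled.
func produce(ctx context.Context, max int) <-chan result {
	ch := make(chan result)
	go func() {
		defer close(ch)
		for i := 0; i < max; i++ {
			select {
			case <-ctx.Done():
				return
			case ch <- result{Text: fmt.Sprintf("tweet %d", i)}:
			}
		}
	}()
	return ch
}

// firstN reads from a producer of max items and stops after limit items
// or on the first error. The deferred cancel tells the producer to stop,
// so its goroutine does not stay blocked on a send.
func firstN(max, limit int) ([]string, error) {
	ctx, cancel := context.WithCancel(context.Background())
	defer cancel()

	var texts []string
	for r := range produce(ctx, max) {
		if r.Error != nil {
			return texts, r.Error
		}
		texts = append(texts, r.Text)
		if len(texts) == limit {
			break
		}
	}
	return texts, nil
}

func main() {
	texts, err := firstN(50, 3)
	if err != nil {
		fmt.Println("error:", err)
		return
	}
	fmt.Println(texts)
}
```

Returning the error instead of panicking lets the caller keep whatever was collected before the failure.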

Get single tweet

package main

import (
    "fmt"

    twitterscraper "github.com/n0madic/twitter-scraper"
)

func main() {
    scraper := twitterscraper.New()
    tweet, err := scraper.GetTweet("1328684389388185600")
    if err != nil {
        panic(err)
    }
    fmt.Println(tweet.Text)
}

Search tweets with standard query operators

Tweets containing “twitter” and “scraper” and “data”, filtering out retweets:

package main

import (
    "context"
    "fmt"
    twitterscraper "github.com/n0madic/twitter-scraper"
)

func main() {
    scraper := twitterscraper.New()
    for tweet := range scraper.SearchTweets(context.Background(),
        "twitter scraper data -filter:retweets", 50) {
        if tweet.Error != nil {
            panic(tweet.Error)
        }
        fmt.Println(tweet.Text)
    }
}

The search stops once 50 tweets have been collected.

See Rules and filtering for how to build standard queries.
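When queries are assembled from user input, a small helper can keep the operator syntax in one place. This is only a sketch: `buildQuery` is a hypothetical function, not part of the library, and it handles just the space-separated terms and the `-filter:` operator used in the example above.

```go
package main

import (
	"fmt"
	"strings"
)

// buildQuery is a hypothetical helper (not part of the library). It joins
// the required terms with spaces and appends one "-filter:<name>" operator
// for each excluded filter.
func buildQuery(terms []string, excludeFilters ...string) string {
	parts := make([]string, 0, len(terms)+len(excludeFilters))
	parts = append(parts, terms...)
	for _, f := range excludeFilters {
		parts = append(parts, "-filter:"+f)
	}
	return strings.Join(parts, " ")
}

func main() {
	q := buildQuery([]string{"twitter", "scraper", "data"}, "retweets")
	fmt.Println(q) // twitter scraper data -filter:retweets
}
```

The resulting string could then be passed as the query argument of SearchTweets.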

Set search mode

scraper.SetSearchMode(twitterscraper.SearchLatest)

Options:

twitterscraper.SearchTop - default mode
twitterscraper.SearchLatest - live mode
twitterscraper.SearchPhotos - image mode
twitterscraper.SearchVideos - video mode
twitterscraper.SearchUsers - user mode

Get profile

package main

import (
    "fmt"
    twitterscraper "github.com/n0madic/twitter-scraper"
)

func main() {
    scraper := twitterscraper.New()
    profile, err := scraper.GetProfile("Twitter")
    if err != nil {
        panic(err)
    }
    fmt.Printf("%+v\n", profile)
}

Search profiles by query

package main

import (
    "context"
    "fmt"
    twitterscraper "github.com/n0madic/twitter-scraper"
)

func main() {
    scraper := twitterscraper.New().SetSearchMode(twitterscraper.SearchUsers)
    for profile := range scraper.SearchProfiles(context.Background(), "Twitter", 50) {
        if profile.Error != nil {
            panic(profile.Error)
        }
        fmt.Println(profile.Name)
    }
}

Get trends

package main

import (
    "fmt"
    twitterscraper "github.com/n0madic/twitter-scraper"
)

func main() {
    scraper := twitterscraper.New()
    trends, err := scraper.GetTrends()
    if err != nil {
        panic(err)
    }
    fmt.Println(trends)
}

Use http proxy

err := scraper.SetProxy("http://localhost:3128")
if err != nil {
    panic(err)
}
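How SetProxy works internally is not shown here. As a rough illustration of what routing requests through an HTTP proxy usually involves with Go's standard library, the sketch below builds an http.Client whose transport uses a fixed proxy URL. `newProxyClient` and `proxyFor` are hypothetical names, and no network request is made.

```go
package main

import (
	"fmt"
	"net/http"
	"net/url"
)

// newProxyClient returns an http.Client whose transport sends every
// request through the given proxy address.
func newProxyClient(proxyAddr string) (*http.Client, error) {
	u, err := url.Parse(proxyAddr)
	if err != nil {
		return nil, err
	}
	if u.Scheme == "" || u.Host == "" {
		return nil, fmt.Errorf("invalid proxy address %q", proxyAddr)
	}
	return &http.Client{
		Transport: &http.Transport{Proxy: http.ProxyURL(u)},
	}, nil
}

// proxyFor reports which proxy the client would use for target.
// It only consults the transport's Proxy function and sends nothing.
func proxyFor(c *http.Client, target string) (string, error) {
	req, err := http.NewRequest(http.MethodGet, target, nil)
	if err != nil {
		return "", err
	}
	t, ok := c.Transport.(*http.Transport)
	if !ok || t.Proxy == nil {
		return "", nil
	}
	u, err := t.Proxy(req)
	if err != nil || u == nil {
		return "", err
	}
	return u.String(), nil
}

func main() {
	c, err := newProxyClient("http://localhost:3128")
	if err != nil {
		panic(err)
	}
	p, err := proxyFor(c, "https://example.com/")
	if err != nil {
		panic(err)
	}
	fmt.Println("requests would go through:", p)
}
```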

Delay requests

Add a delay between API requests (in seconds):

scraper.WithDelay(5)
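To illustrate the idea of spacing out requests, here is a minimal throttle built on the standard library. It is not the library's implementation. The `throttle` type and `elapsedMillis` are hypothetical, and the sketch uses milliseconds so that it runs quickly, whereas WithDelay above takes seconds.

```go
package main

import (
	"fmt"
	"time"
)

// throttle is a hypothetical helper that enforces a minimum gap
// between consecutive calls to Wait.
type throttle struct {
	delay time.Duration
	last  time.Time
}

// Wait blocks until at least delay has passed since the previous call
// to Wait returned. The first call returns immediately.
func (t *throttle) Wait() {
	if !t.last.IsZero() {
		if remaining := t.delay - time.Since(t.last); remaining > 0 {
			time.Sleep(remaining)
		}
	}
	t.last = time.Now()
}

// elapsedMillis calls Wait n times with the given delay and returns the
// total elapsed time in milliseconds.
func elapsedMillis(n, delayMillis int) int64 {
	t := &throttle{delay: time.Duration(delayMillis) * time.Millisecond}
	start := time.Now()
	for i := 0; i < n; i++ {
		t.Wait() // a real caller would issue one request after each Wait
	}
	return time.Since(start).Milliseconds()
}

func main() {
	fmt.Println("3 calls with a 20 ms gap took", elapsedMillis(3, 20), "ms")
}
```

Measuring from the end of the previous call, rather than sleeping a fixed amount each time, means time already spent handling a response counts toward the delay.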

Load timeline with tweet replies

scraper.WithReplies(true)

Default Scraper (Ad hoc)

In simple cases, you can use the default scraper without creating an object instance:

import (
    "context"

    twitterscraper "github.com/n0madic/twitter-scraper"
)

// for tweets
twitterscraper.GetTweets(context.Background(), "Twitter", 50)
// for tweets with replies
twitterscraper.WithReplies(true).GetTweets(context.Background(), "Twitter", 50)

// for search
twitterscraper.SearchTweets(context.Background(), "twitter", 50)

// for profile
twitterscraper.GetProfile("Twitter")

// for trends
twitterscraper.GetTrends()
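The package-level calls above resemble the shared default instance pattern familiar from net/http's DefaultClient. The sketch below shows that pattern with hypothetical stand-ins (`client`, `defaultClient`, `Describe`). Whether the library's package-level WithReplies changes a shared instance in the same way is an assumption, not something the text above confirms.

```go
package main

import "fmt"

// client is a hypothetical stand-in for a scraper object with one option.
type client struct {
	withReplies bool
}

// WithReplies sets the option and returns the receiver so calls can be chained.
func (c *client) WithReplies(on bool) *client {
	c.withReplies = on
	return c
}

// Describe reports what the client would fetch for user.
func (c *client) Describe(user string) string {
	if c.withReplies {
		return "tweets and replies of " + user
	}
	return "tweets of " + user
}

// defaultClient is the shared instance behind the package-level helpers.
var defaultClient = &client{}

// WithReplies delegates to the shared default instance.
func WithReplies(on bool) *client { return defaultClient.WithReplies(on) }

// Describe delegates to the shared default instance.
func Describe(user string) string { return defaultClient.Describe(user) }

func main() {
	fmt.Println(Describe("Twitter"))
	fmt.Println(WithReplies(true).Describe("Twitter"))
	// In this sketch the option was set on the shared instance,
	// so it is still in effect for later package-level calls.
	fmt.Println(Describe("Twitter"))
}
```

If a design like this applies, code that needs different settings in different places is better served by separate instances created with New().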