X-GPT

CLI tool for Twitter/X scraping and semantic search. Scrape tweets, generate embeddings, ask questions, discover users.

Quick Start

bun install
cp .env.example .env  # Add OPENAI_KEY, AUTH_TOKEN, CT0
bun run src/cli.ts db --init
bun dev                       # Start web UI at localhost:3001

Commands

# Development
bun dev                       # Start web UI at localhost:3001
bun cli                       # Run CLI directly

# Core Commands
xgpt interactive              # Guided setup
xgpt scrape <username>        # Scrape tweets from user
xgpt search "terms"           # Search tweets by topic/phrase
xgpt users discover "query"   # Find Twitter profiles by bio/name
xgpt embed                    # Generate embeddings
xgpt ask "question"           # Semantic search + GPT answer
xgpt serve                    # Start web UI
xgpt db --stats               # Database stats
xgpt config list              # Show config

Web UI

Start a browser-based interface with all CLI functionality:

bun dev                       # http://localhost:3001
xgpt serve --port 8080        # Custom port

Features:

Dashboard - Stats overview, quick actions
Scrape - Scrape tweets from any user
Search - Topic-based search with filters
Discover - Find Twitter profiles by bio/keywords
Ask - AI Q&A with relevant tweets
Config - Edit settings inline

Search Command

Find tweets by topic using Twitter's search API:

# Find AI startup discussions from last 7 days
xgpt search "building in public, indie hacker, shipped" --days 7

# Track trending tech topics
xgpt search "AGI, GPT-5, foundation models" --name "AI Trends" --max 1000

# Preview query without executing
xgpt search "rust lang, rustacean" --dry-run

# Search and auto-embed for semantic queries
xgpt search "YC demo day, fundraising" --mode top --embed

# Resume interrupted search
xgpt search --resume 42

Twitter Account Safety

Search and scrape operations count against your account's rate limits. Excessive usage may trigger Twitter's anti-bot detection.

Best Practices:

Start with --max 100 to test queries
Use --dry-run to preview before executing
Avoid running multiple concurrent searches
Space out large searches (1000+ tweets) by several hours

Rate Limit Handling:

Searches automatically wait and retry when rate limited
Use --resume <session-id> if you need to restart
Wait at least 15 minutes before retrying manually

Discover Command

Find Twitter profiles by bio, name, or keywords:

# Find Google engineers
xgpt users discover "google engineer" --max 20 --save

# Find AI researchers
xgpt users discover "AI researcher" --max 50

# Output as JSON
xgpt users discover "indie hacker" --json

Discovered profiles can be saved to the database with --save, storing bio, location, follower counts, and verification status.

How It Works

Scrape tweets from Twitter/X using session cookies
Generate vector embeddings via OpenAI
Query with natural language - finds relevant tweets via cosine similarity, generates answer with GPT

Documentation

Setup - Installation, cookies, environment
Usage - Commands, filtering, configuration
Database - Schema, migrations, optimization
Architecture - Project structure, data flow
Testing - Test commands

Tech Stack

Bun runtime
SQLite + Drizzle ORM
OpenAI API (embeddings + chat)
@the-convocation/twitter-scraper v0.21.0
Commander.js (CLI)
Elysia + HTMX (Web UI)

Development

bun dev               # Start web UI (localhost:3001)
bun cli               # Run CLI
bun test              # Run tests
bun run typecheck     # Type check

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 41 Commits
.ai/plans		.ai/plans
.claude		.claude
.github		.github
benchmarks		benchmarks
data		data
docs		docs
public		public
scripts		scripts
src		src
tests		tests
.env.template		.env.template
.gitignore		.gitignore
AGENTS.md		AGENTS.md
CLAUDE.md		CLAUDE.md
README.md		README.md
bun.lock		bun.lock
drizzle.config.ts		drizzle.config.ts
eslint.config.js		eslint.config.js
package.json		package.json
test-basic.ts		test-basic.ts
test-modules.ts		test-modules.ts
test-results.json		test-results.json
tsconfig.json		tsconfig.json
tweets.db		tweets.db
tweets.json		tweets.json
vectors.json		vectors.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

X-GPT

Quick Start

Commands

Web UI

Search Command

Twitter Account Safety

Discover Command

How It Works

Documentation

Tech Stack

Development

License

About

Uh oh!

Releases

Packages

Contributors 2

Languages

ParkerRex/XGPT

Folders and files

Latest commit

History

Repository files navigation

X-GPT

Quick Start

Commands

Web UI

Search Command

Twitter Account Safety

Discover Command

How It Works

Documentation

Tech Stack

Development

License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages