CLI tool for Twitter/X scraping and semantic search. Scrape tweets, generate embeddings, ask questions, discover users.
bun install
cp .env.example .env # Add OPENAI_KEY, AUTH_TOKEN, CT0
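# Example .env values (placeholders; AUTH_TOKEN and CT0 are assumed to be the
# auth_token and ct0 cookies from a logged-in x.com browser session):
#   OPENAI_KEY=sk-...
#   AUTH_TOKEN=<auth_token cookie value>
#   CT0=<ct0 cookie value>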
bun run src/cli.ts db --init
bun dev # Start web UI at localhost:3002

# Development
bun dev # Start web UI at localhost:3002
bun cli # Run CLI directly
# Core Commands
xgpt interactive # Guided setup
xgpt scrape <username> # Scrape tweets from user
xgpt search "terms" # Search tweets by topic/phrase
xgpt users discover "query" # Find Twitter profiles by bio/name
xgpt embed # Generate embeddings
xgpt ask "question" # Semantic search + GPT answer
xgpt read <tweet> # Fetch a single tweet by ID or URL
xgpt thread <tweet> # Fetch the author thread for a tweet
xgpt replies <tweet> # Fetch replies to a tweet
xgpt user-tweets <username> # Fetch a user timeline
xgpt mentions --user <name> # Fetch tweets mentioning a user
xgpt serve # Start web UI
xgpt db --stats # Database stats
xgpt config list # Show config
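A typical end-to-end flow chains three of the commands above: scrape a user's tweets, embed them, then ask a question against the embedded corpus (the username and question below are placeholders):

# Example workflow (placeholder username and question)
xgpt scrape naval # Scrape tweets from @naval into the database
xgpt embed # Generate embeddings for the scraped tweets
xgpt ask "What does this account say about leverage?" # Semantic search + GPT answer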
Start a browser-based interface with all CLI functionality:
bun dev # http://localhost:3002
xgpt serve --port 8080 # Custom port

Features:
- Dashboard - Stats overview, quick actions
- Scrape - Scrape tweets from any user
- Search - Topic-based search with filters
- Discover - Find Twitter profiles by bio/keywords
- Ask - AI Q&A with relevant tweets
- Config - Edit settings inline
- Job Taskbar - Real-time progress for long operations
Find tweets by topic using Twitter's search API:
# Find AI startup discussions from last 7 days
xgpt search "building in public, indie hacker, shipped" --days 7
# Track trending tech topics
xgpt search "AGI, GPT-5, foundation models" --name "AI Trends" --max 1000
# Preview query without executing
xgpt search "rust lang, rustacean" --dry-run
# Search and auto-embed for semantic queries
xgpt search "YC demo day, fundraising" --mode top --embed
# Resume interrupted search
xgpt search --resume 42

Search and scrape operations count against your account's rate limits. Excessive usage may trigger Twitter's anti-bot detection.
Best Practices:
- Start with `--max 100` to test queries
- Use `--dry-run` to preview before executing
- Avoid running multiple concurrent searches
- Space out large searches (1000+ tweets) by several hours
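For large pulls, one way to keep that spacing consistent is to schedule bounded searches instead of running them by hand; a minimal cron sketch, assuming xgpt is on the cron user's PATH (query, limit, and schedule are placeholders):

# Run one bounded, auto-embedded search every 6 hours and log the output
0 */6 * * * xgpt search "AGI, GPT-5, foundation models" --max 500 --embed >> ~/xgpt-search.log 2>&1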
Rate Limit Handling:
- Searches automatically wait and retry when rate limited
- Use `--resume <session-id>` if you need to restart
- Wait at least 15 minutes before retrying manually
Find Twitter profiles by bio, name, or keywords:
# Find Google engineers
xgpt users discover "google engineer" --max 20 --save
# Find AI researchers
xgpt users discover "AI researcher" --max 50
# Output as JSON
xgpt users discover "indie hacker" --json
# Script-friendly output (stable JSON envelope)
xgpt search "AGI, GPT-5" --script

Discovered profiles can be saved to the database with `--save`, storing bio, location, follower counts, and verification status.
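Both `--json` and `--script` are intended for piping into other tools. A small sketch with jq (the envelope's field names aren't documented here, so pretty-print and inspect the output before scripting against specific fields):

# Save discovery results and inspect the JSON shape before relying on it
xgpt users discover "AI researcher" --max 50 --json | jq '.' > researchers.json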
- Scrape tweets from Twitter/X using session cookies
- Generate vector embeddings via OpenAI
- Query with natural language - finds relevant tweets via cosine similarity, generates answer with GPT
- Architecture - Project structure, data flow, dependencies
- Database - Schema, migrations, optimization
- Server - Web server architecture, routes, templates
- Error Handling - Error categories, recovery suggestions, API errors
- Job Tracking - Job lifecycle, cancellation, SSE updates
- Commands - Command runner pattern, execution flow
- Validation - Input validation with TypeBox
- API Reference - REST API endpoints
- Configuration - All config options and defaults
- Components - UI component library
- Utilities - Retry logic, formatting, helpers
- Testing - Unit, integration, and E2E testing
- Bun runtime
- SQLite + Drizzle ORM
- OpenAI API (embeddings + chat)
- @the-convocation/twitter-scraper v0.21.0
- Commander.js (CLI)
- Elysia + HTMX (Web UI)
bun dev # Start web UI (localhost:3002)
bun cli # Run CLI
bun test # Run tests
bun run typecheck # Type check

MIT