x-scraper

📁 aatrooox/blog.zzao.club 📅 1 day ago

Total installs

Weekly installs

Site-wide rank: #41341

Install command

npx skills add https://github.com/aatrooox/blog.zzao.club --skill x-scraper

Installs by agent

replit 1

trae 1

trae-cn 1

opencode 1

codex 1

Skill documentation

X.com Post Scraper

Extracts recent posts from X.com users with full engagement data using authenticated cookies.

Quick Start

Basic command:

cd .opencode/skills/x-scraper/scripts
python3 scraper.py <username> [count]

Example:

python3 scraper.py example_user 15

Output: /tmp/x_{username}_posts.json

Prerequisites

Before first use, verify environment requirements:

Python 3.11+: Check with python3 --version
Playwright: Check with python3 -c "import playwright"
Cookie file: Check with ls /tmp/x_cookies_pw.json

If any prerequisite is missing, see references/setup.md for the detailed installation and configuration guide.
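The three checks above can also be run in one pass from Python. This is a minimal sketch, not part of the skill itself: the function name check_prerequisites is hypothetical, and the cookie path is the default named in this document.

```python
import importlib.util
import sys
from pathlib import Path
from typing import Dict

# Default cookie path taken from this document; pass another path if you
# normally use --cookie-file.
DEFAULT_COOKIE_FILE = Path("/tmp/x_cookies_pw.json")


def check_prerequisites(cookie_file: Path = DEFAULT_COOKIE_FILE) -> Dict[str, bool]:
    """Return a pass/fail flag for each prerequisite (hypothetical helper)."""
    return {
        "python_3_11_plus": sys.version_info >= (3, 11),
        # find_spec returns None when the package cannot be imported.
        "playwright_installed": importlib.util.find_spec("playwright") is not None,
        "cookie_file_present": cookie_file.is_file(),
    }


if __name__ == "__main__":
    for name, ok in check_prerequisites().items():
        print(f"{'OK' if ok else 'MISSING':8}{name}")
```

Any entry reported as MISSING points back to references/setup.md.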

Common Workflows

First-time setup

See references/setup.md for complete environment configuration.

Daily scraping

python3 scraper.py <username> [count]

Custom cookie file

python3 scraper.py <username> [count] --cookie-file /path/to/cookies.json

Troubleshooting

If the scraper fails, see references/troubleshooting.md for common issues and solutions.

Output Format

{
  "index": 1,
  "username": "example_user",
  "postId": "1234567890123456789",
  "publishTime": "2025-12-03T18:28:32.000Z",
  "postLink": "https://x.com/example_user/status/1234567890123456789",
  "textContent": "Post text content...",
  "views": "471K",
  "likes": "1.1K",
  "retweets": "153",
  "replies": "44"
}

Key fields:

postLink – Direct URL to post
publishTime – ISO 8601 timestamp
views/likes/retweets/replies – Engagement counts as abbreviated strings (e.g. "471K", "1.1K"), not integers
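Because the metrics are abbreviated strings, they usually need converting before they can be sorted or summed. The sketch below shows one way to do that. It assumes the output file holds a list of objects shaped like the example above (a single object is also accepted) and that an empty metric means zero; the helper names parse_metric and load_posts are hypothetical.

```python
import json
from pathlib import Path
from typing import Any, Dict, List

# Multipliers for the suffixes named in this document (K, M).
_SUFFIXES = {"K": 1_000, "M": 1_000_000}


def parse_metric(value: str) -> int:
    """Convert an abbreviated count such as "1.1K" or "153" to an integer."""
    text = value.strip().replace(",", "").upper()
    if not text:
        return 0  # assumption: an empty metric means zero
    multiplier = _SUFFIXES.get(text[-1])
    if multiplier is None:
        return int(float(text))
    # round() absorbs floating-point noise, e.g. 1.1 * 1000.
    return round(float(text[:-1]) * multiplier)


def load_posts(path: Path) -> List[Dict[str, Any]]:
    """Load scraper output; assumes a JSON list of post objects."""
    data = json.loads(path.read_text(encoding="utf-8"))
    return data if isinstance(data, list) else [data]
```

For example, sorted(load_posts(Path("/tmp/x_example_user_posts.json")), key=lambda p: parse_metric(p["likes"]), reverse=True) would order posts by likes. The converted values are approximate, since "1.1K" has already lost precision.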

When to Use This Skill

Trigger when the user asks for something like:

“整理 @某人 最近的发言” (Summarize @someone's recent posts)
“看看某人在X上说了什么” (See what someone has been saying on X)
“Scrape X.com posts from @username”
“Get latest tweets from user”
“Analyze X user’s recent posts”

Available Scripts

`scraper.py` – Main scraper

python3 scraper.py <username> [count] [--cookie-file <path>]

Scrapes the user's timeline, including replies
Default count: 10 posts
Default cookie: /tmp/x_cookies_pw.json
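When the scraper is driven from another Python program, the usage line above can be wrapped as follows. This is a sketch under assumptions: build_command and run_scraper are hypothetical names, the scripts directory is the one from Quick Start, the output path follows the /tmp/x_{username}_posts.json pattern, and stripping a leading "@" is a convenience not confirmed by the source.

```python
import subprocess
import sys
from pathlib import Path
from typing import List, Optional

# Directory from the Quick Start section; adjust to your installation.
SCRIPTS_DIR = ".opencode/skills/x-scraper/scripts"


def build_command(username: str, count: int = 10,
                  cookie_file: Optional[str] = None) -> List[str]:
    """Build the argv for scraper.py, mirroring the documented usage line."""
    # Assumption: the scraper expects the handle without a leading "@".
    cmd = [sys.executable, "scraper.py", username.lstrip("@"), str(count)]
    if cookie_file is not None:
        cmd += ["--cookie-file", cookie_file]
    return cmd


def run_scraper(username: str, count: int = 10,
                cookie_file: Optional[str] = None,
                scripts_dir: str = SCRIPTS_DIR) -> Path:
    """Run the scraper and return the documented output path."""
    # check=True raises CalledProcessError on a non-zero exit code.
    subprocess.run(build_command(username, count, cookie_file),
                   cwd=scripts_dir, check=True)
    return Path(f"/tmp/x_{username.lstrip('@')}_posts.json")
```

Passing the arguments as a list avoids shell quoting issues with unusual paths.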

`convert_cookies.py` – Cookie converter

python3 convert_cookies.py <input-file> [output-file]

Converts Cookie-Editor JSON to Playwright format
Required before the first scrape
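To illustrate what such a conversion involves, here is a minimal sketch of the field mapping. It is not the actual convert_cookies.py. It assumes the Cookie-Editor export is a JSON list with keys such as expirationDate and a lowercase sameSite, and that the target is the cookie list accepted by Playwright's add_cookies; the real script may produce a different layout.

```python
from typing import Any, Dict, List

# Cookie-Editor (browser API) values mapped to Playwright's sameSite values.
_SAME_SITE = {"strict": "Strict", "lax": "Lax", "no_restriction": "None"}


def to_playwright_cookie(cookie: Dict[str, Any]) -> Dict[str, Any]:
    """Map one exported cookie to Playwright's cookie shape (hypothetical helper)."""
    raw_same_site = str(cookie.get("sameSite") or "").lower()
    return {
        "name": cookie["name"],
        "value": cookie["value"],
        "domain": cookie["domain"],
        "path": cookie.get("path", "/"),
        # Playwright uses -1 for session cookies without an expiry.
        "expires": float(cookie.get("expirationDate", -1)),
        "httpOnly": bool(cookie.get("httpOnly", False)),
        "secure": bool(cookie.get("secure", False)),
        # Assumption: unspecified or unknown values fall back to "Lax".
        "sameSite": _SAME_SITE.get(raw_same_site, "Lax"),
    }


def convert(cookies: List[Dict[str, Any]]) -> List[Dict[str, Any]]:
    """Convert a whole exported cookie list."""
    return [to_playwright_cookie(c) for c in cookies]
```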

Reference Documents

setup.md – Complete environment setup guide (Python, Playwright, cookies)
troubleshooting.md – Error diagnosis and solutions
usage.md – Detailed usage examples and advanced options

Limitations

Requires X.com authentication cookies
Cookies expire after roughly 7 days and must then be re-exported
Rate limits may apply
Cannot access private/protected accounts
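Since cookies go stale after about a week, a rough staleness check before scraping can save a failed run. The sketch below uses the cookie file's modification time as a stand-in for the export date. That is a heuristic, not the real cookie expiry, and the names cookie_file_age_days and needs_reexport are hypothetical.

```python
import time
from pathlib import Path
from typing import Optional

# Approximate lifetime taken from the limitations above.
COOKIE_LIFETIME_DAYS = 7
_SECONDS_PER_DAY = 86_400


def cookie_file_age_days(path: Path, now: Optional[float] = None) -> float:
    """Days since the cookie file was last modified."""
    now = time.time() if now is None else now
    return (now - path.stat().st_mtime) / _SECONDS_PER_DAY


def needs_reexport(path: Path, max_age_days: float = COOKIE_LIFETIME_DAYS,
                   now: Optional[float] = None) -> bool:
    """True if the file is missing or older than the assumed lifetime."""
    if not path.is_file():
        return True
    return cookie_file_age_days(path, now) >= max_age_days
```

Calling needs_reexport(Path("/tmp/x_cookies_pw.json")) before a run gives an early hint that the cookies should be exported and converted again.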
