email-finder

📁 psyduckler/openclaw-sales-skills 📅 4 days ago

总安装量

周安装量

#69151

全站排名

安装命令

npx skills add https://github.com/psyduckler/openclaw-sales-skills --skill email-finder

Agent 安装分布

replit 1

openclaw 1

Skill 文档

Email Finder

Discover email addresses associated with a domain using multiple methods.

How It Works

Website Scraping â Fetches homepage, /contact, /about, /team pages and extracts emails via regex
Search Dorking â Searches for published emails in directories and search engines
Pattern Guessing â If a name is provided, generates common patterns (first@, first.last@, flast@, etc.)
DNS Hints â Checks MX/SPF/DMARC records to identify the email provider
SMTP Verification â Verifies all found/guessed emails using RCPT TO

Dependencies

pip3 install dnspython

Usage

Basic domain search

python3 scripts/find_emails.py example.com

With name for pattern guessing

python3 scripts/find_emails.py example.com --name "John Smith"

Skip SMTP verification

python3 scripts/find_emails.py example.com --no-verify

Options

--name "First Last" â Enable pattern guessing for a specific person
--no-verify â Skip SMTP verification step
--timeout SECONDS â Connection timeout (default: 10)

Output

JSON to stdout:

{
  "domain": "example.com",
  "provider": "Google Workspace",
  "mx": ["aspmx.l.google.com"],
  "spf": "v=spf1 include:_spf.google.com ~all",
  "dmarc": "v=DMARC1; p=reject; rua=mailto:dmarc@example.com",
  "emails_found": 2,
  "emails": [
    {
      "email": "info@example.com",
      "source": "scraped",
      "deliverable": "yes",
      "smtp_detail": "2.1.5 OK"
    },
    {
      "email": "john.smith@example.com",
      "source": "guessed",
      "deliverable": "catch-all",
      "smtp_detail": "2.1.5 OK"
    }
  ]
}

Source values

Value	Meaning
`scraped`	Found on the domain’s website
`searched`	Found via search/directory lookup
`guessed`	Generated from name patterns
`dns`	Found in DNS records (DMARC reports, etc.)

Deliverable values

Value	Meaning
`yes`	Server accepted the recipient
`no`	Server rejected the recipient (invalid)
`catch-all`	Server accepts all addresses
`unknown`	Could not determine
`not_checked`	Verification was skipped

Rate Limiting

The script includes built-in rate limiting at every stage to protect your IP:

# Defaults: 0.5s between page fetches, 2s between SMTP checks, max 15 SMTP checks
python3 scripts/find_emails.py example.com --name "John Smith"

# Conservative settings for sensitive environments
python3 scripts/find_emails.py example.com --scrape-delay 1.0 --smtp-delay 4 --max-smtp-checks 8

# Just scrape, no SMTP (zero risk)
python3 scripts/find_emails.py example.com --no-verify

Options

--scrape-delay SECONDS â Pause between website page fetches (default: 0.5)
--smtp-delay SECONDS â Pause between SMTP verification checks (default: 2.0)
--max-smtp-checks N â Max SMTP verifications per run (default: 15). Remaining emails get not_checked status.

Why rate limiting matters

This tool hits both web servers and mail servers. Without rate limiting:

Web scraping â Aggressive crawling gets your IP blocked by WAFs (Cloudflare, etc.) and makes you look like a bot. Respectful delays avoid this.
SMTP verification â Mail servers flag IPs making rapid RCPT TO requests. Your IP can get blacklisted, affecting your ability to send real email.
Residential IPs are fragile â Unlike datacenter IPs, your home/office IP is shared across all your internet activity. Getting it blacklisted affects everything.

Guidelines for agents

Scenario	Recommended approach
Single domain lookup	Defaults are fine
Domain + name pattern guessing	Defaults are fine (15 SMTP checks covers all patterns)
Multiple domains in sequence	Add 5-10s pause between domains. Don’t run more than 20 domains/day
Just need the email provider	Use `--no-verify` â DNS-only, zero risk
Bulk prospecting (50+ domains)	Use a paid service (Hunter.io, Apollo) or spread across multiple days

Key principle: The script is designed for targeted lookups, not mass scraping. If you need to process hundreds of domains, use a dedicated service with proper IP reputation management.

Limitations

Website scraping depends on emails being visible in page source (won’t find obfuscated/JS-rendered emails)
Search engines may block automated queries
SMTP verification requires outbound port 25 access
Catch-all domains accept all addresses â can’t confirm real inboxes
Be respectful: the script adds delays between requests but don’t run it in tight loops

GitHub 仓库 ↗ ← 返回陌讯 Skills 聚合平台