cookiepal.oss
CLI

Crawl

Crawl a site and inventory cookies.

cookiepal crawl <url>

Drives a headless Chromium session over <url> using Crawlee + Puppeteer, does a BFS walk of same-origin links, and writes every cookie it observes into the SQLite file. The dashboard reads those rows for categorization and config.

Flags

  • --max-pages <n> — maximum pages to visit. Default 50.
  • --max-depth <n> — maximum link depth from the seed URL. Default 3.
  • --concurrency <n> — parallel browser pages. Default 3.
  • --fresh — ignore cache and do a full crawl.
  • --proxy <url> — route the crawl through an HTTP/HTTPS/SOCKS5 proxy. Supports inline basic auth (http://user:pass@host:port).
  • --timeout <secs> — per-page navigation timeout in seconds. Default 60. Handler timeout scales to 2 × this value.

What you get

  • Rows in the cookies table keyed by site, name, and domain.
  • A site entry ready to inspect in cookiepal dashboard.
  • Nothing is sent off the machine — the catalog ships inside the CLI via @cookiepal-oss/cookie-db.

On this page