AgentSiteBot # The crawler behind the curtain. AgentSiteBot is the headless browser [agentsite](/) uses to render your single-page-app pages so they can be served back to LLM crawlers (GPTBot, ClaudeBot, PerplexityBot, and other agents that don't execute JavaScript). It only crawls domains a registered owner has connected to an account, and only after a token has been used. ## How to identify it AgentSiteBot identifies itself with a deterministic User-Agent string: User-Agentcopy ``` AgentSiteBot/1.0 (+https://agentsite.com/bot) ``` ## How to allowlist it (Cloudflare) If your site is behind Cloudflare and our renderer is being challenged, add a WAF custom rule: 1. Cloudflare → your zone → _Security → WAF → Custom rules_. 2. Match: `(http.user_agent contains "AgentSiteBot")`. 3. Action: **Skip** → Bot Fight Mode, Browser Integrity Check, Managed Challenge. 4. Deploy. We don't yet publish a static IP range — agentsite's renderer runs on rotating egress, so allowlist by User-Agent is the supported method for now. ## How to block it If you don't want AgentSiteBot to render your site, two options: - Don't register the domain in agentsite — without a valid token + matching domain, our crawler never visits. - Add a `Disallow` entry in your `robots.txt`: robots.txtcopy ``` User-agent: AgentSiteBot Disallow: / ``` ## What it does _not_ do - It does not run JavaScript that submits forms or POSTs. - It does not follow links beyond the path it was asked to render. - It does not log in, accept cookies meaningfully, or attempt to bypass auth walls. - It does not crawl your site on a schedule of its own — it renders on demand when a request from your installed snippet (or a direct API caller) reaches us. ## Contact Questions, abuse reports, allowlist help: [get in touch](/contact). Cookies We use cookies to make this site work and to understand how it's used. [Learn more](https://www.cookiesandyou.com/) Decline Got it