Question 1

How does the tester decide which rule wins?

Accepted Answer

It follows Google's rule precedence: the most specific rule wins, where specificity is the number of characters in the rule path. The longest matching Allow or Disallow decides. When an Allow and a Disallow match the same length, Allow wins.

Question 2

How are User-agent groups chosen?

Accepted Answer

Crawlers obey the most specific group that names them. An exact or matching user-agent token (like Googlebot) overrides the catch-all star group, and only that one group's rules are evaluated. Unlisted bots fall back to the star group.

Question 3

What do the * wildcard and $ anchor mean?

Accepted Answer

An asterisk matches any run of characters, so Disallow: /*.pdf blocks every PDF path. A dollar sign at the end anchors the match to the end of the URL, so Disallow: /*.php$ blocks /index.php but not /index.php?id=1.

Question 4

Does an empty Disallow block anything?

Accepted Answer

No. A bare Disallow: line with no path means allow everything for that group. It is the standard way to grant a crawler full access to your site.

Question 5

Does robots.txt keep a page out of Google's index?

Accepted Answer

No. It only controls crawling. A blocked URL can still be indexed if other pages link to it. To keep a page out of search results, allow crawling and add a noindex meta tag, or use HTTP authentication.

Question 6

Is my robots.txt or URL sent to a server?

Accepted Answer

No. The test runs entirely in your browser. Nothing you paste is uploaded, logged, or stored anywhere.

Robots.txt Tester

How to test a URL against robots.txt

Examples

Frequently asked questions

Related tools

Robots.txt Generator

XML Sitemap Generator

Canonical Tag Generator

Article Schema Generator

Breadcrumb Schema Generator

Bulk Hreflang Generator