What is a Robots.txt Generator?
robots.txt is a simple text file that tells search engine crawlers which parts of your site they can and cannot access. It's placed at your site's root (example.com/robots.txt) and is one of the first things crawlers check.
This generator helps you create a properly formatted robots.txt with common rules for blocking admin areas, private directories, and more.
Common Robots.txt Directives
Key directives you can use (a complete example follows the list):
- User-agent: * - Applies the rules that follow to all crawlers
- Allow: / - Permits crawling of the entire site
- Disallow: /path/ - Blocks a specific directory
- Sitemap: url - Points crawlers to your XML sitemap
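Put together, a minimal robots.txt built from these directives might look like the sketch below; example.com and the /admin/ path are placeholders to replace with your own values:

# Apply the rules below to every crawler
User-agent: *
# Allow everything except the admin area
Allow: /
Disallow: /admin/
# Point crawlers at the XML sitemap
Sitemap: https://example.com/sitemap.xml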
What to Block
Block admin panels, login pages, and CMS backends - they waste crawl budget and shouldn't be indexed anyway.
Also block API endpoints, staging directories, and development files, along with any private user content or duplicate-content paths.
Don't block CSS or JavaScript files, though: Google needs them to render your pages properly.
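As a sketch, a file following this advice might look like the one below. The specific paths are assumptions for illustration (the /wp-admin/ and admin-ajax.php lines reflect a common WordPress setup) and should be adapted to your own site:

User-agent: *
# Backend, API, staging, and private areas
Disallow: /wp-admin/
Disallow: /login/
Disallow: /api/
Disallow: /staging/
Disallow: /private/
# Keep this endpoint crawlable so pages that depend on it still render (WordPress convention)
Allow: /wp-admin/admin-ajax.php
# Help crawlers find the pages you do want indexed
Sitemap: https://example.com/sitemap.xml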
Frequently Asked Questions
What is robots.txt?
robots.txt is a file that tells search engine crawlers which pages or sections of your site to crawl or skip. It lives at your site's root (example.com/robots.txt) and helps manage crawler traffic and indexing.
Does robots.txt hide pages from Google?
No! robots.txt only prevents crawling, not indexing. If other sites link to a blocked page, Google may still index its URL. To truly keep a page out of search results, use a noindex meta tag or password protection.
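If the goal is to keep a page out of results entirely, add a noindex directive instead; note that the page must remain crawlable (not blocked in robots.txt) so Google can actually see it. In the page's head, for example:

<meta name="robots" content="noindex">

The same directive can also be sent as an HTTP response header (X-Robots-Tag: noindex), which works for PDFs and other non-HTML files.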
What does Disallow: / do?
Disallow: / tells crawlers not to access any page on your site. Use it carefully: over time it will drop your entire site from search results. It's mainly useful for development and staging sites.
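For a staging or development site, the entire file is often just two lines:

# Block every crawler from every path
User-agent: *
Disallow: /

Remember to remove or relax this rule when the site goes live, or it will stay out of search results.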
Should I include a sitemap reference?
Yes! Adding Sitemap: https://yoursite.com/sitemap.xml helps search engines discover your content structure. It's a simple addition that aids indexing.
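The Sitemap line can appear anywhere in the file, and you can list more than one; the URLs below are placeholders:

Sitemap: https://yoursite.com/sitemap.xml
Sitemap: https://yoursite.com/blog-sitemap.xml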