Robots.txt File Generator

Create search engine crawler directories indexing instructions. Use standard presets for WordPress, Shopify, and Magento, and compile highly secure crawlers rules easily.

Quick Presets:

Rules Configuration

Slows down search bots. Useful for small/shared servers to prevent overload.

Add Custom Crawl Rule

Active Instruction Rules

Robots.txt Output

COMPILED ROBOTS.TXT
# Compiled robots.txt goes here
                    

What is a Robots.txt File?

A **robots.txt file** is a plain text document that resides in the root directory of your website domain. It communicates directly with search engine indexing bots (crawlers like Googlebot, Bingbot, YandexBot) to specify which directories, files, or parameters they are allowed to crawl. Proper robots.txt structuring prevents crawlers from wasting crawl budget on private database folders (`/wp-admin/`, `/checkout/`, `/tmp/`), directly optimizing your domain indexation capacity.

Understanding directives: Allow vs Disallow

Directives are simple rules instructing crawlers. `Disallow: /admin/` tells all compliant bots to stay away from your administrative folder. `Allow: /wp-admin/admin-ajax.php` serves as an exception to let Wordpress plugins load assets dynamically. Always linking your XML Sitemap inside robots.txt guarantees that crawlers discover all target organic posts instantly.

Robots Checklist

  • User-agent: *: Standard rule for all bots.
  • Disallow: /private/: Hides sensitive paths.
  • Crawl-delay limits: Protects server health.
  • Sitemap linkage: Speeds up page discovery.
  • Root path upload: Must be in the root directory.

Common CMS Standard Paths

CMS Platform Block Targets (Disallow) Crawl Delay Standard
WordPress /wp-admin/, /wp-includes/ None (Default)
Shopify /cart, /orders, /checkout None (Default)
Magento /checkout/, /app/, /customer/ None (Default)

Frequently Asked Questions

Where should I upload my generated robots.txt file?
You must upload the file to your website host's absolute root directory (typically the `public_html` or `www` folder). It must be accessible directly at `https://yourdomain.com/robots.txt`.
Do all search engines respect the robots.txt file?
Yes, all major, reputable search crawlers (Google, Bing, Yahoo, Yandex, DuckDuckGo) follow compliant robots.txt specifications. However, malicious scraper bots may ignore them.
What happens if my robots.txt file is missing?
If a crawler cannot find a robots.txt file in your domain root, it assumes you have no disallow guidelines, meaning it will attempt to index and crawl every publicly accessible folder and link on your site.