Robots.txt File Generator
Generate crawl instructions for search engine bots. Start from standard presets for WordPress, Shopify, and Magento, and compile clean robots.txt rules easily.
Rules Configuration
Crawl-delay: Slows down search bots. Useful for small or shared servers to prevent overload.
What is a Robots.txt File?
A **robots.txt file** is a plain text document that resides in the root directory of your website's domain. It tells search engine crawlers (such as Googlebot, Bingbot, and YandexBot) which directories, files, or URL parameters they may crawl. A well-structured robots.txt keeps crawlers from spending crawl budget on administrative, transactional, and temporary paths (`/wp-admin/`, `/checkout/`, `/tmp/`), leaving more of that budget for the pages you want indexed. Note that robots.txt controls crawling rather than indexing: a blocked URL can still appear in search results if other pages link to it.
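As a rough illustration of how a compliant crawler consults these rules, the sketch below parses a small robots.txt with Python's standard `urllib.robotparser` module. The file content, the `example.com` URLs, and the helper name `is_allowed` are assumptions made for this example, not output of the generator.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt content; the paths mirror the examples in the text.
ROBOTS_TXT = """\
User-agent: *
Disallow: /checkout/
Disallow: /tmp/
"""


def is_allowed(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if `user_agent` may crawl `url` under the given rules."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())  # parse from text, no network access
    return parser.can_fetch(user_agent, url)


# A blocked path and an unrestricted path (placeholder URLs).
print(is_allowed(ROBOTS_TXT, "Googlebot", "https://example.com/checkout/cart"))  # False
print(is_allowed(ROBOTS_TXT, "Googlebot", "https://example.com/blog/post-1"))    # True
```

Parsing from a string keeps the check offline, which can be handy for testing a draft file before uploading it.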
Understanding directives: Allow vs Disallow
Directives are simple rules that instruct crawlers. Placed under `User-agent: *`, `Disallow: /admin/` tells all compliant bots to stay out of your administrative folder. `Allow: /wp-admin/admin-ajax.php` acts as an exception to a broader `Disallow: /wp-admin/` rule so that WordPress plugins can still load assets dynamically. Linking your XML sitemap in robots.txt with a `Sitemap:` line helps crawlers discover your pages faster, although it does not guarantee immediate crawling or indexing.
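The Allow-as-exception behavior can be sketched with the longest-match rule described in RFC 9309: the matching rule with the longest path wins, and `Allow` wins a tie. The function below is a simplified illustration that handles only literal path prefixes (no `*` or `$` wildcards); the name `is_path_allowed` and the rule list are assumptions made for this example.

```python
from typing import List, Tuple

Rule = Tuple[str, str]  # (directive, path prefix), e.g. ("Disallow", "/wp-admin/")


def is_path_allowed(rules: List[Rule], path: str) -> bool:
    """Apply longest-match precedence to the rules of one user-agent group."""
    best_len = -1
    allowed = True  # no matching rule means the path may be crawled
    for directive, prefix in rules:
        if not prefix or not path.startswith(prefix):
            continue  # an empty pattern or a non-matching rule has no effect
        is_allow = directive.lower() == "allow"
        # A longer match is more specific; on equal length, Allow wins.
        if len(prefix) > best_len or (len(prefix) == best_len and is_allow):
            best_len = len(prefix)
            allowed = is_allow
    return allowed


WORDPRESS_RULES = [
    ("Disallow", "/wp-admin/"),
    ("Allow", "/wp-admin/admin-ajax.php"),
]

print(is_path_allowed(WORDPRESS_RULES, "/wp-admin/admin-ajax.php"))  # True
print(is_path_allowed(WORDPRESS_RULES, "/wp-admin/options.php"))     # False
```

Not every parser follows this precedence. Python's `urllib.robotparser`, for instance, applies the first matching rule in file order, so listing the `Allow` line before the broader `Disallow` line is the safer ordering.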
Robots Checklist
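- `User-agent: *`: Standard rule for all bots.
- `Disallow: /private/`: Keeps compliant bots out of sensitive paths. The file itself is public, so this is not a security control.
- Crawl-delay limits: Protects server health. Googlebot ignores this directive; some other crawlers honor it.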
- Sitemap linkage: Speeds up page discovery.
- Root path upload: Must be in the root directory.
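The checklist items can be verified programmatically. The sketch below, which assumes Python 3.8 or newer for `site_maps()`, derives the root-level robots.txt URL for any page and reads the crawl delay and sitemap entries from a sample file. The sample content, the `example.com` URLs, and the helper names `robots_url_for` and `summarize` are illustrative assumptions.

```python
from urllib.parse import urlsplit, urlunsplit
from urllib.robotparser import RobotFileParser

# Hypothetical file covering each checklist item.
SAMPLE = """\
User-agent: *
Disallow: /private/
Crawl-delay: 10

Sitemap: https://example.com/sitemap.xml
"""


def robots_url_for(page_url: str) -> str:
    """Crawlers look for robots.txt only at the root of the host."""
    parts = urlsplit(page_url)
    return urlunsplit((parts.scheme, parts.netloc, "/robots.txt", "", ""))


def summarize(robots_txt: str) -> dict:
    """Collect the checklist-relevant settings from robots.txt text."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return {
        "crawl_delay": parser.crawl_delay("*"),
        "sitemaps": parser.site_maps(),  # None when no Sitemap line exists
        "private_blocked": not parser.can_fetch("*", "/private/report.html"),
    }


print(robots_url_for("https://example.com/blog/post?page=2"))
print(summarize(SAMPLE))
```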
Common CMS Standard Paths
| CMS Platform | Block Targets (Disallow) | Crawl Delay Standard |
|---|---|---|
| WordPress | /wp-admin/, /wp-includes/ | None (Default) |
| Shopify | /cart, /orders, /checkout | None (Default) |
| Magento | /checkout/, /app/, /customer/ | None (Default) |
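A preset-driven generator along the lines of the table above might look like the following minimal sketch. It is not the tool's actual implementation: the `CMS_PRESETS` mapping copies the table's Disallow targets, while the function name `build_robots_txt`, its parameters, and the sitemap URL are assumptions made for illustration.

```python
from typing import Optional

# Disallow targets taken from the table above.
CMS_PRESETS = {
    "wordpress": ["/wp-admin/", "/wp-includes/"],
    "shopify": ["/cart", "/orders", "/checkout"],
    "magento": ["/checkout/", "/app/", "/customer/"],
}


def build_robots_txt(
    cms: str,
    sitemap_url: Optional[str] = None,
    crawl_delay: Optional[int] = None,
) -> str:
    """Compile a robots.txt for one CMS preset, applied to all bots."""
    lines = ["User-agent: *"]
    lines += [f"Disallow: {path}" for path in CMS_PRESETS[cms.lower()]]
    if crawl_delay is not None:  # the presets default to no crawl delay
        lines.append(f"Crawl-delay: {crawl_delay}")
    if sitemap_url:
        lines += ["", f"Sitemap: {sitemap_url}"]  # blank line ends the group
    return "\n".join(lines) + "\n"


print(build_robots_txt("WordPress", sitemap_url="https://example.com/sitemap.xml"))
```

An unknown platform name raises a `KeyError` here; a real generator would likely validate the input and fall back to custom rules.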