# Serve relevant ads on any page: User-agent: Mediapartners-Google User-agent: Adsbot-Google Disallow: # Big, public search engines can access most of the site: User-agent: Googlebot User-agent: Googlebot-Image User-agent: Googlebot-Mobile User-agent: Slurp User-agent: yahoo-mmcrawler User-agent: MSNBot User-agent: Teoma User-agent: Twiceler User-agent: baiduspider User-agent: ia_archiver User-agent: Robozilla Disallow: /icons/ Disallow: /style/ Disallow: /this-is-a-bad-url/ # Everyone else is not welcome to crawl: User-agent: * Disallow: / # And here's where to find everything: Sitemap: http://dictionary.die.net/sitemap_index.xml