#### Agents to block
User-agent: htdig
Disallow: /

User-agent: ia_archiver
Disallow: /

#### Our sitemap
Sitemap: http://www.cs.rit.edu/cs_sitemap.xml

#### General indexing rules
User-agent: *
Allow: /~ark/
Disallow: /~
Disallow: /images/
Disallow: /pictures/
Disallow: /flash/
Disallow: /forms/
Disallow: /sigcse/
# Drupal directories
Disallow: /includes/
Disallow: /misc/
Disallow: /modules/
Disallow: /profiles/
Disallow: /scripts/
Disallow: /sites/
Disallow: /themes/
# Drupal Files
Disallow: /CHANGELOG.txt
Disallow: /cron.php
Disallow: /INSTALL.mysql.txt
Disallow: /INSTALL.pgsql.txt
Disallow: /install.php
Disallow: /INSTALL.txt
Disallow: /LICENSE.txt
Disallow: /MAINTAINERS.txt
Disallow: /update.php
Disallow: /UPGRADE.txt
Disallow: /xmlrpc.php
# Drupal links (clean URLs)
Disallow: /admin/
Disallow: /user/
Disallow: /logout/
Disallow: /search/
Disallow: /node/add/
# Drupal links (dirty URLs)
Disallow: /?q=