User-agent: Peregrinator-Mathematics record in their /robots.txt file.
So far few mathematics Web administrators are making use of this file: of about 90 servers indexed by the Peregrinator on its first run, only two (one of them my own) had a /robots.txt. (Another half dozen erroneously return their default page when /robots.txt is requested: this seems to be an idiosyncrasy of the GN HTTP daemon.)
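A robot has to cope with servers that answer a request for /robots.txt with an ordinary page. The following is only a rough heuristic sketch, not a description of what the Peregrinator actually does: the function name `looks_like_robots_txt` is hypothetical, and the rule it applies (every non-blank, non-comment line must be a `User-agent` or `Disallow` field) is an assumption chosen for simplicity.

```python
import re

# Assumption: only the two fields used in this article are recognised.
FIELD_RE = re.compile(r"^(user-agent|disallow)\s*:", re.IGNORECASE)


def looks_like_robots_txt(body):
    """Return True if body plausibly is a robots.txt file rather than a page."""
    for raw_line in body.splitlines():
        line = raw_line.split("#", 1)[0].strip()  # drop comments and padding
        if not line:
            continue                              # blank lines separate records
        if not FIELD_RE.match(line):
            return False                          # e.g. HTML markup or prose
    return True


print(looks_like_robots_txt("User-agent: *\nDisallow: /\n"))
print(looks_like_robots_txt("<title>Maths Dept</title>\n<h1>Welcome</h1>\n"))
```

An empty body passes the check, which matches the usual reading that an empty /robots.txt imposes no restrictions. A body that fails the check would be treated as if no /robots.txt existed.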
    User-agent: Peregrinator-Mathematics
    Disallow: /CompSci/    # (though the Maths-CS boundary is ill-defined)
    Disallow: /Games/
    Disallow: /Unixhelp/
    Disallow: /Bigdummy/
    Disallow: /cgi-bin/finger
    Disallow: /man/

This would of course require variations depending on what is present and how it is named on a given server.
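To see how a conforming robot might interpret such a record, here is a minimal sketch using Python's standard `urllib.robotparser` module. It is not the Peregrinator's own code, and the two page paths tested (`/Games/chess.html` and `/Algebra/groups.html`) are made up for illustration.

```python
from urllib.robotparser import RobotFileParser

# The example record from the text, with its comment left in place.
EXAMPLE_ROBOTS_TXT = """\
User-agent: Peregrinator-Mathematics
Disallow: /CompSci/    # (though the Maths-CS boundary is ill-defined)
Disallow: /Games/
Disallow: /Unixhelp/
Disallow: /Bigdummy/
Disallow: /cgi-bin/finger
Disallow: /man/
"""

AGENT = "Peregrinator-Mathematics"

parser = RobotFileParser()
parser.parse(EXAMPLE_ROBOTS_TXT.splitlines())  # parse text directly, no network

# Hypothetical paths: one under a disallowed prefix, one not.
print(parser.can_fetch(AGENT, "/Games/chess.html"))
print(parser.can_fetch(AGENT, "/Algebra/groups.html"))

# A robot that is not named in the file is not restricted by this record.
print(parser.can_fetch("SomeOtherRobot", "/Games/chess.html"))
```

Each `Disallow` value is matched as a prefix of the requested path, so `/cgi-bin/finger` also covers anything whose path begins with that string.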
To keep the Peregrinator off a server entirely, the record would be

    User-agent: Peregrinator-Mathematics
    Disallow: /

To block all robots, you could say
    User-agent: *
    Disallow: /

However, it would be a pity if many sites did this, as the information they contain would not then be automatically indexed by any conforming robot.
Since so far /robots.txt files are rare, the Peregrinator only checks for new or changed ones about once a week.
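A weekly re-check of this kind could be sketched as a small cache keyed by host. This is an illustration under stated assumptions rather than the Peregrinator's actual mechanism: the class `RobotsCache`, the callable `fetch_text`, and the host name `maths.example.org` are all hypothetical, and the clock is injected only so the behaviour can be demonstrated without waiting or touching the network.

```python
import time
from urllib.robotparser import RobotFileParser

WEEK_SECONDS = 7 * 24 * 60 * 60  # "about once a week"


class RobotsCache:
    """Keep one parsed /robots.txt per host and re-fetch it when it is stale."""

    def __init__(self, fetch_text, clock=time.time, max_age=WEEK_SECONDS):
        self.fetch_text = fetch_text  # hypothetical callable: host -> file text
        self.clock = clock            # injectable so the sketch can be tested
        self.max_age = max_age
        self.entries = {}             # host -> (time fetched, parser)

    def parser_for(self, host):
        now = self.clock()
        cached = self.entries.get(host)
        if cached is not None and now - cached[0] < self.max_age:
            return cached[1]          # still fresh: no new request
        parser = RobotFileParser()
        parser.parse(self.fetch_text(host).splitlines())
        self.entries[host] = (now, parser)
        return parser


# Demonstration with a fake clock and a fake fetcher (no network access).
fake_now = [0.0]
fetch_log = []


def fake_fetch(host):
    fetch_log.append(host)
    return "User-agent: *\nDisallow: /private/\n"


cache = RobotsCache(fake_fetch, clock=lambda: fake_now[0])
cache.parser_for("maths.example.org")  # first request: fetches
fake_now[0] += 3 * 24 * 60 * 60
cache.parser_for("maths.example.org")  # three days later: served from cache
fake_now[0] += 5 * 24 * 60 * 60
cache.parser_for("maths.example.org")  # eight days after the fetch: fetches again
print(len(fetch_log))                  # prints 2
```

The trade-off is the one the text implies: a longer interval means fewer requests for a file that rarely exists, at the cost of honouring a new or changed /robots.txt up to a week late.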