Date: Fri, 8 Feb 2002 17:10:09 -0600 (CST) From: Gilles Detillieux To: "ht://Dig mailing list" Subject: [htdig] PATCH - more documentation fixes for 3.1.6 This patch fixes a few more documentation errors in 3.1.6. Some links in the FAQ were relative when the should have been absolute. The links for the Standard for Robot Exclusion were outdated and should have been fixed back in December when I first noticed the problem. Finally, there are updates to FAQ 1.15 and the where.html file. Apply this patch in your main htdig-3.1.6 source directory using the command: patch -p0 < this-message-file --- ../htdig3-1-x/htdoc/FAQ.html Thu Jan 31 11:45:37 2002 +++ htdoc/FAQ.html Fri Feb 8 15:33:19 2002 @@ -408,6 +408,12 @@ You can find the answer yourself to almost all "how can I..." questions by exploring what the various configuration attributes and search form input parameters can do. + Also have a look at our collection of + Contributed Guides + for help on things like + HTML + forms and CGI, tutorials on installing, configuring, using, and + internationalizing ht://Dig, as well as using PHP with htsearch.

Finally, if you've exhausted all the online documentation, there's the @@ -503,7 +509,7 @@ version of ht://Dig?

The ht://Dig package can be built on the Win32 platform when using the Cygwin package. For details, see the contributed guide, - + Idiot's Guide to Installing ht://Dig on Win32.

@@ -719,7 +725,7 @@ following line would do it:

For CGI and PHP, you need a "wrapper" script to do that. For perl script examples, see the files in contrib/ewswrap. The PHP guide (see contributed + href="http://www.htdig.org/contrib/guides.html">contributed guides) not only describes a wrapper script for PHP, but also offers a step by step tutorial to the basics of ht://dig and is well worth reading. @@ -1235,7 +1241,7 @@ require group foo 4.22. How can I get htdig to ignore the robots.txt file or meta robots tags?

You can't, and you shouldn't. The - + Standard for Robot Exclusion exists for a very good reason, and any well behaved indexing engine or spider should conform to it. If you have a problem with a robots.txt file, you really should @@ -1857,7 +1863,7 @@ find /path/to/your/document/root -type f

forbidden by server robots.txt!
A substring of the URL matches one of the items disallowed in the servers robots.txt file. See - + A Standard for Robot Exclusion. This message exists only in the 3.2.0 betas. In 3.1.5 and earlier, this condition is only caught later, resulting in the message --- ../htdig3-1-x/htdoc/htdig.html Sun Jan 27 07:44:57 2002 +++ htdoc/htdig.html Fri Feb 8 15:33:19 2002 @@ -218,7 +218,7 @@ htmerge, htsearch, Configuration file format, and - + A Standard for Robot Exclusion.
--- ../htdig3-1-x/htdoc/meta.html Sun Jan 27 07:44:57 2002 +++ htdoc/meta.html Fri Feb 8 15:33:19 2002 @@ -211,8 +211,8 @@ "follow" and "all" which obviously are the opposite of the other values and are the default behavior. For more information on META robots tags, check out the - - HTMLAuthor's Guide to the Robots META tag. + + HTML Author's Guide to the Robots META tag.
keywords --- ../htdig3-1-x/htdoc/require.html Mon Jan 28 07:44:51 2002 +++ htdoc/require.html Fri Feb 8 15:33:19 2002 @@ -43,7 +43,7 @@ * Robot exclusion is supported
- The + The Standard for Robot Exclusion is supported by ht://Dig.
--- ../htdig3-1-x/htdoc/where.html Mon Jan 28 07:44:51 2002 +++ htdoc/where.html Fri Feb 1 12:47:21 2002 @@ -82,13 +82,13 @@ htdig.europeanservers.net (HTTP) - (FTP) + (HTTP) - (FTP) + (HTTP) - (FTP) + (HTTP) - (FTP) + www.it.htdig.org @@ -103,6 +103,13 @@   + opdenbrouw.nl + (HTTP) + (HTTP) + (HTTP) + (HTTP) + +
-- Gilles R. Detillieux E-mail: Spinal Cord Research Centre WWW: http://www.scrc.umanitoba.ca/~grdetil Dept. Physiology, U. of Manitoba Phone: (204)789-3766 Winnipeg, MB R3E 3J7 (Canada) Fax: (204)789-3930 _______________________________________________ htdig-general mailing list To unsubscribe, send a message to with a subject of unsubscribe FAQ: http://htdig.sourceforge.net/FAQ.html