These are example scripts that you can use the with "prog" document source
feature of Swish-e.

The "prog" document source feature of Swish-e allow you to index any type of
document, provided you can convert the document into a format that Swish-e
can parse (text, html, or xml).

spider.pl
	Working example of a web spider.  This program is a full-featured
	spider, that is fully customizable through its configuration file.

        Note: spider.pl is installed in the scripts directory.
              Running swish-e -h will display the scripts directory.

SwishSpiderConfig.pl
	Example configuration file for the spider.pl program

file.pl 
	A very simple examle of a program that feeds documents to swish.
	Its purpose it to demonstrate how to write a program for use with
	Swish-e's "prog" input method.
	

DirTree.pl 
	A slightly more advanced example that reads a directory tree and indexes
	a few files types.  Uses the pdf2xml module for pdf files.
	Its purpose it to demonstrate how to write a program for use with
	Swish-e's "prog" input method.


MySQL.pl
	Another simple example that shows how to index data stored in a
	MySQL database.  Instructions are included on how to configure the
	swish.cgi program 


index_hypermail.pl
	An example program for indexing mailing list archives that are created
	with the popular Hypermail program.


pdf2xml.pm and pdf2html.pm
	Perl modules to convert pdf to xml documents for indexing.
	Requires the pdftotext program.  Type perldoc pdf2xml.pm
	or perldoc pdf2html.pm from the prog-bin directory for documentation.
	

doc2txt.pm
	Perl module to convert MS Word documents to text.
	Requires the catdoc program.  Type perldoc doc2txt.pm
	from the prog-bin directory for documentation.


Note: The modules to convert PDF and MS Word documents are outdated.
See SWISH::Filter for more information.
