Parameter Description
Directory & file names
Web directories to index:
Root directory of the web-tree:
Create an index of all files in and under the web directory
(or web directories).
The web directories
should NOT be fully qualified names. They should
be relative to the root directory of the web tree.
Enter web directories as a space-delimited list.
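As an illustration of how a space-delimited list of relative web directories combines with the root directory, here is a minimal Python sketch. The function name resolve_web_dirs and the sample root d:\www are hypothetical, and drive-letter (OS/2 or Windows style) paths are assumed; this is not how GOSWISH itself is implemented.

```python
from pathlib import PureWindowsPath


def resolve_web_dirs(root, dirs_field):
    """Join each relative web directory onto the root of the web tree.

    `dirs_field` is the space-delimited list as typed into the form.
    Fully qualified names (with a drive or a leading slash) are rejected.
    """
    resolved = []
    for name in dirs_field.split():
        # A non-empty anchor means a drive letter and/or a leading slash.
        if PureWindowsPath(name).anchor:
            raise ValueError(f"not a relative directory: {name}")
        resolved.append(str(PureWindowsPath(root) / name))
    return resolved


if __name__ == "__main__":
    for path in resolve_web_dirs(r"d:\www", r"samples docs\guides"):
        print(path)
```

Rejecting fully qualified names up front mirrors the rule above that web directories must be relative to the root directory of the web tree.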
SWISH index name (fully qualified):
(a relative file name is assumed
to be relative to your SWISH directory)
Search-form document to create:
(a relative file name is assumed
to be relative to your web directory)
The name of the SWISH index file to create. If not specified,
an index file with a random name will be created in your SWISH directory.
The SWISH directory is specified with the INDEX_DIR parameter
in GOSWISH.CMD.
The search-form document is an HTML document that will
contain a link to the search mode of GOSWISH.
Indexing Rules
A "replace with" rule:
This should contain two quoted strings. It is mainly used
to convert filenames into URLs. For example:
"\www\samples" "http://www.mysite.net/samples"
By default, a replace rule that generates URLs pointing back to the file
will be used.
We HIGHLY recommend use of this default!
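To show what a replace rule of this form does to a file name, here is a minimal Python sketch. The function name apply_replace_rule and the sample file path are hypothetical. Converting the remaining backslashes to forward slashes, and replacing only the first occurrence, are assumptions; SWISH's own matching may differ.

```python
def apply_replace_rule(path, old, new):
    """Turn a file path into a URL using one (old, new) replace rule.

    Only the first occurrence of `old` is replaced. Backslashes left in
    the path are then converted to forward slashes (an assumption).
    """
    if old not in path:
        return path  # the rule does not apply; leave the path unchanged
    replaced = path.replace(old, new, 1)
    return replaced.replace("\\", "/")


if __name__ == "__main__":
    url = apply_replace_rule(
        r"\www\samples\docs\intro.htm",
        r"\www\samples",
        "http://www.mysite.net/samples",
    )
    print(url)
```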
Files to index:
Files with these extensions will be indexed. Both the file name and
the contents of the file will be indexed.
Do not index contents of these files:
Files with these names will not have their contents indexed
(only the filenames will be indexed). This list must be a subset
of the "Files to index" list.
Do not index rules:
PathName:
Directory:
Filename:
Title:
These file rules limit which directories and files are searched.
The first word should be contains, followed by a space-delimited list.
PathName: If the pathname (to the file, or to the directory)
contains any of these strings, do not index.
Directory: If one of these files is in the directory, do not index any file
in the directory.
Filename: If the filename contains one of these strings, do not index the file.
Title: If the title contains one of these strings, do not index.
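The "contains" rules above amount to a substring test against a list of strings. A minimal Python sketch follows; the function names parse_contains_rule and is_excluded, and the sample rule, are hypothetical, and case-sensitive matching is an assumption rather than documented SWISH behavior.

```python
def parse_contains_rule(rule):
    """Split a rule such as 'contains private tmp' into its strings."""
    words = rule.split()
    if not words or words[0].lower() != "contains":
        raise ValueError("rule must start with the word 'contains'")
    return words[1:]


def is_excluded(value, rule):
    """True if `value` (a pathname, filename, or title) contains any
    of the strings listed in the rule."""
    return any(fragment in value for fragment in parse_contains_rule(rule))


if __name__ == "__main__":
    rule = "contains private tmp"
    print(is_excluded("/docs/private/notes.htm", rule))
    print(is_excluded("/docs/public/index.htm", rule))
```

The same test would apply to each of the PathName, Filename, and Title rules; the Directory rule differs in that it checks for the presence of a named file in the directory.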
More Options
Two limits (percent #_files) to use to identify common words:
After indexing, SWISH can automatically identify the most common words and
omit them from the index according to these parameters. For example:
IgnoreLimit 75 250 --
ignore all words that occur in over 75% of the files and that
also occur in over 250 different files.
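The two limits work together: a word is dropped only when it exceeds both. A minimal Python sketch of that test, assuming a hypothetical function common_words and a hypothetical table of per-word file counts (this is not SWISH's own code):

```python
def common_words(file_counts, total_files, percent, min_files):
    """Return the words that occur in more than `percent` percent of the
    files AND in more than `min_files` different files.

    `file_counts` maps each word to the number of files it appears in.
    """
    return {
        word
        for word, count in file_counts.items()
        if count * 100 > percent * total_files and count > min_files
    }


if __name__ == "__main__":
    # Hypothetical counts for an index of 400 files.
    counts = {"the": 390, "index": 260, "swish": 40}
    print(common_words(counts, 400, 75, 250))
```

With these sample counts, "index" appears in more than 250 files but in only 65% of them, so it stays in the index. The second limit also keeps small sites from losing words: with 100 files, no word can exceed 250 files, so nothing is dropped.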
Common words (to be ignored):
Ignore these "commonly occurring" words. If you leave this blank (or enter
SwishDefault), a default set (of about a thousand words) will be used.
The name of this index:
the administrator:
the description:
a pointer:
These are strictly optional items used to identify the index. Leave them
blank and some basic client, server, and selector
information will be used.
Description Cache
Create a description cache file for:
No files
HTML documents
All text documents
For every match found during a search of a SWISH index,
the URL of the matching document, its TITLE, and a relevancy score are displayed.
You can also display a description that is generated from the contents of
the document.
List of HTML document extensions:
Files ending with these extensions are treated as HTML documents
(descriptions will use meta elements, headers, etc.)
Name of directory-specific description file:
Filename.ext only: do not include a path or drive.
To allow you to specify your own descriptions (say, for image files), you
can create a (set of) directory-specific description files.