root/swish-e/trunk/pod/README.pod

Revision 1663, 8.2 kB (checked in by whmoseley, 4 years ago)

Remove bit about pdf docs

1 =head1 NAME
2
3 The Swish-e README File
4
5 =head1 Upgrading?
6
7 If you are upgrading Swish-e, please review the CHANGES file before installation.
8 The index format may change and existing indexes may need to be re-created before
9 use.
10
11 =head1 OVERVIEW
12
13 Swish-e is B<S>imple B<W>eb B<I>ndexing B<S>ystem for B<H>umans - B<E>nhanced.
14 Swish-e can quickly and easily index directories of files or remote web sites
15 and search the generated indexes.
16
17 Swish-e is extremely fast in both indexing and searching, highly
18 configurable, and can be seamlessly integrated with existing web sites to
19 maintain a consistent design. Swish-e can index web pages, but can just as
20 easily index text files, mailing list archives, or data stored in a
21 relational database.
22
23 Swish-e is designed to index small- to medium-sized collections of documents.
24 Although a few users are indexing over a million documents, typical usage
25 is more often in the tens of thousands.  Currently, Swish-e only indexes
26 eight-bit character encodings.
27
28 Swish-e version 2.2 was a major rewrite of the code and added many
29 new features.  Memory requirements for indexing have been reduced and
30 indexing speed is significantly improved over previous versions. New
31 features allow more control over indexing, better document parsing, improved
32 indexing and searching logic, better filter code, and the ability to index
33 from any data source.
34
35 Swish-e version 2.4 includes a major rewrite of the C API and a new Perl
36 module for accessing the Swish-e C library.  In addition, Swish-e 2.4 uses
37 the GNU Autotools.  The significant changes are where files are installed
38 and the use of Libtool to build the Swish-e library as a shared library on
39 many platforms.  As a result, installation is easier than in previous versions,
40 and more files are installed in "standard" locations (e.g. documentation
41 is installed in C<$prefix/share/doc/swish-e>).
42
43 Note: Due to the new build and installation system in Swish-e 2.4, some
44 documentation may incorrectly list the location of files.  Please report
45 any documentation errors to the Swish-e Discussion list.
46
47 Swish-e is not a "turn-key" indexing and searching solution.  The Swish-e
48 distribution contains most of the parts to create such a system, but you
49 need to put the parts together as best meets your needs. This gives you the
50 power to index and search your documents the way you wish and to seamlessly
51 integrate a search engine into your web site or application.
52
53 To use Swish-e, you will need to configure Swish-e to index your documents,
54 create an index by running Swish-e, and set up an interface such as a CGI
55 script (a script is included) to search the index and display results.
56 Swish-e uses helper programs to index documents of types that it cannot
57 index natively.  These programs may need to be installed separately from
58 Swish-e.
59
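The three steps above (configure, index, search) can be sketched in a few lines of Python. This is only an illustration, not part of the distribution: the helper names, paths, and file names are hypothetical, and it assumes a C<swish-e> binary on your PATH. C<IndexDir>, C<IndexFile>, and C<IndexOnly> are configuration directives, and C<-c>, C<-f>, C<-w>, and C<-m> are command line switches; see the SWISH-CONFIG and SWISH-RUN pages for the authoritative descriptions.

```python
def build_config(doc_dir, index_file, suffixes=(".html", ".htm", ".txt")):
    """Return the text of a minimal Swish-e configuration file.

    All arguments are illustrative; paths containing spaces would need
    quoting, which this sketch does not attempt.
    """
    lines = [
        f"IndexDir {doc_dir}",               # directory of documents to index
        f"IndexFile {index_file}",           # where the index is written
        "IndexOnly " + " ".join(suffixes),   # only index files with these suffixes
    ]
    return "\n".join(lines) + "\n"


def index_command(config_path):
    """Argument list that builds an index from a configuration file."""
    return ["swish-e", "-c", config_path]


def search_command(index_file, query, max_results=10):
    """Argument list that searches an existing index for a query."""
    return ["swish-e", "-f", index_file, "-w", query, "-m", str(max_results)]
```

Write the string returned by C<build_config> to a file such as F<swish.conf>, then pass the argument lists to C<subprocess.run>. A CGI script or other interface would run the search command (or use the Swish-e library) and format the results.
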
60 Swish-e is an Open Source (see: http://opensource.org ) program supported by
61 developers and a large group of users. Please take time to join the Swish-e
62 discussion list at http://Swish-e.org .
63
64
65 =head2 Key features
66
67 =over 4
68
69 =item *
70
71 Quickly index a large number of documents in different formats
72 including text, HTML, and XML.
73
74 =item *
75
76 Use "filters" to index other types of files such as PDF, gzip, or
77 PostScript.
78
79 =item *
80
81 Includes a web spider for indexing remote documents over HTTP.
82 Follows Robots Exclusion Rules (including META tags).
83
84 =item *
85
86 Can use an external program to supply documents to Swish-e, such as an
87 advanced spider for your web server or a program to read and format
88 records from a relational database.
89
90 =item *
91
92 Document "properties" (some subset of the source document, usually defined
93 by META tags or XML elements) may be stored in the index and returned with
94 search results.
95
96 =item *
97
98 Document summaries can be returned with each search.
99
100 =item *
101
102 Word stemming, soundex, metaphone, and double-metaphone indexing for "fuzzy" searching
103
104 =item *
105
106 Phrase searching and wildcard searching
107
108 =item *
109
110 Limit searches to HTML links.
111
112 =item *
113
114 Use powerful Regular Expressions to select documents for indexing or exclusion.
115
116 =item *
117
118 Easily limit searches to parts or all of your web site.
119
120 =item *
121
122 Results can be sorted by relevance or by any number of properties
123 in ascending or descending order.
124
125 =item *
126
127 Limit searches to parts of documents such as certain HTML tags
128 (META, TITLE, comments, etc.) or to XML elements.
129
130 =item *
131
132 Can report structural errors in your XML and HTML documents.
133
134 =item *
135
136 Index file is portable between platforms.
137
138 =item *
139
140 A Swish-e library is provided to allow embedding Swish-e into your applications for
141 very fast searching.
142 A Perl module is available that provides a standard API for accessing Swish-e.
143
144 =item *
145
146 Includes an example search script with context summaries and search term and phrase highlighting.
147 Can be used with popular Perl templating systems.
148
149 =item *
150
151 Swish-e is fast.
152
153 =item *
154
155 It's Open Source and FREE!  You can customize Swish-e and you can
156 contribute your fancy new features to the project.
157
158 =item *
159
160 Supported by on-line user and developer groups.
161
162 =back
163
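As an illustration of the "external program" feature listed above, here is a rough Python sketch of a program that supplies documents to Swish-e. It assumes the record format described on the SWISH-RUN page for the C<prog> input method: a few header lines (C<Path-Name>, C<Content-Length>, and optionally C<Last-Mtime> and C<Document-Type>), a blank line, and then exactly C<Content-Length> bytes of content. The function name and sample values are hypothetical, so check SWISH-RUN before relying on the details.

```python
import sys


def format_record(path, content, last_mtime=None, doc_type=None):
    """Build one document record as bytes: headers, blank line, content.

    The content is encoded as ISO-8859-1 because Swish-e only indexes
    eight-bit encodings; unencodable characters are replaced with '?'.
    """
    body = content.encode("iso-8859-1", errors="replace")
    headers = [
        f"Path-Name: {path}",
        f"Content-Length: {len(body)}",   # length in bytes, not characters
    ]
    if last_mtime is not None:
        headers.append(f"Last-Mtime: {int(last_mtime)}")  # seconds since the epoch
    if doc_type is not None:
        headers.append(f"Document-Type: {doc_type}")      # e.g. HTML, XML, TXT
    head = "\n".join(headers) + "\n\n"
    return head.encode("iso-8859-1", errors="replace") + body


def emit(records, out=None):
    """Write (path, content) pairs to a binary stream, stdout by default."""
    out = out if out is not None else sys.stdout.buffer
    for path, content in records:
        out.write(format_record(path, content))
    out.flush()
```

A program built this way could read rows from a relational database and call C<emit> with one record per row. Swish-e would then be run with the C<prog> input method and the program named in the configuration file.
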
164
165 =head1 Where do I get Swish-e?
166
167 The current version of Swish-e can be found at:
168
169 http://Swish-e.org
170
171 Please make sure you use a current version of Swish-e.
172
173 Information about Windows binary distributions can also be found at
174 this site.
175
176 =head1 How Do I Install Swish-e?
177
178 Read the L<INSTALL|INSTALL> page.
179
180 Building from source is recommended.  On most platforms, Swish-e should build without problems.
181 A list of platforms where Swish-e has been built can be found in the L<INSTALL|INSTALL> page.
182 Information on building for VMS and Win32 can be found in sub-directories of the C<src> directory.
183 Check the Swish-e site for information about binary distributions (such as for Windows).
184
185 In addition to the INSTALL page, make sure you read the L<SWISH-FAQ|SWISH-FAQ> page if
186 you have any questions, or to get an idea of questions that you might someday ask.
187
188 Problems or questions about installing Swish-e should be directed to the Swish-e discussion list (see the
189 Swish-e web site at http://Swish-e.org).
190
191 Please read C<Where do I get help with Swish-e?> below before posting any questions to the
192 Swish-e list.
193
194
195 =head1 The Swish-e Documentation
196
197 Documentation is provided as HTML pages installed in
198 $prefix/share/doc/swish-e where $prefix is /usr/local if building from
199 source, or /usr if installed as part of a package from your OS vendor.
200 Under Windows $prefix is selected at installation time.
201
202 A subset of the documentation is installed as system man pages as well.
203
204 Documentation is also available on-line at http://swish-e.org.
205
206 Patches or updates to the documentation should be done against the POD files,
207 located in the pod directory of the distribution, or (preferably) against
208 the CVS repository.
209
210 =head1 Where do I get help with Swish-e?
211
212 If you need help with installing or using Swish-e, please subscribe to
213 the Swish-e mailing list.  Visit the Swish-e web site (listed above)
214 for information on subscribing to the mailing list.
215
216 Before posting any questions, please read
217 L<QUESTIONS AND TROUBLESHOOTING|INSTALL/"QUESTIONS AND TROUBLESHOOTING">.
218
219 =head1 Speling mistakes
220
221 Please contact the Swish-e list with corrections to this documentation.
222 Any help in cleaning up the docs will be appreciated!
223
224 Any patches should be made against the C<.pod> files, not the C<.html> files.
225
226 =head1 Swish-e Development
227
228 Swish-e is currently being developed as an Open-Source project on
229 SourceForge http://sourceforge.net.
230
231 Contact the Swish-e list for questions about Swish-e development.
232
233 =head1 Swish-e's History
234
235 SWISH was created by Kevin Hughes, circa 1994, to meet the needs of the growing number
236 of Web administrators on the Internet.  Many of the indexing systems were
237 not well documented, were hard to use and install, and were too complex
238 for their own good. The system was widely used for several years, long
239 enough to collect some bug fixes and requests for enhancements.
240
241 In Fall 1996, the Library of UC Berkeley received permission from
242 Kevin Hughes to implement bug fixes and enhancements to the original
243 binary. The result is Swish-enhanced or Swish-e, brought to you by the
244 Swish-e Development Team.
245
246 =head1 Document Info
247
248 Each document in the Swish-e distribution contains this section.
249 It refers only to the specific page it's located in, and not to the
250 Swish-e program or the documentation as a whole.
251
252 $Id$
253