Changeset 1623

Show
Ignore:
Timestamp:
02/03/05 14:56:38 (4 years ago)
Author:
whmoseley
Message:

Escape < and > as reported by Pod::POM processing

Files:

Legend:

Unmodified
Added
Removed
Modified
Copied
Moved
  • trunk/swish-e/filters/SWISH/Filter.pm.in

    r1586 r1623  
    125125=over 4 
    126126 
    127 =item $filter = SWISH::Filter->new() 
     127=item $filter = SWISH::Filter-E<gt>new() 
    128128 
    129129This creates a SWISH::Filter object.  You may pass in options as a list or a hash reference. 
     
    131131=back 
    132132 
    133 =head2 SWISH::Filter->new Options 
     133=head2 SWISH::Filter-E<gt>new Options 
    134134 
    135135There is currently only one option that can be passed in to new(): 
     
    218218 
    219219 
    220 =item $doc_object = $filter->convert(); 
     220=item $doc_object = $filter-E<gt>convert(); 
    221221 
    222222This method filters a document.  Returns an object of the class SWISH::Filter::document 
     
    411411 
    412412 
    413 =item $filter->mywarn() 
     413=item $filter-E<gt>mywarn() 
    414414 
    415415Internal function used for writing warning messages to STDERR if 
     
    425425} 
    426426 
    427 =item @filters = $filter->filter_list; 
     427=item @filters = $filter-E<gt>filter_list; 
    428428 
    429429Returns a list of filter objects installed. 
     
    514514} 
    515515 
    516 =item @filter = $filter->can_filter( $content_type ); 
     516=item @filter = $filter-E<gt>can_filter( $content_type ); 
    517517 
    518518This is useful for testing to see if a mimetype might be handled by SWISH::Filter 
     
    585585 
    586586Once a filter returns something other than undef no more filters will be 
    587 called.  If the filter calls $filter->set_continue then processing will 
     587called.  If the filter calls $filter-E<gt>set_continue then processing will 
    588588continue as if the file was not filtered.  For example, a filter can uncompress 
    589 data and then set $filter->set_continue and let other filters process the 
     589data and then set $filter-E<gt>set_continue and let other filters process the 
    590590document. 
    591591 
     
    834834 
    835835 
    836 =item $doc_ref = $doc_object->fetch_doc_reference; 
     836=item $doc_ref = $doc_object-E<gt>fetch_doc_reference; 
    837837 
    838838Returns a scalar reference to the document.  This can be used when the filter 
     
    842842If the file is currently on disk then it will be read into memory.  If the file was stored 
    843843in a temporary file on disk the file will be deleted once read into memory. 
    844 The file will be read in binmode if $doc->is_binary is true. 
    845  
    846 Note that $doc_object->fetch_doc is an alias. 
     844The file will be read in binmode if $doc-E<gt>is_binary is true. 
     845 
     846Note that $doc_object-E<gt>fetch_doc is an alias. 
    847847 
    848848=cut 
     
    861861 
    862862 
    863 =item $was_filtered = $doc_object->was_filtered 
     863=item $was_filtered = $doc_object-E<gt>was_filtered 
    864864 
    865865Returns true if some filter processed the document 
     
    872872} 
    873873 
    874 =item $content_type = $doc_object->content_type; 
     874=item $content_type = $doc_object-E<gt>content_type; 
    875875 
    876876Fetches the current content type for the document. 
     
    887887} 
    888888 
    889 =item $type = $doc_object->swish_parser_type 
     889=item $type = $doc_object-E<gt>swish_parser_type 
    890890 
    891891Returns a parser type based on the content type 
     
    915915} 
    916916 
    917 =item $doc_object->is_binary 
     917=item $doc_object-E<gt>is_binary 
    918918 
    919919Returns true if the document's content-type does not match "text/". 
     
    932932=over 4 
    933933 
    934 =item $file_name = $doc_object->fetch_filename; 
     934=item $file_name = $doc_object-E<gt>fetch_filename; 
    935935 
    936936Returns a path to the document as stored on disk. 
     
    941941the file name passed to be the real path of the document. 
    942942 
    943 The file will be written in binmode if $doc->is_binary is true. 
     943The file will be written in binmode if $doc-E<gt>is_binary is true. 
    944944 
    945945This method is not normally used by end-users of SWISH::Filter. 
     
    958958} 
    959959 
    960 =item $doc_object->set_continue; 
     960=item $doc_object-E<gt>set_continue; 
    961961 
    962962Processing will continue to the next filter if this is set to a true value. 
     
    979979 
    980980 
    981 =item $doc_object->set_content_type( $type ); 
     981=item $doc_object-E<gt>set_content_type( $type ); 
    982982 
    983983Sets the content type for a document. 
     
    10361036 
    10371037 
    1038 =item $doc_object->name 
     1038=item $doc_object-E<gt>name 
    10391039 
    10401040Fetches the name of the current file.  This is useful for printing out the 
    10411041name of the file in an error message. 
    1042 This is the name passed in to the SWISH::Filter->convert method. 
     1042This is the name passed in to the SWISH::Filter-E<gt>convert method. 
    10431043It is optional and thus may not always be set. 
    10441044 
     
    10471047 
    10481048 
    1049 =item $doc_object->user_data 
     1049=item $doc_object-E<gt>user_data 
    10501050 
    10511051Fetches the the user_data passed in to the filter. 
    1052 This can be any data or data structure passed into SWISH::Filter->new. 
     1052This can be any data or data structure passed into SWISH::Filter-E<gt>new. 
    10531053 
    10541054This is an easy way to pass special parameters into your filters. 
     
    11051105=over 4 
    11061106 
    1107 =item $self->type 
     1107=item $self-E<gt>type 
    11081108 
    11091109This method fetches the type of the filter.  The value returned sets the 
     
    11191119sub type { 2 }; 
    11201120 
    1121 =item $self->priority 
     1121=item $self-E<gt>priority 
    11221122 
    11231123This method fetches the priority of the filter.  The value returned sets the 
     
    11361136sub priority { 50 };  # default priority 
    11371137 
    1138 =item @types = $self->mimetypes 
     1138=item @types = $self-E<gt>mimetypes 
    11391139 
    11401140Returns the list of mimetypes (as regular expressions) set for the filter. 
     
    11541154} 
    11551155 
    1156 =item $pattern = $self->can_filter_mimetype( $content_type ) 
     1156=item $pattern = $self-E<gt>can_filter_mimetype( $content_type ) 
    11571157 
    11581158Returns true if passed in content type matches one of the filter's mimetypes 
     
    11831183} 
    11841184 
    1185 =item $boolean = $self->set_programs( @program_list ); 
     1185=item $boolean = $self-E<gt>set_programs( @program_list ); 
    11861186 
    11871187Returns true if all the programs listed in @program_list are found 
     
    12241224 
    12251225 
    1226 =item $path = $self->find_binary( $prog ); 
     1226=item $path = $self-E<gt>find_binary( $prog ); 
    12271227 
    12281228Use in a filter's new() method to test for a necesary program located in $PATH. 
     
    13061306} 
    13071307 
    1308 =item $bool = $self->use_modules( @module_list ); 
     1308=item $bool = $self-E<gt>use_modules( @module_list ); 
    13091309 
    13101310Attempts to load each of the module listed and calls its import() method. 
     
    13421342} 
    13431343 
    1344 =item $doc_ref = $self->run_program( $program, @args ); 
     1344=item $doc_ref = $self-E<gt>run_program( $program, @args ); 
    13451345 
    13461346Runs $program with @args.  Must pass in @args. 
  • trunk/swish-e/perl/API.pm

    r1586 r1623  
    136136=over 4 
    137137 
    138 =item $swish = SWISH::API->new( $index_files ); 
     138=item $swish = SWISH::API-E<gt>new( $index_files ); 
    139139 
    140140This method returns a swish handle object blessed into the SWISH::API class. 
     
    143143Caller must check for errors (see below). 
    144144 
    145 =item @indexes = $swish->IndexNames; 
     145=item @indexes = $swish-E<gt>IndexNames; 
    146146 
    147147Returns a list of index names associated with the swish handle. 
    148 These were the indexes specified as a parameter on the SWISH::API->new call. 
     148These were the indexes specified as a parameter on the SWISH::API-E<gt>new call. 
    149149This can be used in calls below that require specifying the index file name. 
    150150 
    151 =item @header_names = $swish->HeaderNames; 
     151=item @header_names = $swish-E<gt>HeaderNames; 
    152152 
    153153Returns a list of possible header names.  These can be used to lookup 
    154154header values.  See C<SwishHeaderValue> method below. 
    155155 
    156 =item @values = $swish->HeaderValue( $index_file, $header_name ); 
     156=item @values = $swish-E<gt>HeaderValue( $index_file, $header_name ); 
    157157 
    158158A swish-e index has data associated with it stored in the index header.  This method 
     
    164164The list of possible header names can be obtained from the SwishHeaderNames method. 
    165165 
    166 =item $swish->RankScheme( 0|1 ); 
     166=item $swish-E<gt>RankScheme( 0|1 ); 
    167167 
    168168Similar to the -R option with the swish-e command line tool. The default 
     
    196196=over 4 
    197197 
    198 =item $swish->Error 
     198=item $swish-E<gt>Error 
    199199 
    200200Returns true if an error occurred on the last operation.  On errors the value returned 
    201201is the internal Swish-e error number (which is less than zero). 
    202202 
    203 =item $swish->CriticalError 
     203=item $swish-E<gt>CriticalError 
    204204 
    205205Returns true if the last error was a critical error 
    206206 
    207 =item $swish->AbortLastError 
     207=item $swish-E<gt>AbortLastError 
    208208 
    209209Aborts the running program and prints an error message to STDERR. 
    210210 
    211 =item $str = $swish->ErrorString 
     211=item $str = $swish-E<gt>ErrorString 
    212212 
    213213Returns the string description of the current error (based on the value 
    214 returned by $swish->Error).  This is a generic error string. 
    215  
    216 =item $msg = $swish->LastErrorMsg 
     214returned by $swish-E<gt>Error).  This is a generic error string. 
     215 
     216=item $msg = $swish-E<gt>LastErrorMsg 
    217217 
    218218Returns a string with specific information about the last error, if any. 
     
    221221    badmeta=foo 
    222222 
    223 and "badmeta" is an invalid metaname $swish->ErrorString 
    224 might return "Unknown metaname", but $swish->LastErrorMsg might return "badmeta". 
     223and "badmeta" is an invalid metaname $swish-E<gt>ErrorString 
     224might return "Unknown metaname", but $swish-E<gt>LastErrorMsg might return "badmeta". 
    225225 
    226226 
     
    231231=over 4 
    232232 
    233 =item $search = $swish->New_Search_Object( $query ); 
     233=item $search = $swish-E<gt>New_Search_Object( $query ); 
    234234 
    235235This creates a new search object blessed into the SWISH::API::Search class.  The optional 
     
    246246    } 
    247247 
    248 =item $results = $swish->Query( $query ); 
     248=item $results = $swish-E<gt>Query( $query ); 
    249249 
    250250This is a short-cut which avoids the step of creating a separate search object. 
     
    266266=over 4 
    267267 
    268 =item $search->SetQuery( $query ); 
     268=item $search-E<gt>SetQuery( $query ); 
    269269 
    270270This will set (or replace) the query string associated with a search object. 
     
    272272actual query or when creating a search object. 
    273273 
    274 =item $search->SetStructure( $structure_bits ); 
     274=item $search-E<gt>SetStructure( $structure_bits ); 
    275275 
    276276This method may change in the future. 
     
    294294 
    295295 
    296 =item $search->PhraseDelimiter( $char ); 
     296=item $search-E<gt>PhraseDelimiter( $char ); 
    297297 
    298298Sets the character used as the phrase delimiter in searches.  The default 
    299299is double-quotes ("). 
    300300 
    301 =item $search->SetSearchLimit( $property, $low, $high ); 
     301=item $search-E<gt>SetSearchLimit( $property, $low, $high ); 
    302302 
    303303Sets a range from $low to $high inclusive that the give $property must be in 
     
    323323method first. 
    324324 
    325 =item $search->ResetSearchLimit; 
     325=item $search-E<gt>ResetSearchLimit; 
    326326 
    327327Clears the limit parameters for the given object.  This must be called if 
    328328the limit parameters need to be changed. 
    329329 
    330 =item $search->SetSort( $sort_string ); 
     330=item $search-E<gt>SetSort( $sort_string ); 
    331331 
    332332Sets the sort order of search results.  The string is a space separated 
     
    352352=over 4 
    353353 
    354 =item $results = $search->Execute( $query ); 
     354=item $results = $search-E<gt>Execute( $query ); 
    355355 
    356356Executes a query based on the parameters in the search object. 
     
    373373=over 4 
    374374 
    375 =item $hits = $results->Hits; 
     375=item $hits = $results-E<gt>Hits; 
    376376 
    377377Returns the number of results for the query.  If zero and no errors were reported 
    378 after calling $search->Execute then the query returned zero results. 
    379  
    380 =item @parsed_words = $results->ParsedWords( $index_name ); 
     378after calling $search-E<gt>Execute then the query returned zero results. 
     379 
     380=item @parsed_words = $results-E<gt>ParsedWords( $index_name ); 
    381381 
    382382Returns an array of tokenized words and operators with stopwords removed. 
     
    384384 
    385385$index_name must match one of the index files specified on the creation of 
    386 the swish object (via the SWISH::API->new call). 
     386the swish object (via the SWISH::API-E<gt>new call). 
    387387 
    388388The parsed words are useful for highlighting search terms in associated documents. 
    389389 
    390 =item @removed_stopwords = $results->RemovedStopwords( $index_name) ; 
     390=item @removed_stopwords = $results-E<gt>RemovedStopwords( $index_name) ; 
    391391 
    392392Returns an array of stopwords removed from a query, if any, for the index 
     
    394394 
    395395$index_name must match one of the index files specified on the creation of 
    396 the swish object (via the SWISH::API->new call). 
    397  
    398 =item $results->SeekResult( $position ); 
     396the swish object (via the SWISH::API-E<gt>new call). 
     397 
     398=item $results-E<gt>SeekResult( $position ); 
    399399 
    400400Seeks to the position specified in the result list.  Zero is the first position 
    401 and $results->Hits-1 is the last position.  Seeking past the end of results 
     401and $results-E<gt>Hits-1 is the last position.  Seeking past the end of results 
    402402sets a non-critical error condition. 
    403403 
    404404Useful for seeking to a specific "page" of results. 
    405405 
    406 =item $result = $results->NextResult; 
     406=item $result = $results-E<gt>NextResult; 
    407407 
    408408Fetches the next result from the list of results.  Returns undef if no 
     
    418418=over 4 
    419419 
    420 =item $prop = $result->Property( $prop_name ); 
     420=item $prop = $result-E<gt>Property( $prop_name ); 
    421421 
    422422Fetches the property specified for the current result. 
     
    429429format the strings (or just call scalar localtime( $prop ) ). 
    430430 
    431 =item $prop = $result->ResultPropertyStr( $prop_name ); 
     431=item $prop = $result-E<gt>ResultPropertyStr( $prop_name ); 
    432432 
    433433Fetches and formats the property.  Unlike above, invalid property names return the 
     
    437437 
    438438 
    439 =item $value = $result->ResultIndexValue( $header_name ); 
     439=item $value = $result-E<gt>ResultIndexValue( $header_name ); 
    440440 
    441441Returns the header value specified.  This is similar to 
    442 $swish->HeaderValue(), but the index file is not specified 
     442$swish-E<gt>HeaderValue(), but the index file is not specified 
    443443(it is determined by the result). 
    444444 
     
    449449=over 4 
    450450 
    451 =item @metas = $swish->MetaList( $index_name ); 
     451=item @metas = $swish-E<gt>MetaList( $index_name ); 
    452452 
    453453Swish-e has "MetaNames" which allow searching by fields in the index. 
     
    470470value is zero. 
    471471 
    472 =item @props = $swish->PropertyList( $index_name ); 
     472=item @props = $swish-E<gt>PropertyList( $index_name ); 
    473473 
    474474Swish-e can store content or "properties" in the index and return this data 
     
    494494value is zero. 
    495495 
    496 =item @propes = $result->PropertyList; 
    497  
    498 =item @meta = $result->MetaList; 
     496=item @propes = $result-E<gt>PropertyList; 
     497 
     498=item @meta = $result-E<gt>MetaList; 
    499499 
    500500These also return a list of Property or Metaname description objects, but are 
     
    504504 
    505505 
    506 =item $stemmed_word = $swish->StemWord( $word ); 
     506=item $stemmed_word = $swish-E<gt>StemWord( $word ); 
    507507 
    508508*Deprecated* 
     
    515515 
    516516 
    517 =item $fuzzy_word = $swish->Fuzzy( $indexname, $word ); 
     517=item $fuzzy_word = $swish-E<gt>Fuzzy( $indexname, $word ); 
    518518 
    519519Like StemWord used to work, only it uses whatever stemmer is named in $indexname. 
    520520Returns the same kind of fuzzy_word object as the FuzzyWord() method. 
    521521 
    522 =item $mode_string = $result->FuzzyMode; 
     522=item $mode_string = $result-E<gt>FuzzyMode; 
    523523 
    524524Returns the string (e.g. "Stemming_en", "Soundex", "None" ) indicating the stemming 
    525525method used while indexing the given document. 
    526526 
    527 =item $fuzzy_word = $result->FuzzyWord( $word ); 
     527=item $fuzzy_word = $result-E<gt>FuzzyWord( $word ); 
    528528 
    529529Converts $word using the same fuzzy mode used to index the $result. 
     
    531531to access the converted words and other data as shown below. 
    532532 
    533 =item $count = $fuzzy_word->WordCount; 
     533=item $count = $fuzzy_word-E<gt>WordCount; 
    534534 
    535535Returns the number of output words.  Normally this is the value one, but may 
     
    537537for a single input string. 
    538538 
    539 =item $status = $fuzzy_word->WordError; 
     539=item $status = $fuzzy_word-E<gt>WordError; 
    540540 
    541541Returns any error code that the stemmer might set.  Normally, this return value 
     
    543543are defined in the swish-e source file /src/stemmer.h. 
    544544 
    545 =item @words = $fuzzy_word->WordList; 
     545=item @words = $fuzzy_word-E<gt>WordList; 
    546546 
    547547Returns the converted words from the stemming/fuzzy operation.  Normally, the array will 
     
    550550 
    551551In the event that a word does not stem (e.g. trying to stem a number), this method 
    552 will return the original input word specified when $result->FuzzyWord( $word ) 
     552will return the original input word specified when $result-E<gt>FuzzyWord( $word ) 
    553553was called. 
    554554 
    555555 
    556 =item @parsed_words = $swish->SwishWords( $string, $index_file ); 
     556=item @parsed_words = $swish-E<gt>SwishWords( $string, $index_file ); 
    557557 
    558558* Not implemented * 
     
    594594 
    595595But as long as a SWISH::API::Result object is around, so is the entire list 
    596 of results generated by the $handle->Query() call, and the index file is 
     596of results generated by the $handle-E<gt>Query() call, and the index file is 
    597597still open (because a SWISH::API::Result depends on a SWISH::API::Results object, which 
    598598depends on a SWISH::API object). 
  • trunk/swish-e/pod/CHANGES.pod

    r1613 r1623  
    616616 
    617617 
    618 You should have swish-e packages in your RPMS/<arch> directory.  [augur] 
     618You should have swish-e packages in your RPMS/$arch directory.  [augur] 
    619619 
    620620=item * Changed default perl binary location 
     
    748748Filters (FileFilter directive) did not work correctly when spidering 
    749749with the -S http method.  A new filter system was developed and now 
    750 filtering of documents (e.g. pdf->html or MSWord->text) is handled 
     750filtering of documents (e.g. pdf-E<gt>html or MSWord-E<gt>text) is handled 
    751751by the src/SwishSpider program. 
    752752 
     
    840840 
    841841If you are parsing output headers in a program then you may need to 
    842 adjust your code.  There's a new switch <-H> to control the level of 
     842adjust your code.  There's a new switch '-H' to control the level of 
    843843header output when searching. 
    844844 
     
    11381138=item * New directive: ImageLinksMetaName 
    11391139 
    1140 Defines a metaname to use for indexing src links in <img> tags. 
     1140Defines a metaname to use for indexing src links in E<lt>imgE<gt> tags. 
    11411141Allow you to search image pathnames within HTML pages.  Available only 
    11421142with libxml2 parser. 
  • trunk/swish-e/pod/SWISH-CONFIG.pod

    r1613 r1623  
    395395=item * 
    396396 
    397 L<StoreDescription|/StoreDescription> [XML <tag>|HTML <meta>|TXT size] 
     397L<StoreDescription|/StoreDescription> [XML E<lt>tagE<gt>|HTML E<lt>metaE<gt>|TXT size] 
    398398 
    399399=item * 
     
    403403=item * 
    404404 
    405 L<SwishSearchDefaultRule|/SwishSearchDefaultRule>   [<AND-WORD>|<or-word>] 
    406  
    407 =item * 
    408  
    409 L<SwishSearchOperators|/SwishSearchOperators> <and-word> <or-word> <not-word
     405L<SwishSearchDefaultRule|/SwishSearchDefaultRule>   [E<lt>AND-WORDE<gt>|E<lt>or-wordE<gt>] 
     406 
     407=item * 
     408 
     409L<SwishSearchOperators|/SwishSearchOperators> E<lt>and-wordE<gt> E<lt>or-wordE<gt> E<lt>not-wordE<gt
    410410 
    411411=item * 
     
    545545    " "  = following word will be searched in documents 
    546546 
    547 =item SwishSearchOperators <and-word> <or-word> <not-word
     547=item SwishSearhOperators E<lt>and-wordE<gt> E<lt>or-wordE<gt> E<lt>not-wordE<gt
    548548 
    549549B<NOTE>: This following item is currently not available. 
     
    557557    SwishSearchOperators   UND  ODER  NICHT 
    558558 
    559 =item SwishSearchDefaultRule   [<AND-WORD>|<or-word>] 
     559=item SwishSearchDefaultRule   [E<lt>AND-WORDE<gt>|E<lt>or-wordE<gt>] 
    560560 
    561561B<NOTE>: This following item is currently not available. 
     
    10941094=item IndexAltTagMetaName *tagname*|as-text 
    10951095 
    1096 Allows indexing of images <IMG> ALT tag text.  Specify either a tag name which will be 
     1096Allows indexing of images E<lt>IMGE<gt> ALT tag text.  Specify either a tag name which will be 
    10971097used as a metaname, or the special text "as-text" which says to index the ALT text as 
    10981098if it were plain text at the current location. 
     
    11361136If this is set true then Swish-e will attempt to convert relative URIs 
    11371137extracted from HTML documents for use with C<HTMLLinksMetaName> and 
    1138 C<ImageLinksMetaName> into absolute URIs.  Swish-e will use any <BASE> tag 
     1138C<ImageLinksMetaName> into absolute URIs.  Swish-e will use any E<lt>BASEE<gt> tag 
    11391139found in the document, otherwise it will use the file's pathname.  The pathname 
    11401140used will be the pathname *after* C<ReplaceRules> has been applied to the 
     
    13231323    Indexing done! 
    13241324 
    1325 One thing to note is that the first <person> block finds a class name 
     1325One thing to note is that the first E<lt>personE<gt> block finds a class name 
    13261326"student" so all metanames that are created from attributes use the 
    1327 combined name "person.student".  The second <person> block doesn't contain 
     1327combined name "person.student".  The second E<lt>personE<gt> block doesn't contain 
    13281328a "class" so, the attribute name is combined directly with the element 
    13291329name (e.g. "person.greeting"). 
     
    15771577 
    15781578 
    1579 =item StoreDescription [XML <tag> size|HTML <meta> size|TXT size] 
     1579=item StoreDescription [XML E<lt>tagE<gt> size|HTML E<lt>metaE<gt> size|TXT size] 
    15801580 
    15811581B<StoreDescription> allows you to store a document description in the index 
  • trunk/swish-e/pod/SWISH-FAQ.pod

    r1613 r1623  
    11201120 
    11211121That means there was one instance of our word in the title of the file. 
    1122 It's context was in the <head> tagset, inside the <title>.  
    1123 The <title> is the most specific structure, so it gets the 
     1122It's context was in the E<lt>headE<gt> tagset, inside the E<lt>titleE<gt>.  
     1123The E<lt>titleE<gt> is the most specific structure, so it gets the 
    11241124RANK_TITLE score: 7. The base rank of 1 plus the structure score of 7 equals 8. If there 
    11251125had been two instances of this word in the title, then the score would have been C<8 + 8 = 16>. 
  • trunk/swish-e/pod/SWISH-RUN.pod

    r1492 r1623  
    594594EM, or STRONG), and c is HTML comment tags 
    595595 
    596 search only in header (<H*>) tags 
     596search only in header (E<lt>H*E<gt>) tags 
    597597 
    598598    swish-e -w word -t h 
     
    914914    -x "xml_out: <swishtitle fmt='<title>%s</title>'>\n" 
    915915 
    916 =item -H [0|1|2|3|<n>]  (header output verbosity) 
     916=item -H [0|1|2|3|E<lt>nE<gt>]  (header output verbosity) 
    917917 
    918918The C<-H n> switch generates extened I<header> output.  This is most useful when searching more than one 
  • trunk/swish-e/prog-bin/spider.pl.in

    r1598 r1623  
    17101710The spider does require Perl's LWP library and a few other reasonably common 
    17111711modules.  Most well maintained systems should have these modules installed. 
    1712 See  L<REQUIREMENTS> below for more information.  It's a good idea to check 
     1712See  L</"REQUIREMENTS"> below for more information.  It's a good idea to check 
    17131713that you are running a current version of these modules. 
    17141714 
     
    17291729 
    17301730By default, this script will not spider files blocked by F<robots.txt>.  In addition, 
    1731 The script will check for <meta name="robots"..> tags, which allows finer 
     1731The script will check for E<lt>meta name="robots"..E<gt> tags, which allows finer 
    17321732control over what files are indexed and/or spidered. 
    17331733See http://www.robotstxt.org/wc/exclusion.html for details. 
    17341734 
    1735 This spider provides an extension to the <meta> tag exclusion, by adding a 
     1735This spider provides an extension to the E<lt>metaE<gt> tag exclusion, by adding a 
    17361736B<NOCONTENTS> attribute.  This attribute turns on the C<no_contents> setting, which 
    17371737asks swish-e to only index the document's title (or file name if not title is found). 
     
    23462346 
    23472347The function calls are wrapped in an eval, so calling die (or doing something that dies) will just cause 
    2348 that URL to be skipped.  If you really want to stop processing you need to set $server->{abort} in your 
     2348that URL to be skipped.  If you really want to stop processing you need to set $server-E<gt>{abort} in your 
    23492349subroutine (or send a kill -HUP to the spider). 
    23502350 
     
    25972597 
    25982598Note that you can create your own counters to display in the summary list when spidering 
    2599 is finished by adding a value to the hash pointed to by C<$server->{counts}>. 
     2599is finished by adding a value to the hash pointed to by C<$server-E<gt>{counts}>. 
    26002600 
    26012601    test_url => sub { 
  • trunk/swish-e/src/metanames.c

    r1544 r1623  
    452452        efree( meta->metaName ); 
    453453 
    454 #ifndef USE_BTREE 
     454#ifndef USE_PRESORT_ARRAY 
    455455        if ( meta->sorted_data) 
    456456            efree( meta->sorted_data );