| 1 |
# ----- Example 8 - Filtering PDF files ------- |
|---|
| 2 |
# |
|---|
| 3 |
# Please see the swish-e documentation for |
|---|
| 4 |
# information on configuration directives. |
|---|
| 5 |
# Documentation is included with the swish-e |
|---|
| 6 |
# distribution, and also can be found on-line |
|---|
| 7 |
# at http://swish-e.org |
|---|
| 8 |
# |
|---|
| 9 |
# |
|---|
| 10 |
# This example demonstrates how to use swish's |
|---|
| 11 |
# "filter" feature to index PDF documents. |
|---|
| 12 |
# |
|---|
| 13 |
# Filters can be used to filter PDF or MS Word documents |
|---|
| 14 |
# to uncompress gzipped files, or to modify content |
|---|
| 15 |
# before indexing. |
|---|
| 16 |
# |
|---|
| 17 |
# You will need the xpdf package installed to use |
|---|
| 18 |
# this filter. |
|---|
| 19 |
# |
|---|
| 20 |
# See filter-bin/_pdf2html.pl for more information. |
|---|
| 21 |
# |
|---|
| 22 |
# Please see the documentation on File Filters in |
|---|
| 23 |
# the SWISH-CONFIG.pod manual page. |
|---|
| 24 |
# |
|---|
| 25 |
# Note: |
|---|
| 26 |
# If you are filtering many documents and/or using |
|---|
| 27 |
# a perl script to filter, see example9.config for |
|---|
| 28 |
# perhaps a faster way to filter. |
|---|
| 29 |
# |
|---|
| 30 |
#--------------------------------------------------- |
|---|
| 31 |
|
|---|
| 32 |
# Include our site-wide configuration settings: |
|---|
| 33 |
|
|---|
| 34 |
IncludeConfigFile example4.config |
|---|
| 35 |
|
|---|
| 36 |
# Index the example config files and .pdf files |
|---|
| 37 |
# in the current directory (and sub directories) |
|---|
| 38 |
|
|---|
| 39 |
IndexDir . |
|---|
| 40 |
IndexOnly .config .pdf |
|---|
| 41 |
|
|---|
| 42 |
|
|---|
| 43 |
# Assign the pdf2text.pl filter to .pdf files |
|---|
| 44 |
# Please see docs on what data can be passed to the filter. |
|---|
| 45 |
|
|---|
| 46 |
FileFilter .pdf ../filter-bin/_pdf2html.pl |
|---|
| 47 |
|
|---|
| 48 |
|
|---|
| 49 |
# end of example |
|---|
| 50 |
|
|---|