

4.4 The Exclusion List

Sometimes the STAYONHOST and STAYBELOW attributes are not selective enough to include only the documents you want. For finer control, the parser looks for so-called exclusion lists in the form of text files. It first loads the file $HOME/.plucker/exclusionlist.txt (if it exists), then exclusionlist.txt in the directory you specified with the -p flag (again, only if it exists), and finally any files specified on the command line with the -E option and/or named in the configuration file. A default exclusionlist.txt file is included with Plucker.
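The loading order described above can be sketched in Python as follows. This is only an illustration under stated assumptions, not the parser's actual code: the function name exclusion_list_paths and its parameters are hypothetical, plucker_dir stands for the directory given with -p, and extra_files stands for the files named with -E or in the configuration file.

```python
import os
from typing import List, Optional, Sequence


def exclusion_list_paths(plucker_dir: str,
                         extra_files: Sequence[str] = (),
                         home: Optional[str] = None) -> List[str]:
    """Return exclusion list files in the order the text describes (hypothetical helper)."""
    if home is None:
        home = os.path.expanduser("~")
    candidates = [
        os.path.join(home, ".plucker", "exclusionlist.txt"),  # per-user list
        os.path.join(plucker_dir, "exclusionlist.txt"),       # list in the -p directory
    ]
    # Both default files are optional, so keep only those that exist.
    paths = [path for path in candidates if os.path.isfile(path)]
    # Explicitly named files (-E option or configuration file) come last.
    paths.extend(extra_files)
    return paths
```

A caller would then read the returned files one after another and collect their rules.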

Each filter rule in an exclusion list has the following format:

<prio>:<action>:<regexp>

where:

<prio>
is an integer (negative numbers are also valid) specifying the priority. Rules with higher priorities are considered before rules with lower priorities. Rules of equal priority are considered in the sequence that they appear in the file.
<action>
is either a plus or a minus sign. Plus means 'include this document' while minus means 'do not include this document'.
<regexp>
is a valid regular expression (as known, e.g., from Perl).

Leading and trailing white space is ignored. Empty lines and lines starting with a '#' are treated as comments.
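As a minimal sketch of how one line in this format could be read, assuming Python: the names Rule and parse_rule are hypothetical and are not taken from Plucker's source. The line is split on the first two colons only, because the regular expression may itself contain colons (as in 'http://').

```python
import re
from typing import NamedTuple, Optional


class Rule(NamedTuple):
    prio: int            # higher values are considered first
    include: bool        # True for '+', False for '-'
    pattern: re.Pattern  # compiled <regexp>


def parse_rule(line: str) -> Optional[Rule]:
    """Parse one '<prio>:<action>:<regexp>' line; return None for blank lines and comments."""
    text = line.strip()  # leading and trailing white space is ignored
    if not text or text.startswith("#"):
        return None
    # Split on the first two colons only; the regexp may contain ':' itself.
    parts = text.split(":", 2)
    if len(parts) != 3:
        raise ValueError("malformed rule: %r" % line)
    prio_text, action, regexp = parts
    if action not in ("+", "-"):
        raise ValueError("action must be '+' or '-': %r" % line)
    return Rule(int(prio_text), action == "+", re.compile(regexp))
```

Rejecting malformed lines with ValueError is a choice made for this sketch; the text does not say how the parser itself reacts to them.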

Examples:

0:-:.*\.mp3$
0:-:.*\.wav$

## Ignore known advertisement sites:
0:-:http://.*ad\.de\.doubleclick\.net/.*
0:-:http://.*ad\.linkexchange\.com/.*
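To see how rules like these might be evaluated against a URL, here is a rough Python sketch. It makes several assumptions that the text does not confirm: the first matching rule decides, a URL that matches no rule is included, and the regular expression is matched from the start of the URL with re.match. The names load_rules and is_included, the example.org URLs, and the priority 10 rule are hypothetical and only illustrate how a higher priority rule is considered first.

```python
import re

EXAMPLE_RULES = r"""
0:-:.*\.mp3$
0:-:.*\.wav$

## Ignore known advertisement sites:
0:-:http://.*ad\.de\.doubleclick\.net/.*
0:-:http://.*ad\.linkexchange\.com/.*

## Hypothetical higher priority rule, added for this illustration only:
10:+:.*/podcasts/.*\.mp3$
"""


def load_rules(lines):
    """Parse rule lines into (prio, include, pattern) tuples, highest priority first."""
    rules = []
    for line in lines:
        text = line.strip()
        if not text or text.startswith("#"):
            continue  # blank lines and comments carry no rule
        prio, action, regexp = text.split(":", 2)
        rules.append((int(prio), action == "+", re.compile(regexp)))
    # sorted() is stable, so rules of equal priority keep their file order.
    return sorted(rules, key=lambda rule: -rule[0])


def is_included(url, rules, default=True):
    """Return the action of the first matching rule (assumed semantics)."""
    for _prio, include, pattern in rules:
        if pattern.match(url):
            return include
    return default  # assumption: unmatched URLs are included


rules = load_rules(EXAMPLE_RULES.splitlines())
print(is_included("http://example.org/song.mp3", rules))          # False
print(is_included("http://example.org/podcasts/ep1.mp3", rules))  # True
print(is_included("http://example.org/index.html", rules))        # True
```

The second URL also matches the priority 0 rule for .mp3 files, but the priority 10 rule is considered first and includes it.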


The Plucker Team