- <title>Program configuration: <code>programs.xml</code></title>
-
- <p>
- The <code>programs.xml</code> file contains the following
- configuration items:
- </p>
- <ul>
- <li>Notification configuration: Describing how to
- do notification of the results of crawling the site. </li>
- <li>Zero or more configurations of interesting programs. </li>
- </ul>
- <section>
- <title>Notification configuration</title>
- <p>
- Notification is configured in the (surprise, surprise!)
- <code>notification</code> element. This notification element
- is used to configure respectively sender mail address (= reply
- address), recipient address, subject of the email, smtp server
- host and port and optional username and password.
- In addition it contains the names of the stylesheets to
- generate the HTML and Text reports. These stylesheets
- should not be changed.
- </p>
- </section>
-
- <section>
- <title>Program configuration</title>
- <p>
- Interesting TV shows are described using <code>program</code>
- elements. Each <code>program</code> element contains
- one or more <code>match</code> elements that describe
- a condition that the interesting program must match.
- </p>
- <p>
- Matching can be done on the following properties of a program:
- </p>
- <table>
- <tr><th>Field name</th>
- <th>Description</th></tr>
- <tr>
- <td>name</td>
- <td>Program name</td>
- </tr>
- <tr>
- <td>description</td>
- <td>Program description</td>
- </tr>
- <tr>
- <td>channel</td>
- <td>Channel name</td>
- </tr>
- <tr>
- <td>keywords</td>
- <td>Keywords/classification of the program.</td>
- </tr>
- </table>
- <p>
- The field to match is specified using the <code>field</code>
- attribute of the <code>match</code> element. If no field name
- is specified then the program name is matched. Matching is done
- by converting the field value to lowercase and then doing a
- perl-like regular expression match of the provided value. As a
- result, the content of the match element should be specified in
- lower case otherwise the pattern will never match.
- If multiple <code>match</code> elements are specified for a
- given <code>program</code> element, then all matches must
- apply for a program to be interesting.
- </p>
- <p>
- Example patterns:
- </p>
- <table>
- <tr>
- <th>Pattern</th>
- <th>Example of matching field values</th>
- </tr>
- <tr>
- <td>the.*x.*files</td>
- <td>"The X files", "The X-Files: the making of"</td>
- </tr>
- <tr>
- <td>star trek</td>
- <td>"Star Trek Voyager", "Star Trek: The next generation"</td>
- </tr>
- </table>
-
- <p>
- It is possible that different programs cannot be recorded at
- since they overlap. To deal with such conflicts, it is possible
- to specify a priority using the <code>priority</code> element.
- Higher values of the priority value mean a higher priority.
- If two programs have the same priority, then it is (more or less)
- unspecified which of the two will be recorded, but it will at least
- record one program. If no priority is specified, then the
- priority is 1 (one).
- </p>
-
- <p>
- Since it is not always desirable to try to record every
- program that matches the criteria, it is also possible to
- generate notifications for interesting programs only without
- recording them. This is done by specifying the
- <code>action</code> alement with the content <code>notify</code>.
- By default, the <code>action</code> is <code>record</code>.
- To make the mail reports more readable it is possible to
- also assign a category to a program for grouping interesting
- programs. This can be done using the <code>category</code>
- element. Note that if the <code>action</code> is
- <code>notify</code>. then the <code>priority</code> element
- is not used.
- </p>
-
- </section>
-
-
+ <title>Program configuration</title>
+ <p> Interesting TV shows are described using <code>program</code> elements. Each
+ <code>program</code> element contains one or more <code>match</code> elements that
+ describe a condition that the interesting program must match. </p>
+ <p> Matching can be done on the following properties of a program: </p>
+ <table>
+ <tr>
+ <th>Field name</th>
+ <th>Description</th>
+ </tr>
+ <tr>
+ <td>name</td>
+ <td>Program name</td>
+ </tr>
+ <tr>
+ <td>description</td>
+ <td>Program description</td>
+ </tr>
+ <tr>
+ <td>channel</td>
+ <td>Channel name</td>
+ </tr>
+ <tr>
+ <td>keywords</td>
+ <td>Keywords/classification of the program.</td>
+ </tr>
+ </table>
+ <p> The field to match is specified using the <code>field</code> attribute of the
+ <code>match</code> element. If no field name is specified then the program name is
+ matched. Matching is done by converting the field value to lowercase and then doing a
+ perl-like regular expression match of the provided value. As a result, the content of the
+ match element should be specified in lower case otherwise the pattern will never match. If
+ multiple <code>match</code> elements are specified for a given <code>program</code>
+ element, then all matches must apply for a program to be interesting. </p>
+ <p> Example patterns: </p>
+ <table>
+ <tr>
+ <th>Pattern</th>
+ <th>Example of matching field values</th>
+ </tr>
+ <tr>
+ <td>the.*x.*files</td>
+ <td>"The X files", "The X-Files: the making of"</td>
+ </tr>
+ <tr>
+ <td>star trek</td>
+ <td>"Star Trek Voyager", "Star Trek: The next generation"</td>
+ </tr>
+ </table>
+
+ <p> It is possible that different programs cannot be recorded since they overlap. To deal
+ with such conflicts, it is possible to specify a priority using the <code>priority</code>
+ element. Higher values of the priority value mean a higher priority. If two programs have
+ the same priority, then it is (more or less) unspecified which of the two will be
+ recorded, but it will at least record one program. If no priority is specified, then the
+ priority is 1 (one). </p>
+
+ <p> Since it is not always desirable to try to record every program that matches the
+ criteria, it is also possible to generate notifications for interesting programs only
+ without recording them. This is done by specifying the <code>action</code> alement with
+ the content <code>notify</code>. By default, the <code>action</code> is
+ <code>record</code>. To make the mail reports more readable it is possible to also assign
+ a category to a program for grouping interesting programs. This can be done using the
+ <code>category</code> element. Note that if the <code>action</code> is
+ <code>notify</code>. then the <code>priority</code> element is not used. </p>
+
+ </section>
+
+ <section>
+ <title>Notification configuration</title>
+ <p> Edit the configuration file <code>org.wamblee.crawler.properties</code>. The properties
+ file is self-explanatory. </p>