<header>
<title>Automatic Recording for KiSS Hard Disk Recorders</title>
</header>
- <body>
+ <body>
+ <warning>
+ KiSS makes regular updates to their site that sometimes require adaptations
+ to the crawler. If it stops working, check out the most recent version here.
+ </warning>
+ <section id="changelog">
+ <title>Changelog</title>
+
+ <section>
+ <title>31 August 2006</title>
+ <ul>
+ <li>Added windows bat file for running the crawler under windows.
+ Very add-hoc, will be generalized. </li>
+ </ul>
+ </section>
+ <section>
+ <title>24 August 2006</title>
+ <ul>
+ <li>The crawler now uses desktop login for crawling. Also, it is much more efficient since
+ it no longer needs to crawl the individual programs. This is because the channel page
+ includes descriptions of programs in javascript popups which can be used by the crawler.
+ The result is a significant reduction of the load on the KiSS EPG site. Also, the delay
+ between requests has been increased to further reduce load on the KiSS EPG site. </li>
+ <li>
+ The crawler now crawls programs for tomorrow instead of for today.
+ </li>
+ <li>
+ The web based crawler is configured to run only between 7pm and 12pm. It used to run at
+ 5am.
+ </li>
+ </ul>
+ </section>
+
+ <section>
+ <title>13-20 August 2006</title>
+ <p>
+ There were several changes to the login procedure, requiring modifications to the crawler.
+ </p>
+ <ul>
+ <li>The crawler now uses the 'Referer' header field correctly at login.</li>
+ <li>KiSS now uses hidden form fields in their login process which are now also handled correctly by the
+ crawler.</li>
+ </ul>
+ </section>
+ </section>
<section id="overview">
<title>Overview</title>
<title>General usage</title>
<p>
When the crawler runs, it
- retrieves the programs for today. As a result, it is advisable
+ retrieves the programs for tomorrow. As a result, it is advisable
to run the program at an early point of the day as a scheduled
task (e.g. cron on unix). For the web application this is
preconfigured at 5AM.
</p>
+ <note>
+ If you deploy the web application today, it will run automatically
+ on the next (!) day. This even holds if you deploy the application
+ before the normal scheduled time.
+ </note>
+
<p>
Modifying the program to allow it to investigate tomorrow's
programs instead is easy as well but not yet implemented.