(no commit message)

diff --git a/trunk/crawler/kiss/docs/content/xdocs/index.xml b/trunk/crawler/kiss/docs/content/xdocs/index.xml

index 1592cea52fdee21007672e143b916763822bb291..ee17d82fd6d050701f973297768d8b640425cb5d 100644 (file)
--- a/trunk/crawler/kiss/docs/content/xdocs/index.xml
+++ b/trunk/crawler/kiss/docs/content/xdocs/index.xml
@@ -20,7 +20,64 @@
    <header> 
      <title>Automatic Recording for KiSS Hard Disk Recorders</title> 
    </header> 
-  <body> 
+  <body>
+    <warning>
+      KiSS makes regular updates to their site that sometimes require adaptations
+      to the crawler. If it stops working, check out the most recent version here. 
+    </warning> 
+    <section id="changelog">
+      <title>Changelog</title>
+      
+         <section>
+        <title>7 September 2006</title>
+        <ul>
+          <li>KiSS modified the login procedure. It is now working again.</li>
+          <li>Generalized the startup scripts. They should now be insensitive to the specific libraries used. </li>
+        </ul>
+      </section>
+      <section>
+        <title>31 August 2006</title>
+        <ul>
+          <li>Added a Windows .bat file for running the crawler under Windows.
+              Very ad hoc; will be generalized. </li>
+        </ul>
+      </section>
+      <section>
+        <title>17 November 2006</title>
+        <ul>
+          <li>Corrected the packed distributions. The standalone distribution
+              had an error in the scripts and was missing libraries.</li>
+
+        </ul>
+        <title>24 August 2006</title>
+        <ul>
+          <li>The crawler now uses desktop login for crawling. It is also much more efficient since
+          it no longer needs to crawl the individual programs: the channel page
+            includes descriptions of programs in JavaScript popups which can be used by the crawler.
+          The result is a significant reduction of the load on the KiSS EPG site. In addition, the delay
+            between requests has been increased to further reduce load on the KiSS EPG site. </li>
+          <li>
+            The crawler now crawls programs for tomorrow instead of for today.
+          </li>
+          <li>
+            The web based crawler is configured to run only between 7pm and 12pm. It used to run at
+            5am. 
+          </li>
+        </ul>
+      </section>
+      
+      <section>
+        <title>13-20 August 2006</title>
+        <p>
+          There were several changes to the login procedure, requiring modifications to the crawler.
+        </p>
+        <ul>
+          <li>The crawler now uses the 'Referer' header field correctly at login.</li>
+          <li>KiSS now uses hidden form fields in their login process, which the crawler now also
+              handles correctly.</li>
+        </ul>
+      </section>
+    </section>
      <section id="overview">
        <title>Overview</title>
     
@@ -66,8 +123,8 @@
        </p>
        <p>
          The easy way to start is the 
-        <a href="installs/crawler/kiss/kiss-crawler-bin.zip">standalone program binary version</a>
-        or using the <a href="installs/crawler/kissweb/wamblee-crawler-kissweb.war">web
+        <a href="installs/crawler/target/wamblee-crawler-0.2-SNAPSHOT-kissbin.zip">standalone program binary version</a>
+        or using the <a href="installs/crawler/kissweb/target/wamblee-crawler-kissweb.war">web
            application</a>.
        </p>
        <p>
@@ -258,11 +315,17 @@
          <title>General usage</title>
          <p>
            When the crawler runs, it 
-          retrieves the programs for today. As a result, it is advisable 
+          retrieves the programs for tomorrow. As a result, it is advisable 
            to run the program at an early point of the day as a scheduled
            task (e.g. cron on unix). For the web application this is 
            preconfigured at 5AM. 
          </p>
+        <note>
+          If you deploy the web application today, it will run automatically
+          on the next (!) day. This even holds if you deploy the application
+          before the normal scheduled time.  
+        </note>
+        
          <p>
            Modifying the program to allow it to investigate tomorrow's
            programs instead is easy as well but not yet implemented.