From 418d394bf21c6a4bd7de938958391ce657e81804 Mon Sep 17 00:00:00 2001
From: erik
Date: Thu, 16 Mar 2006 18:56:43 +0000
Subject: [PATCH]

---
 crawler/ABOUT.txt       | 2 ++
 crawler/basic/ABOUT.txt | 9 +++++++++
 crawler/kiss/ABOUT.txt  | 5 +++++
 3 files changed, 16 insertions(+)
 create mode 100644 crawler/ABOUT.txt
 create mode 100644 crawler/basic/ABOUT.txt
 create mode 100644 crawler/kiss/ABOUT.txt

diff --git a/crawler/ABOUT.txt b/crawler/ABOUT.txt
new file mode 100644
index 00000000..ba9a7ad3
--- /dev/null
+++ b/crawler/ABOUT.txt
@@ -0,0 +1,2 @@
+This directory contains a generic web crawler (basic directory) and several useful implementations built on top of it.
+
diff --git a/crawler/basic/ABOUT.txt b/crawler/basic/ABOUT.txt
new file mode 100644
index 00000000..b61c613d
--- /dev/null
+++ b/crawler/basic/ABOUT.txt
@@ -0,0 +1,9 @@
+This is a general library for implementing a web crawler.
+
+The crawler works by retrieving an HTML page and transforming the HTML
+(content + presentation) into content using XSLT stylesheets. Using a convention
+for links in the converted content, it becomes possible to build a generic interface on the retrieved pages for navigating through the content.
+
+A configuration file determines how a certain page must be retrieved and transformed.
+
+
diff --git a/crawler/kiss/ABOUT.txt b/crawler/kiss/ABOUT.txt
new file mode 100644
index 00000000..4f23b392
--- /dev/null
+++ b/crawler/kiss/ABOUT.txt
@@ -0,0 +1,5 @@
+This is a crawler for the KiSS Electronic Program Guide that can be used, for instance, with the KiSS DP558 hard-disc recorder. It uses the basic crawler for its implementation.
+
+Based on preferences for recording programs, the crawler automatically records programs that are scheduled to run on the same day. This saves a lot of manual work in recording programs.
+
+The final idea is to define one's own interests in television programs and have the crawler record them automatically or send notifications of possibly interesting programs. Whether programs should be recorded can be determined by several criteria such as program title, channel, time of day, and keywords in the description.
-- 
2.31.1