DadaImcStaticCopy

Creating a static copy

Fetching the content

HTTrack (http://www.httrack.com/) is a tool for website mirroring. wget could also be used, but HTTrack has some advantages: limits on bandwidth and on the number of HTTP connections, and powerful link conversion.

Because DadaIMC has stability issues under load, use only ONE HTTP connection and a maximum transfer rate of 100 KByte/sec:

 httrack -c1 -A100000 http://at.indymedia.org
This mirrors the URL to `pwd`/at.indymedia.org.

Removing select boxes and other active elements

Open question: can some of these elements be turned off in DadaIMC itself before the content is fetched?

Categories

Idea:

Point httrack to the URL of the query for each separate category and save the results in different folders named after the categories:

 httrack -c1 -A100000 "http://at.indymedia.org/index.php?STRINGOFQUERY"

Then replace the category selection with static HTML that links to those subpages and to the main ('unfiltered') page.
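
The per-category idea could be scripted roughly as follows. This is a sketch, not a tested recipe: mirror_category, CATEGORYNAME and the HTTRACK override are names invented here, and STRINGOFQUERY remains a placeholder for the real query string of each category. The -O option tells httrack which folder to write the mirror to.

```shell
#!/bin/sh
# Sketch: mirror one category query into its own folder.
# Set HTTRACK=echo to preview the command without fetching anything.
HTTRACK="${HTTRACK:-httrack}"
BASE_URL="http://at.indymedia.org/index.php"

# mirror_category NAME QUERY
#   NAME:  folder to create for this category (hypothetical name)
#   QUERY: query string selecting the category (site-specific placeholder)
mirror_category() {
    name=$1
    query=$2
    # -c1: one connection, -A100000: at most 100000 bytes/sec,
    # -O: output folder. The URL is quoted because it contains "?".
    "$HTTRACK" -c1 -A100000 -O "$name" "$BASE_URL?$query"
}

# Example call; replace both arguments with real values first:
# mirror_category CATEGORYNAME STRINGOFQUERY
```

Calling the function once per category, one after the other, keeps the load on the DadaIMC server at a single connection.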

Replacing Search with htdig

Because search is implemented as an SQL query, it cannot be replaced by static HTML.

ht://Dig (http://www.htdig.org/) is an HTTP search engine (similar to Google and others, but small-scale).

htdig should be customized for your language. TODO

'rundig' starts the indexing. Be prepared for heavy load on the web server AND on the machine where htdig is running.

On a 1200 MHz Epia CPU it took 2 days to index a medium-sized DadaIMC site.
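
A minimal htdig setup for the static copy might look like the sketch below. The assumptions: the static copy is already served over HTTP at the start URL you pass in, write_htdig_conf is a helper name invented here, and the paths and the 2-second wait are illustrative values only. server_wait_time makes htdig pause between requests to the same server, which may ease the load mentioned above at the cost of a longer indexing run.

```shell
#!/bin/sh
# Sketch: write a small htdig configuration file for the static copy.
# write_htdig_conf is a hypothetical helper; all values are examples.

# write_htdig_conf FILE START_URL DB_DIR
write_htdig_conf() {
    # \${start_url} is escaped so that htdig, not the shell, expands it.
    cat > "$1" <<EOF
database_dir: $3
start_url: $2
limit_urls_to: \${start_url}
server_wait_time: 2
EOF
}

# Example (paths and URL are placeholders):
# write_htdig_conf ./static.conf http://localhost/at.indymedia.org/ /var/lib/htdig
# rundig -c ./static.conf
```
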
Topic revision: r2 - 28 Jan 2007, XaOk