Timo Sirainen wrote:
On Thu, 2007-03-01 at 15:14 +0100, Johannes Berg wrote:
On Thu, 2007-03-01 at 15:10 +0100, Luca Corti wrote:
pdf2txt? HTML would be great. Both HTML and PDF could be generated from an intermediate format like DocBook, by the way. MoinMoin contains a script to dump the whole wiki into HTML:
MoinMoin/script/moin.py export dump --target-dir=/some/where
I was trying to change that to get it to dump plain text, but that didn't work. I can try again and talk to the Moin developers if you want that.
I think the documentation should be in plaintext. At least I don't like reading HTML docs myself unless they're actually on the web (lynx kind of sucks).
There are more than half a dozen HTML-to-text converters I know of offhand: lynx, links, elinks, w3m, one htmltotext tool whose name I forgot (check freshmeat.net), and dillo.
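If none of those tools is at hand, the HTML dump could also be post-processed with a few lines of Python. What follows is only a rough sketch using the standard library's html.parser module. The names TextExtractor and html_to_text are made up for illustration, nothing here is part of MoinMoin, and the output is far cruder than what lynx or w3m produce (no tables, no link lists, and whitespace inside <pre> gets collapsed).

```python
from html.parser import HTMLParser


class TextExtractor(HTMLParser):
    """Hypothetical helper: collect visible text from an HTML page."""

    # Tags that should start or end a line in the plain-text output.
    BLOCK_TAGS = {"p", "div", "br", "li", "tr", "pre",
                  "h1", "h2", "h3", "h4", "h5", "h6"}
    # Tags whose contents are never shown to the reader.
    SKIP_TAGS = {"script", "style"}

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.parts = []
        self.skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIP_TAGS:
            self.skip_depth += 1
        elif tag in self.BLOCK_TAGS:
            self.parts.append("\n")

    def handle_endtag(self, tag):
        if tag in self.SKIP_TAGS and self.skip_depth > 0:
            self.skip_depth -= 1
        elif tag in self.BLOCK_TAGS:
            self.parts.append("\n")

    def handle_data(self, data):
        # Keep text only when we are outside <script>/<style>.
        if self.skip_depth == 0:
            self.parts.append(data)


def html_to_text(html):
    """Return the visible text of an HTML string, one block per line."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    raw = "".join(parser.parts)
    # Collapse runs of whitespace and drop empty lines.
    lines = (" ".join(line.split()) for line in raw.splitlines())
    return "\n".join(line for line in lines if line)


page = "<h1>Title</h1><p>Hello <b>world</b> &amp; all</p>"
print(html_to_text(page))
```

Running html_to_text over each file in the --target-dir from the dump command would give one text file per wiki page, assuming the dump is plain static HTML.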
Oh, and I just remembered HTMLDOC.org, of which there's a free/open-source version. It can convert HTML to PDF or PS and generates quite handsome results if used properly. If the HTML contains sufficient markup, perhaps that's an even better way than the route via DocBook and XSL-FO.