First cut. Distintly raw around the edges:
* Assumes it will be running in /home/ali/wk/slashem/web.scripts
* Assumes cache directory will be in topdir
* No build system (simple compiling and linking against libxml2)
* No configure system (eg., tagsoup)
* Output XML untested
* Doesn't set bugzilla maintainer or exporter
* Handling of artifact priorities and resolution is suspect
1 The cache directory contains the following:
3 sf/attachments/<file_id>
4 Raw attachments as downloaded from sourceforge by sf2bz
6 sf/artifacts/<atid>/<aid>.html
7 Tagsoup detailed artifacts as downloaded from sourceforge by sf2bz
9 sf/users/<user_id>.html
10 Tagsoup user profiles as downloaded from sourceforge by sf2bz
12 attachments/<file_id>.xml
13 Attachments converted to xml by sf2bz
15 artifacts/<atid>/<aid>.xhtml
16 Conversion of detailed artifacts to xhtml by tagsoup
19 Conversion of user profiles to xhtml by tagsoup