I would need an application that is capable of transforming articles in PDF format into HTML format. The good news is that the structure of the source files would not be very complicated, and the output would not need to track the formatting of the input. However, headings and subheadings should be identified as such and bolded and italicized text should be preserved. The HTML file should be one continuous file, rather than a series of files, broken down by page. In converting the files, the application should (1) recognize page breaks in the PDF file and insert the tag "[*x]" in the corresponding place of the HTML file, whereby x would signify the page number of the new page and (2) recognize footnotes in the PDF file and insert them in the HTML file after the relevant page - for example, there might be a delimiter "----- BEGIN FOOTNOTES -----" to indicate that the fotnote text is about to start and a delimiter "----- END FOOTNOTES -----" to indicate that the footnote text is about to finish.
The use of open source/GPLed components is permitted.
The application should be delivered with Windows and Linux binaries plus source code.
I have usually had the best results on GAF when I gave the project to someone who was able to show me a demo, so if you could do a demo, that would almost certainly win you the job (unless, of course, someone else has done an equally good demo before you...).
I don't like the downpayments but I am happy to use the GAF escrow arrangements (but note that these guys deduct a discount, so it would be better to do it the classic way - near-final version first against payment against final code). References upon request.