This project involves extracting all documents from the page: [url removed, login to view]
The deliverables are:
- Folder structure as described in the attached documentation.
- XML with metadata extracted from each document.
- Original PDF and parsed plain text.
Please see the attached documents and provide a solution in Perl. Only serious bidders who are proficient in Perl and XML need apply.