We need infrastructure design for this project:
- database type
- computing power estimation
- database structure proposal - Client / Client of our Client (CC) / CC documents.
- No of Clients estimation - 500
- No of CC estimation - 15.000
- structure to be scalable to 2000 Clients and 600.000 CC.
We are developing a cloud app that needs to process large amounts of documents - PDF and XML files attached to the pdf to extract data from it and store it for further analyses. Information extracted should be structured as following:
Our client has multiple other clients, we process data for our client clients.
Data extractions consist in an average of 30 fields per document.
Number of documents to work with ~ 25000 documents per day - 8 hours a day to take the process, it must be done during working hours.
There are 2 types of documents and each would have more then 20 variants, each month we would have same document types only other data that needs to be extracted.
I've attached the 2 types of documents with 1 variant.
12 freelancer chào giá trung bình$93 cho công việc này
Hi I am a writer and I have a skill to convert pdf and other file I have a speed in typing and will complete you project at time.i hope you will like my work