Đã Hủy

Build a C# Scanned Document Filer -- 2 - 14/10/2017 18:01 EDT

Job Requirements

Looking to build an automatic filing system for our scanner.

We want to keep this as simple as possible. We already have an API which connects to our database and downloads to an XML file.

We need to have a SQL Lite database which converts this xml file into a database.

(See the attached .xml file)

Here is the job flow:

1. We scan a document from our scanner.

2. The scanner uses a OCR reader on the .pdf and saves it into the “Scan Folder”

3. The scanned items will appear in the GUI interface of our c# program ,which will be watching the “Scan Folder” and will update it as items appear. Every client in the database will have there own folder (this needs to be managed by syncing the folders with the .xml files. The updates need to happen automatically every hour.

4. All of the following will only happen once we push the “Automatch folder:

5. We then parse the .pdf with iText 7 Community .NET and save the parsed text to an array.

6. We then use regex to search for the TFN or ABN numbers. The TFN numbers are 8 digits or 9 digits in a row, and an ABN is an 11 digit number. We then run the whole document through [url removed, login to view] API to identify who the main “People” or “Organisations” are in case a TFN or ABN is not present. In the situation when a TFN or ABN is not present, then we will have to match the results from this against all of the customers in our database (Scan Folder). The year also needs to be regexed based on the parsed text.

7. Once we have identified potential matches we will display them in the GUI interface

8. The folder is just the client name, the Match, needs to display on what basis there is a match. If the TFN is automatched and the data is automatched then it can just be automatically allocated to the client folder. In which case the status updates to Complete . If only the TFN is automatched then it needs to confirm the date.

9. If the ABN is matched/there is a name match between the .pdf and the database we need the possible matches to be identified, and then the users will have to select the right one.

10. Once an item has been automatched or manually matched, it needs to have the status of complete, and we need it to be saed to the [url removed, login to view] file. It also needs to be moved to the correct client folder.

Kỹ năng: .NET, Lập trình C#, Microsoft SQL Server, SQL, XML

Xem thêm: edit scanned document, how do i scan a document as a pdf, convert scan to pdf online, how to convert scanned document to pdf, how can i scan a document and save it as a pdf?, how to convert scanned document to pdf for free, scan documents to pdf free, scanned document editor online, scanned document excel, extract data scanned document, convert scanned document database, convert scanned document excel, convert scanned document jpg excel file, scanned document convert excel worksheet, copying scanned document word 2007

Về Bên Thuê:
( 1 nhận xét ) Sydney, Australia

Mã Dự Án: #15400475

Đã trao cho:


Hi I bid as we discussed for the requirements in the project description Relevant Skills and Experience C# Proposed Milestones $650 AUD - Full

$650 AUD trong 3 ngày
(93 Đánh Giá)

2 freelancer đang chào giá trung bình $450 cho công việc này


Hi sir I can do this job for you right away away as I have great experience with this Relevant Skills and Experience PHP Proposed Milestones $250 AUD - default

$250 AUD trong 3 ngày
(11 Đánh Giá)