Closed

Create a Tesseract configuration to split documents based on patchcode T pages

We have two types of documents:

- multipage PDF files (which may already contain OCR-detected text)

- multipage TIFF files

These documents contain standardized patchcode T separator pages.

Samples of the patchcode T

- [url removed, login to view] on page 11

- [url removed, login to view] on page 75

Your job is to provide us with a shell script that:

- takes either a PDF file or a TIFF file as input (selectable by parameter)

- parses the file and splits it at each patchcode T page into multiple files (of the same file type)

- performs OCR on the content (OCR must be switchable on/off)
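
To make the requested behaviour concrete, here is a minimal sketch of two helper functions such a script might contain. It only covers parameter handling and the computation of output page ranges; detecting the patchcode T pages, splitting the file, and running OCR are not shown. The function names, the option letters `-t` and `-o`, and the decision to drop the separator pages from the output are assumptions, not part of the requirements above.

```shell
#!/bin/sh
# Sketch only: detection of the separator pages is NOT implemented here.

# page_ranges TOTAL [SEP...]
# TOTAL is the page count; SEP... are the 1-based page numbers of the
# separator pages, in ascending order.
# Prints one "first-last" range per output document.
# Assumption: separator pages are dropped, and an empty range (for example
# two separators in a row) produces no output document.
# The body runs in a subshell, so its variables do not leak to the caller.
page_ranges() (
  total=$1
  shift
  start=1
  for sep in "$@"; do
    if [ "$sep" -gt "$start" ]; then
      echo "${start}-$((sep - 1))"
    fi
    start=$((sep + 1))
  done
  if [ "$start" -le "$total" ]; then
    echo "${start}-${total}"
  fi
)

# parse_args -t pdf|tiff [-o on|off] FILE
# Sets TYPE, OCR (default: off) and INPUT; returns non-zero on bad input.
# The option letters are hypothetical.
parse_args() {
  TYPE="" OCR="off" INPUT="" OPTIND=1
  while getopts "t:o:" opt; do
    case $opt in
      t) TYPE=$OPTARG ;;
      o) OCR=$OPTARG ;;
      *) return 1 ;;
    esac
  done
  shift $((OPTIND - 1))
  INPUT=${1:-}
  case $TYPE in pdf|tiff) ;; *) return 1 ;; esac
  case $OCR in on|off) ;; *) return 1 ;; esac
  [ -n "$INPUT" ]
}
```

For a 10-page file with separators on pages 4 and 8, `page_ranges 10 4 8` prints `1-3`, `5-7` and `9-10`, one per line. Each range could then be handed to a splitting tool; for PDF input, one option is `qpdf --empty --pages in.pdf 1-3 -- out.pdf`.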

Ensure that the patchcode page can contain arbitrary content between the code lines (as in the samples).


A Java implementation is also acceptable as an alternative to the shell script.

Skills: Java, OCR, Shell Script


About the employer:
(10 reviews) Stuttgart, Germany

Project ID: #15704623

7 freelancers are bidding an average of €64 for this job

skfaroo123

Hi, I am very interested in your project. I have strong experiences with yours. I am looking forward to discussing your project. Best Regards. Relevant Skills and Experience: OCR. Proposed Milestones: €100 EUR - negotiabl…

€36 EUR in 10 days
(13 reviews)
4.5
iitmshanker

A proposal has not yet been provided

€200 EUR in 2 days
(1 review)
3.2
ranzhie07

I have an existing project ready that is similar to your needs. I use an enhanced Tesseract OCR. Stay tuned, I'm still working on this proposal.

€61 EUR in 0 days
(1 review)
1.2
sreejith1993

Hi, I hope you are doing fine. I have relevant experience in parsing PDF and reading TIFF images using Java. I have also worked with the Tesseract OCR engine, and I assure you I am the best fit. Relevant Skills and…

€45 EUR in 10 days
(0 reviews)
0.0
Codingtech1

Hi, I have read your project description, so I can do it perfectly. I have 7+ years of experience with all kinds of software development and programming languages. I have completed a lot of projects with Script, Java, OCR which is re…

€15 EUR in 5 days
(0 reviews)
0.0
Fabritech

We have professional, innovative, and authentic logo developers and web designers to create a website and logo for your company. Relevant Skills and Experience: PHP (Wordpress, Codeigniter, Joomla, Magento, Drupal) a…

€36 EUR in 2 days
(0 reviews)
0.0
livezingy

Thanks very much for your invitation! I'm not the right one, because I'm not familiar with shell scripting and Java. I came here to say thanks and sorry to you. Best wishes to you! Relevant Skills and Experience…

€55 EUR in 10 days
(0 reviews)
0.0