Đã hoàn thành

Looking for a developer who is familiar with OCR technologies to develop a software that will be capable of extracting text from pdf files / images and saving the output in a database -- 2

Job Description:

Hello everybody,

We need your help to develop a software that will be capable (through a wizard import) of extracting text from pdf files / images and saving the output in a database.

The files from which we should extrapolate the text are mainly CVs.

So the fields that interest us are Name, Surname, Date of Birth, Email, Address, Region, Education, Work Experience ...etc…

We know from the beginning how the files are visually made:

- European format curriculum;

- Linkedin Curriculum;

- Indeed CV.

But it would be better if we could build something using machine learning and train each time different model

The mechanism will mainly be like this:

Create a dashboard and distinguish two types of users-roles, Admin and SuperAdmin.

Admin Side:

1. The Admin log in on the portal;

2. Choose the type of Curriculum vitae (Eu format, Linkedin or Indeed);

3. Upload one or more files, for example 10 files at a time;

4. Start importing;

5. We should carry out the guided import more or less as it happens in this video ([login to view URL]), with the preview of the files on the left and the imported fields on the right, giving the possibility to modify and correct them, click on next and go to process next file.

Once the process is complete, save everything to the database.

In the dashboard the Admin will have the possibility to:

1. Search, consult, modify and categorize (with labels) the information imported during the ocr recognition process;

2. Select some fields such as Name, Surname, Email and export them in csv, xlsx or pdf file.

SuperAdmin side:

In addition to the Admin capabilities, the SuperAdmin user can:

1. Create / delete Admin users;

2. Check the overall report of all imported data;

3. Check the report of imported data for a given Admin user;

We should then create a module to be installed separately (for both the Admin and SuperAdmin roles) to send single or mass emails to those people whose imported the data.

A resume will surely contain an email field.

Then the "Mail Module" will allow you to select (checkbox) the relevant rows and then click on a button for massive email, where a popup will open with the text to be written.

The "Mail Module" will contain a section called "Settings" where it will be possible to:

1. Configure the email that will be used, then email, password, smtp address, port, ssl / tls encryption

2. Email signature.

Searching the web I found a library called "tesseract-ocr"

[login to view URL]

A wrapper to use it with PHP

[login to view URL]

or directly in Python

[login to view URL]

or on Node.js

[login to view URL]

The latter clearly offers the possibility of using frontend frameworks such as Vue.js or Angular.js

With Vue: [login to view URL]

With Angular: [login to view URL]

With React: [login to view URL]

Typescript: [login to view URL]

Below there is a tutorial on how to create a ocr microservice with Tesseract, PDFBox and Docker

[login to view URL]

Better solutions are welcome!


This is a project that will require future changes and updates, it is not a one-time-job, but it is an investment in a product that will be resold to many (hopefully) customers and which will therefore require (paid) intervention by of a developer for the initial configuration.

Who wants to get on the train?

Tickets are on sale ... :)

Kĩ năng: OCR, Data Extraction, React.js, MongoDB, Machine Learning (ML)

Về khách hàng:
( 9 nhận xét ) Napoli, Italy

ID dự án: #30543607

Được trao cho:


OnPremise Software Delivery with following modules - - User Module with below features - Manual mode - Semi auto - Automatic - Batch processing support - Multi-language - Dashboard - Email module Thêm

$2250 USD trong 80 ngày
(0 Đánh Giá)

30 freelancer chào giá trung bình$2355 cho công việc này


Good day. Hope this proposal finds you in the best of your health. It is my humble offer to present my services to you for this project related to software that will be capable of extracting text from pdf files / image Thêm

$3000 USD trong 25 ngày
(3 Nhận xét)

https://www.freelancer.com/u/nemanjadevelope2 Hello, I am very good at computer vision like OCR. Please check my profile. I have done projects about OCR. Please open chat so let's discuss more. Thank you. Nemanja.

$3000 USD trong 7 ngày
(10 Nhận xét)

Hi, How are you, I have read your description carefully and understood your requirements. As you can see on my portfolio, I am a senior software developer who expertise desktop app development, ML and algorithimic prob Thêm

$2250 USD trong 7 ngày
(1 Nhận xét)

✨ Hi, Good day! ✨ I have great interest in the project as I have all qualities you need. I have a great relevant experience, which is very similar to your project so I am very confident I would be an excellent addition Thêm

$2500 USD trong 21 ngày
(6 Nhận xét)

Hi, We at Tecogno Solutions are a team of Passionate Data Science and Full Stack professionals having more than five years of combined experience in multiple areas including Backend, Frontend, Machine learning (ML), C Thêm

$3000 USD trong 21 ngày
(2 Nhận xét)

Hi!, I am a professional data scientist with 5 years of experience. I hold an MBA and first Degree in statistics which provides me with the necessary background to handle your project. I've carefully checked your requ Thêm

$2500 USD trong 1 ngày
(7 Nhận xét)

Hello, I read your proposal very carefully and thank you for your all kind url. May I help you? I think ur project requires new thechs, maybe I don't know all, but love to do it because I can expand my skills. I like j Thêm

$2000 USD trong 7 ngày
(2 Nhận xét)

Hi, there. Hope you are doing well. I will develop a software that extracting text from pdf files and saving the output in a database. I have been working as a senior full stack developer for over 5 years and have a to Thêm

$1500 USD trong 7 ngày
(7 Nhận xét)

Hi, Greeting of the day. I have gone through your ocr project. There are many ML and image processing based libraries available for OCR. Tesseract is a classical tool and also many new deep learning based open-source Thêm

$2550 USD trong 15 ngày
(3 Nhận xét)

Dear Hiring Manager, I have experience in image processing with python such as cropping, merging, OCR. In the last project I've implemented that comparing system with .docx and converted .pdf files with OCR. For compa Thêm

$3000 USD trong 25 ngày
(2 Nhận xét)

★★★★★ You will succeed!!! ★★★★★ I really want to be contributed to letting your vision come true and have such great ability and proficiency. I have +6 years of experiences in ReactJS, Next and Material-UI are my best Thêm

$2250 USD trong 7 ngày
(2 Nhận xét)

Hi how are you doing I have checked your project's description in detail I think I can complete your OCR projectr perfectly because I have rich experience in this kinda Machine learning project development for 10+ yea Thêm

$2500 USD trong 25 ngày
(1 Nhận xét)

Hi, I read your requirement carefully. I am a professional MERN(MongoDB, Express, ReactJS, NodeJS ) Stack developer. As I have skills like JavaScript, Website Design, Graphic Design, HTML,PHP, ReactJS, NodeJS, MySQL an Thêm

$1500 USD trong 7 ngày
(2 Nhận xét)

hi how are you ? I have an experience with OCR more than five years, but i use C# and ASP.NET. i will do all requirements you need. good day for you

$1500 USD trong 7 ngày
(4 Nhận xét)

Hello. Thanks for your job posting. I just checked your project carefully. So it is very motivated and interesting for me. It is an ideal match for my skill and experience. I have rich experience in PHP, Laravel, React Thêm

$2500 USD trong 30 ngày
(1 Nhận xét)

Hello, Will you please share a detailed requirements of the project along with a sample mockup design of the application? Being a Machine Learning enthusiast and also a full-stack (MERN) developer, I would like to go Thêm

$2000 USD trong 30 ngày
(2 Nhận xét)

Hi there, How u doing? I have came across ur project and i believe i can help u with it as i have great working experience in Data Extraction, Machine Learning (ML), React.js, OCR and MongoDB. Please have a look at my Thêm

$3000 USD trong 14 ngày
(0 Nhận xét)

Hello, I'm a full-stack developer for 3+ years. I looked carefully at your project description. I have a wealth of experiences in development of several backend and frontend frameworks and libraries such as Laravel Thêm

$2000 USD trong 7 ngày
(0 Nhận xét)

Hello, With 7+ years of experiencei in creating and delivering user-centric applications and solutions, I look forward to bringing my strong creative, technical, and analytical skills to your project. Throughout my ca Thêm

$2500 USD trong 7 ngày
(0 Nhận xét)

Hello, How are you? Thank you for watching my offers. Please check my portfolio. I am React, Nodejs expert and have already developed many projects such as Object Recognition and Tracking(CNN, Yolo), Face Recognition( Thêm

$2250 USD trong 10 ngày
(0 Nhận xét)