Looking for a developer familiar with OCR technologies to develop software capable of extracting text from PDF files / images and saving the output in a database -- 2
Budget $1500-3000 USD
Job Description:
Hello everybody,
We need your help to develop software that will be capable (through an import wizard) of extracting text from PDF files / images and saving the output in a database.
The files from which we need to extract the text are mainly CVs.
So the fields that interest us are Name, Surname, Date of Birth, Email, Address, Region, Education, Work Experience, etc.
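As a rough illustration of pulling a couple of these fields out of already-OCR'd text, here is a minimal Python sketch. The `CvFields` class, the `extract_fields` function and both regular expressions are hypothetical, and the date pattern assumes day/month/year ordering; real CVs would need per-format rules or a trained model.

```python
import re
from dataclasses import dataclass
from typing import Optional

# Simplified patterns (assumptions, not a complete validator).
EMAIL_RE = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")
# Assumes dates written as DD/MM/YYYY, DD-MM-YYYY or DD.MM.YYYY.
DOB_RE = re.compile(r"\b(\d{1,2})[/.-](\d{1,2})[/.-](\d{4})\b")


@dataclass
class CvFields:
    """Hypothetical container for two of the fields listed above."""
    email: Optional[str] = None
    date_of_birth: Optional[str] = None  # stored as ISO YYYY-MM-DD


def extract_fields(text: str) -> CvFields:
    """Pick the first email and the first plausible date from OCR text."""
    fields = CvFields()
    match = EMAIL_RE.search(text)
    if match:
        fields.email = match.group(0).lower()
    match = DOB_RE.search(text)
    if match:
        day, month, year = (int(part) for part in match.groups())
        if 1 <= day <= 31 and 1 <= month <= 12:
            fields.date_of_birth = f"{year:04d}-{month:02d}-{day:02d}"
    return fields
```

Fields such as Name, Education or Work Experience are much harder to capture with patterns alone, which is where the per-format templates or machine learning would come in.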
We know in advance what the files look like:
- European format curriculum;
- LinkedIn curriculum;
- Indeed CV.
But it would be better if we could build something using machine learning and train a different model for each format.
The mechanism will mainly be like this:
Create a dashboard and distinguish two user roles, Admin and SuperAdmin.
Admin Side:
1. The Admin logs in to the portal;
2. Choose the type of curriculum vitae (EU format, LinkedIn or Indeed);
3. Upload one or more files, for example 10 files at a time;
4. Start importing;
5. Carry out the guided import more or less as shown in this video ([login to view URL]): a preview of the file on the left and the imported fields on the right, with the possibility to modify and correct them, then click Next to process the next file.
Once the process is complete, save everything to the database.
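The final "save everything" step could look roughly like the sketch below, which uses SQLite only because it ships with Python. The `candidates` table, its columns and the `save_reviewed` function are assumptions made for illustration; the posting does not specify a database or schema.

```python
import sqlite3

# Hypothetical schema: one row per reviewed CV.
SCHEMA = """
CREATE TABLE IF NOT EXISTS candidates (
    id INTEGER PRIMARY KEY AUTOINCREMENT,
    cv_format TEXT NOT NULL,      -- 'eu', 'linkedin' or 'indeed'
    name TEXT,
    surname TEXT,
    email TEXT,
    source_file TEXT NOT NULL
)
"""

INSERT = """
INSERT INTO candidates (cv_format, name, surname, email, source_file)
VALUES (:cv_format, :name, :surname, :email, :source_file)
"""


def save_reviewed(conn: sqlite3.Connection, records: list) -> int:
    """Store the records the Admin confirmed in the wizard; return row count."""
    conn.execute(SCHEMA)
    with conn:  # commits on success, rolls back if an insert fails
        conn.executemany(INSERT, records)
    return conn.execute("SELECT COUNT(*) FROM candidates").fetchone()[0]
```

Writing the whole batch in one transaction means a failure on file 7 of 10 does not leave a half-imported batch behind.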
In the dashboard the Admin will have the possibility to:
1. Search, consult, modify and categorize (with labels) the information imported during the OCR recognition process;
2. Select some fields such as Name, Surname, Email and export them to a CSV, XLSX or PDF file.
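For the CSV case, exporting only the selected fields can be sketched with Python's standard `csv` module. The `export_csv` function and the dictionary-per-row layout are assumptions; XLSX and PDF export would need third-party libraries and are not shown.

```python
import csv
import io


def export_csv(rows: list, columns: list) -> str:
    """Return CSV text containing only the chosen columns, in the given order."""
    buffer = io.StringIO(newline="")
    # extrasaction="ignore" drops fields the Admin did not select.
    writer = csv.DictWriter(buffer, fieldnames=columns, extrasaction="ignore")
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()
```

For example, passing `["name", "surname", "email"]` as `columns` leaves out address, region and any other stored field.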
SuperAdmin side:
In addition to the Admin capabilities, the SuperAdmin user can:
1. Create / delete Admin users;
2. Check the overall report of all imported data;
3. Check the report of imported data for a given Admin user;
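One simple way to model the two roles is a permission table in which SuperAdmin inherits everything Admin can do. The action names below (`import_cv`, `manage_admins` and so on) are hypothetical labels for the capabilities listed above, not part of any existing system.

```python
from enum import Enum


class Role(Enum):
    ADMIN = "admin"
    SUPER_ADMIN = "superadmin"


# Hypothetical action names for the Admin capabilities described above.
ADMIN_ACTIONS = frozenset({"import_cv", "search", "edit", "label", "export"})

# SuperAdmin gets every Admin action plus user management and reports.
SUPER_ADMIN_ACTIONS = ADMIN_ACTIONS | {
    "manage_admins",
    "view_overall_report",
    "view_admin_report",
}

PERMISSIONS = {
    Role.ADMIN: ADMIN_ACTIONS,
    Role.SUPER_ADMIN: SUPER_ADMIN_ACTIONS,
}


def can(role: Role, action: str) -> bool:
    """Return True if the role is allowed to perform the action."""
    return action in PERMISSIONS[role]
```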
We should then create a module to be installed separately (for both the Admin and SuperAdmin roles) to send single or mass emails to the people whose data was imported.
A resume will surely contain an email field.
The "Mail Module" will then allow you to select (via checkboxes) the relevant rows and click a button for mass email, which opens a popup where the text can be written.
The "Mail Module" will contain a section called "Settings" where it will be possible to:
1. Configure the email account that will be used: email address, password, SMTP address, port, SSL / TLS encryption;
2. Set the email signature.
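The settings above map fairly directly onto Python's standard `smtplib` and `email` modules. This is a sketch under assumptions: `MailSettings`, `build_message` and `send_mass` are hypothetical names, and password storage, rate limiting and error handling are left out.

```python
import smtplib
import ssl
from dataclasses import dataclass
from email.message import EmailMessage


@dataclass
class MailSettings:
    """Hypothetical mirror of the "Settings" section."""
    sender: str
    password: str
    host: str
    port: int
    encryption: str  # "ssl" or "tls"
    signature: str = ""


def build_message(settings: MailSettings, recipient: str,
                  subject: str, body: str) -> EmailMessage:
    """Create one message, appending the signature when one is configured."""
    message = EmailMessage()
    message["From"] = settings.sender
    message["To"] = recipient
    message["Subject"] = subject
    text = body
    if settings.signature:
        text = f"{body}\n\n-- \n{settings.signature}"
    message.set_content(text)
    return message


def send_mass(settings: MailSettings, recipients: list,
              subject: str, body: str) -> None:
    """Send the same text to every selected recipient, one message each."""
    context = ssl.create_default_context()
    if settings.encryption == "ssl":
        server = smtplib.SMTP_SSL(settings.host, settings.port, context=context)
    else:
        server = smtplib.SMTP(settings.host, settings.port)
    with server:
        if settings.encryption == "tls":
            server.starttls(context=context)
        server.login(settings.sender, settings.password)
        for recipient in recipients:
            server.send_message(build_message(settings, recipient, subject, body))
```

Sending one message per recipient, rather than a single message with many addresses, keeps the selected candidates from seeing each other's email addresses.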
Searching the web I found a library called "tesseract-ocr"
[login to view URL]
A wrapper to use it with PHP
[login to view URL]
or directly in Python
[login to view URL]
or on Node.js
[login to view URL]
The latter clearly offers the possibility of using frontend frameworks such as Vue.js or Angular.js
With Vue: [login to view URL]
With Angular: [login to view URL]
With React: [login to view URL]
Typescript: [login to view URL]
Below is a tutorial on how to create an OCR microservice with Tesseract, PDFBox and Docker
[login to view URL]
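For the Python route, a minimal sketch might look like the following. It assumes the third-party `pytesseract` and `Pillow` packages plus a local Tesseract installation; `ocr_image` and `clean_ocr_text` are hypothetical helper names. PDF input would first have to be rendered to images, which is not shown here.

```python
def ocr_image(path: str, lang: str = "eng") -> str:
    """Run Tesseract on a single image file and return the raw text."""
    # Imported here so the module still loads where the OCR stack is missing.
    from PIL import Image
    import pytesseract

    with Image.open(path) as image:
        return pytesseract.image_to_string(image, lang=lang)


def clean_ocr_text(raw: str) -> str:
    """Collapse runs of whitespace and drop empty lines from OCR output."""
    lines = (" ".join(line.split()) for line in raw.splitlines())
    return "\n".join(line for line in lines if line)
```

A call such as `clean_ocr_text(ocr_image("cv_page.png"))` (the file name is a placeholder) would yield text ready for field extraction.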
Better solutions are welcome!
Attention:
This project will require future changes and updates. It is not a one-time job but an investment in a product that will be resold to (hopefully) many customers, and which will therefore require (paid) intervention by a developer for the initial configuration.
Who wants to get on the train?
Tickets are on sale ... :)
Awarded to:
OnPremise Software Delivery with following modules - User Module with below features - Manual mode - Semi auto - Automatic - Batch processing support - Multi-language - Dashboard - Email module ...
30 freelancers are bidding an average of $2355 for this job
https://www.freelancer.com/u/nemanjadevelope2 Hello, I am very good at computer vision like OCR. Please check my profile. I have done projects about OCR. Please open chat so let's discuss more. Thank you. Nemanja.
Hi, How are you, I have read your description carefully and understood your requirements. As you can see on my portfolio, I am a senior software developer who expertise desktop app development, ML and algorithmic prob ...
✨ Hi, Good day! ✨ I have great interest in the project as I have all qualities you need. I have a great relevant experience, which is very similar to your project so I am very confident I would be an excellent addition ...
Hi!, I am a professional data scientist with 5 years of experience. I hold an MBA and first Degree in statistics which provides me with the necessary background to handle your project. I've carefully checked your requ ...
Hello, I read your proposal very carefully and thank you for your all kind url. May I help you? I think ur project requires new techs, maybe I don't know all, but love to do it because I can expand my skills. I like j ...
Hi, there. Hope you are doing well. I will develop a software that extracting text from pdf files and saving the output in a database. I have been working as a senior full stack developer for over 5 years and have a to ...
★★★★★ You will succeed!!! ★★★★★ I really want to be contributed to letting your vision come true and have such great ability and proficiency. I have +6 years of experiences in ReactJS, Next and Material-UI are my best ...
hi how are you ? I have an experience with OCR more than five years, but i use C# and ASP.NET. i will do all requirements you need. good day for you
Hello. Thanks for your job posting. I just checked your project carefully. So it is very motivated and interesting for me. It is an ideal match for my skill and experience. I have rich experience in PHP, Laravel, React ...
Hi there, How u doing? I have came across ur project and i believe i can help u with it as i have great working experience in Data Extraction, Machine Learning (ML), React.js, OCR and MongoDB. Please have a look at my ...
Hello, I'm a full-stack developer for 3+ years. I looked carefully at your project description. I have a wealth of experiences in development of several backend and frontend frameworks and libraries such as Laravel ...
Hello, With 7+ years of experience in creating and delivering user-centric applications and solutions, I look forward to bringing my strong creative, technical, and analytical skills to your project. Throughout my ca ...