
Đã đóng
Đã đăng vào
Thanh toán khi bàn giao
I need a lightweight, computer-vision bot that relies on on-screen image recognition rather than Selenium or similar browser-automation libraries. The workflow is straightforward: the bot launches a local browser window, navigates to a specific site I will provide, and—through visual cues—automatically clicks the required elements, fills in form fields, and finally extracts the resulting text, images, and tabular data that the page returns. The process must loop continuously so it can watch for page changes and react in real time. A small delay between cycles is fine as long as the interaction remains smooth and does not overload the system or the target site. Robust error handling is important; if a button is missing or the page layout shifts, the bot should retry gracefully or log the issue without crashing. Please choose your preferred vision stack—OpenCV, PyAutoGUI, SikuliX, or an equivalent solution—and include lightweight OCR where necessary for the text capture. Java, Python, or another language is acceptable as long as setup remains minimal and cross-platform. Key deliverables: • Ready-to-run script or executable with clear configuration for URLs, visual anchors, and form data. • Brief README outlining dependencies, setup steps, and how to extend or retrain the visual templates. • Logging/reporting module that shows each cycle’s success status and the data captured (CSV or JSON is fine). I will provide the site URL, form values, and any visual references you need once we start. Looking forward to a clean, reliable solution built purely on vision-based automation techniques.
Mã dự án: 40349548
72 đề xuất
Dự án từ xa
Hoạt động 5 ngày trước
Thiết lập ngân sách và thời gian
Nhận thanh toán cho công việc
Phác thảo đề xuất của bạn
Miễn phí đăng ký và cháo giá cho công việc
72 freelancer chào giá trung bình $1.775 USD cho công việc này

I am an experienced software developer specializing in browser automation and computer vision solutions. With a background in Python and OpenCV, I have successfully created bots that leverage on-screen image recognition for robust automation tasks. My experience aligns well with your requirement for a lightweight, vision-driven automation bot that avoids traditional libraries like Selenium. My expertise with OpenCV and PyAutoGUI allows me to design bots that smoothly interact with web elements based on visual cues. I understand the importance of continuous looping and real-time reaction for your project's workflow, and I am skilled in implementing robust error handling, ensuring the bot adapts gracefully to page changes or unexpected layout shifts. I am interested in discussing your project further to understand your specific requirements in detail. Could you provide more information on the target website and the expected data flow? I am ready to deliver a clean, reliable solution with comprehensive logging for monitoring bot performance.
$2.500 USD trong 10 ngày
8,5
8,5

With over a decade of experience in vision-driven automation solutions and high-performance systems, I understand your need for a lightweight, computer-vision bot that can navigate websites seamlessly through image recognition. My expertise in building high-complexity systems, such as serving over 1 million users and developing high-security FinTech applications, directly aligns with the challenges your project presents. For your vision-driven browser automation bot, I recommend leveraging OpenCV for robust image processing and PyAutoGUI for seamless user interactions. My past success in developing automation tools with similar complexity ensures that I can deliver a reliable and efficient solution for your specific requirements. I am eager to discuss how we can collaborate to bring your vision-driven automation bot to life. Please feel free to reach out to me to further explore the roadmap for this project.
$2.800 USD trong 45 ngày
8,0
8,0

⭐⭐⭐⭐⭐ Create a Computer Vision Bot for Efficient Form Automation ❇️ Hi My Friend, Hope you are doing well. I have reviewed your project needs and see you are looking for a computer vision bot. You don’t need to look any further; Zohaib is here to help you! My team has successfully completed 50+ similar projects focused on vision-based automation. I will create a lightweight bot that uses on-screen image recognition to perform tasks smoothly and efficiently. ➡️ Why Me? I can easily build your computer vision bot as I have 5 years of experience in automation and image recognition. My expertise includes Python, OpenCV, and robust error handling. I also have a strong grip on other relevant technologies, ensuring a seamless experience for your project. ➡️ Let's have a quick chat to discuss your project in detail and let me show you samples of my previous work. I look forward to discussing this with you in our chat. ➡️ Skills & Experience: ✅ Python Programming ✅ OpenCV ✅ PyAutoGUI ✅ SikuliX ✅ Image Recognition ✅ Error Handling ✅ Automation Scripting ✅ Data Extraction ✅ Logging and Reporting ✅ Cross-Platform Development ✅ Visual Cues Implementation ✅ User-Friendly Configuration Waiting for your response! Best Regards, Zohaib
$708 USD trong 2 ngày
8,1
8,1

Hi, We can build a lightweight, vision-based automation bot exactly as you described no Selenium, purely driven by on-screen recognition. Our team has experience with tools like OpenCV and PyAutoGUI to create stable, responsive bots that interact with websites using visual cues. We’ll develop a solution that launches the browser, detects UI elements, fills forms, extracts text/images/tables (with OCR where needed), and runs in a smooth continuous loop. It will include strong error handling, retry logic, and logging to ensure reliability even if layouts shift. You’ll receive a ready-to-run script, clean configuration for easy updates, a simple README, and structured output (CSV/JSON) with cycle reporting. Let’s build a clean, efficient automation system that runs reliably in real time. With Warm Regards, Jean-Pierre Full Stack Web Developer
$1.755 USD trong 7 ngày
6,7
6,7

Hi there, Can I know why you don't want to play with Selenium? Vision-based automation is not stable. I guess because of anti-bot or contents of the target web?
$1.755 USD trong 7 ngày
6,4
6,4

Hi, you need a browser automation bot that relies on visual feedback rather than brittle DOM selectors to bypass anti-bot detection and handle dynamic UI changes. I specialize in building these vision-driven agents that "see" the page exactly like a human user. I’ve previously developed facial liveness CNN models and performed complex image segmentation, which allows me to integrate computer vision directly into your automation pipeline. I’ll implement this using OpenCV for real-time element recognition and Playwright for the interaction layer, ensuring the bot remains robust against layout shifts. My background in optimizing recurrent neural networks means the processing latency will be minimal. How are you currently handling the anti-bot triggers or CAPTCHAs that typically block automated navigation on your target site?
$3.150 USD trong 7 ngày
6,2
6,2

❇️ Hi My Friend, I hope you're doing well. I've reviewed your project requirements and see you're looking for a lightweight, computer-vision bot that interacts with a website through visual recognition rather than browser automation libraries. Look no further; Muhammad Adil is here to assist you! I will build a reliable bot that clicks elements, fills forms, extracts text/images/tables, and loops continuously while handling errors gracefully. ➡️ Why Me? I have extensive experience with vision-based automation using Python (OpenCV, PyAutoGUI, Tesseract OCR) and cross-platform scripts. I can implement real-time page monitoring, smooth interaction cycles, robust retry logic, and flexible configuration for URLs, visual anchors, and form data. Logging will capture each cycle’s results and store them in CSV or JSON for easy review. ➡️ Let's have a quick chat to discuss your site, visual references, and form details so I can deliver a ready-to-run, lightweight solution with setup instructions and a clean workflow. I look forward to discussing this with you in our chat. ➡️ Skills & Experience: ✅ Vision-Based Automation (OpenCV, PyAutoGUI, SikuliX) ✅ OCR Text Capture (Tesseract) ✅ Cross-Platform Python Development ✅ Form Filling & Element Interaction ✅ Real-Time Monitoring & Looping ✅ Robust Error Handling & Logging ✅ CSV/JSON Data Export ✅ Lightweight, Minimal Dependencies Waiting for your response! Best Regards, Muhammad Adil
$600 USD trong 4 ngày
5,4
5,4

Hello, I’m Karthik with 15+ years of experience in Python automation, computer vision, OCR, and workflow scripting. I can build a lightweight vision-driven browser bot that works through on-screen recognition only, without Selenium or browser automation frameworks. Approach: • Use Python + OpenCV + PyAutoGUI (or SikuliX if better suited) for image-based interaction • Launch browser, navigate to your target URL, detect buttons/fields visually, click/type as needed • Add OCR where required to capture returned text and support layout changes • Extract output data (text/images/tables) into CSV/JSON • Run in continuous loop with safe delays, retries, screenshots, and detailed logging Deliverables: • Ready-to-run script/executable • Configurable URL, form values, and visual anchor templates • Logging/reporting module for each cycle • README with setup, dependencies, and template update steps I focus on reliable visual matching, graceful recovery, and clean maintainable code so the bot keeps running even when minor UI shifts happen. Warm Regards, Karthik B
$2.955 USD trong 7 ngày
5,3
5,3

Hi, I hope you are doing well. Your project 'Vision-Driven Browser Automation Bot' definitely caught my attention. I work extensively with Python and AI to build automation systems, just like you need for this project. I know how to leverage computer vision to build efficiency into these systems rather than relying on outmoded techniques. I'm confident that I can deliver clean, tested, and production-ready code to elevate your project. Your requirements are clear, but let's discuss the project scope in a bit more detail to ensure we are on the same page. Looking forward to possibly working with you.
$2.300 USD trong 18 ngày
5,0
5,0

As an experienced Full-Stack Developer with a special focus on AI and machine learning, I am well-positioned to meet your needs for this project. My proficiency in Python and knowledge of libraries like OpenCV and PyAutoGUI would ensure efficient execution and smooth operation of your browser automation bot. Having worked on similar projects requiring real-time image recognition and data extraction, I can showcase my prowess through building a ready-to-run script or executable that is lightweight, easy to set up, and compatible across platforms. I place significant emphasis on robust error handling and graceful retries, which are vital for such an automation process where buttons may be missing or page layouts may change unexpectedly. My work consistently delivers clean logs and reports that outline each cycle's success status along with the captured data in CSV or JSON format - just as you've asked. Moreover, I am well aware of the crucial role that documentation plays in empowering future development. I assure you a comprehensive README that not only outlines setup requirements but explains how to extend or retrain visual templates, should it be necessary. Stemming from my seven-plus years in the field, diligent communication is embedded in my work ethics; you'll get regular updates about progress, realistic timelines on deliverables as well as continuous support even after the initial project completion.
$1.700 USD trong 7 ngày
4,8
4,8

Hi, I am a full-stack AI developer with 8 years of rich experience with a background in computer vision and automation systems. I am familiar with Python, OpenCV, OCR, automation, data extraction. For this project, the most important part is building a stable vision-based detection flow, because the bot must keep working even when screen elements shift or load inconsistently. I will create a lightweight automation system using image recognition and OCR, with retry logic, logging, and clean data export so the full cycle runs reliably and is easy to maintain. I'm an individual freelancer and can work on any time zone you want. Please contact me with the best time for you to have a quick chat. Looking forward to discussing more details. Thanks. Emile.
$2.500 USD trong 7 ngày
4,3
4,3

Hi there, Strong alignment with this project comes from experience building automation systems using computer vision instead of traditional DOM-based tools like Selenium. Clear understanding of the requirement to create a vision-driven bot that interacts with a browser via on-screen detection, performs actions (click, input, extract), and runs in a continuous loop with robust error handling. Hands-on expertise ensures efficient implementation using Python with OpenCV + PyAutoGUI + OCR (Tesseract/EasyOCR), delivering reliable visual anchoring, adaptive retries, and structured data extraction (CSV/JSON). Risk stays controlled through fallback detection strategies, configurable anchors, logging per cycle, and performance tuning to avoid overload or detection issues. Available to start immediately happy to discuss approach and sample workflow. Recent work: https://www.freelancer.com/u/chiragardeshna Regards Chirag
$1.755 USD trong 7 ngày
4,4
4,4

Hello There!!! ★★★★ ( Vision-based browser automation bot using OpenCV and OCR for real-time actions ) ★★★★ I understand you need a lightweight automation bot that uses on-screen image recognition instead of Selenium, to navigate a site, interact via visual cues, fill forms, and extract data continuously with error handling and logging. ⚜ Vision-based automation using OpenCV/PyAutoGUI ⚜ OCR integration for text extraction ⚜ Automated clicking, typing, and navigation ⚜ Continuous loop with delay and stability ⚜ Smart retry and error handling system ⚜ Logging output in CSV/JSON format ⚜ Easy config for URLs and visual anchors I have worked on automation tools using computer vision and OCR, focusing on reliability even with UI changes. I design bots to adapt using anchor matching and fallback logic. I’ll build a clean Python-based solution with simple setup and extendable configs. Let’s discuss your use-case and get started. Warm Regards, Farhin B.
$1.755 USD trong 7 ngày
4,0
4,0

Hi, This is Jagrati. I understand you need a lightweight, vision-based automation bot that interacts with a browser purely through on-screen image recognition (no Selenium or DOM automation), capable of navigating a site, performing clicks, filling forms, and extracting results in a continuous loop with strong error handling. My approach would be to build this using Python with a combination of OpenCV for template matching, PyAutoGUI for screen interaction, and OCR (Tesseract) for text extraction where needed. The bot will operate by capturing screen regions, identifying UI elements via visual anchors (templates), and executing actions such as clicks and keystrokes based on detected positions. OCR will be used to read dynamic text and validate page states. Deliverables: • Ready-to-run script with configurable visual templates and workflows • OCR integration for text capture and validation • Continuous monitoring loop with retry logic • Logging system outputting cycle status and extracted data (CSV/JSON) • README with setup instructions, dependencies, and guidance for adding/retraining templates Once you share the target site, visual references, and form data, I can tailor the detection logic and interaction flow precisely. Looking forward to working on this. Best regards, JP
$1.500 USD trong 7 ngày
3,5
3,5

With your project description, it's evident that you need a skilled programmer who can design an efficient and lightweight vision-based browser automation solution. As a full-stack software engineer with a penchant for automation, I'm confident that my six-plus years of technical experience will enable me to craft the perfect system for your needs. Not only have I built production web applications end-to-end, but I also specialize in automating system workflows, which aligns perfectly with your goals. In terms of language preference and setup minimization, I am highly skilled in both Python and JavaScript; this allows me to choose the right implementation language based on efficiency and compatibility. I am familiar with OCR technologies such as OpenCV, PyAutoGUI, and SikuliX - ensuring a robust image-recognition system integrated seamlessly into your browser automation bot. When working on projects like yours, I prioritize clean, maintainable architecture to ensure longevity and easy extendibility. Additionally, my experience in deploying and iterating features throughout the development cycle will provide you with an effective workflow that produces bug-free code. Lastly, my strength in data pipelines and analytics can be leveraged for creating detailed logs or reporting modules in CSV or JSON formats. I eagerly look forward to injecting my expertise into crafting a reliable and adept vision-driven browser automation bot for you. Let's solve this challenge together!
$1.755 USD trong 7 ngày
3,3
3,3

Hi, I will build a lightweight computer-vision bot that utilizes visual recognition for seamless interaction with your specified site. With extensive experience in OpenCV and PyAutoGUI, I will ensure the bot efficiently clicks elements, fills forms, and extracts data while maintaining a smooth operation without overloading the system. My approach will include robust error handling to manage layout changes or missing elements gracefully. The bot will loop continuously, allowing real-time reactions to page changes. I’ll provide a ready-to-run script with clear configuration for URLs and visual anchors, along with a concise README for setup and extension. To enhance the solution, I’ll integrate lightweight OCR for text capture and implement a logging module to report each cycle’s success and captured data in your preferred format. I’m ready to start immediately and can adapt the implementation as necessary for optimal performance. Thank you.
$2.016,75 USD trong 7 ngày
3,3
3,3

Hello, I can build a reliable, vision-based automation bot that operates entirely through on-screen recognition without relying on Selenium or browser automation frameworks. I have strong experience in computer vision and automation using Python with OpenCV, PyAutoGUI, and OCR tools like Tesseract, allowing me to create systems that interact with UI elements based purely on visual cues. For your workflow, I will develop a lightweight script that launches a browser, navigates to your target site, and performs all required actions—clicking buttons, filling forms, and extracting text, images, and tabular data—using image matching and OCR where needed. The bot will run in a continuous loop with controlled delays, ensuring smooth operation without overloading the system or target site. I will implement robust error handling so the bot can adapt to minor UI changes, retry failed actions, and log issues without crashing. The final deliverable will include a ready-to-run script or executable, configurable visual anchors and inputs, and a clear README with setup instructions and guidance on updating or retraining templates. Additionally, I will include a logging module that records each cycle’s status and outputs captured data in CSV or JSON format.
$2.100 USD trong 7 ngày
3,0
3,0

Hello, I will build a Python based bot using a popular computer vision library like OpenCV and a GUI automation tool like PyAutoGUI. The bot will identify on screen elements by matching image patterns and interact with the browser directly. I will integrate a lightweight OCR engine to extract text and tabular data from the page. The system will run in a continuous loop with built in error handling to retry actions if the page layout changes or an element is missing. This method avoids traditional browser drivers and relies entirely on visual recognition. 1) Which specific website will the bot be interacting with? 2) Are there any specific screen resolutions or browser types I should optimize for? 3) What is the expected volume of data to be extracted per cycle? Thanks, Nivedita
$600 USD trong 8 ngày
3,4
3,4

You need a vision-driven bot for browser automation that uses on-screen image recognition to navigate, interact, and extract data from a specific site. No Selenium or similar libraries involved. Bot Design: I will use OpenCV for image recognition and PyAutoGUI for interaction. The bot will launch a local browser, navigate to your specified site, and use visual cues to click elements, fill forms, and extract data. Continuous Monitoring: The bot will loop continuously, watching for page changes and reacting in real-time. A small delay will be implemented to ensure smooth interaction without overloading the system or site. Error Handling: Robust error handling will be included to retry or log issues gracefully if elements are missing or layouts change. Deliverables: A ready-to-run script with configuration options for URLs and visual anchors, a README for setup and extension, and a logging module for cycle success and data capture. Timeline: 5 days. Could you confirm if there are any specific visual elements or challenges on the site that might affect the bot's interaction?
$600 USD trong 5 ngày
1,8
1,8

Hey — saw your post about building a vision-driven browser automation bot that relies on on-screen image recognition instead of Selenium. The tricky part with these setups is keeping them stable when UI layouts, colors, or resolutions change even slightly. Quick question before I suggest an approach: Do you already have a preferred CV stack in mind (e.g., OpenCV, template matching, or something like YOLO/ML-based), or are you open to recommendations based on performance needs? I’ve worked on similar vision-based automation where everything is driven by what’s actually on the screen, including handling different resolutions, minor UI changes, and timing issues. If you can share a short spec, a screen recording, or the target site/app, I’ll review it and tell you what’s realistically doable with a lightweight approach.
$1.755 USD trong 7 ngày
2,0
2,0

Dhaka, Bangladesh
Phương thức thanh toán đã xác thực
Thành viên từ thg 4 5, 2026
$250-750 USD
$10-300 USD
₹1500-12500 INR
₹37500-75000 INR
$15-25 USD/ giờ
$10-30 USD
₹1500-12500 INR
$10 USD
₹12500-37500 INR
$8-15 USD/ giờ
$5000-10000 USD
$20000-50000 USD
$30-250 USD
$30-250 USD
₹150000-250000 INR
₹12500-37500 INR
€8-30 EUR
$2-8 USD/ giờ
₹12500-37500 INR
₹12500-37500 INR
$30-250 USD
$5000-10000 USD