
Đã đóng
Đã đăng vào
Thanh toán khi bàn giao
I’m looking for a well-structured Python solution, built around BeautifulSoup (BS4) and any supportive libraries you deem essential, that reliably pulls both product details and customer reviews from Lazada on a daily schedule. The data will fuel ongoing competitor research, so consistency and clarity of the output are critical. I looking specifically to get data using bs4 by bypassing the captcha Here’s how I picture the flow: • Input: category URL(s) or product list I supply in a CSV/JSON. • Scrape: title, price, promos, specs, images, ratings, full review texts, review dates, and reviewer scores. • Output: clean CSV or JSON dropped into a dated folder after each run. Make the script easy to tweak if Lazada changes its markup. Acceptance criteria 1. Script written in Python 3, primary parser BS4, with clear README on setup and dependencies (requests, selenium for dynamic pages if needed, etc.). 2. One-click (or single command) launch that completes without errors and produces a sample file from a test URL I provide. 3. Simple logging that flags failed pages or blocked requests so I can retry. 4. Code remains within Lazada’s permissible request rate to minimize captchas or bans. Deliver the .py file(s), [login to view URL], [login to view URL], and a short video or screenshots demonstrating a successful daily run. If something in the brief is unclear, feel free to suggest improvements—I’m open to your best practices. Note: is it possible to scrape 3000 urls with in 1 hour.
Mã dự án: 40216136
22 đề xuất
Dự án từ xa
Hoạt động 1 tháng trước
Thiết lập ngân sách và thời gian
Nhận thanh toán cho công việc
Phác thảo đề xuất của bạn
Miễn phí đăng ký và cháo giá cho công việc
22 freelancer chào giá trung bình ₹32.130 INR cho công việc này

I'll deliver a Python solution using BeautifulSoup and supportive libraries to scrape Lazada product details and customer reviews on a daily schedule, handling captcha bypass and providing a well-structured output in CSV or JSON format, with a focus on consistency, clarity, and adaptability to Lazada's markup changes, and ensuring compliance with Lazada's request rate to minimize captchas or bans, I'll also consider the possibility of scraping 3000 urls within 1 hour. Waiting for your response in chat! Best Regards.
₹100.000 INR trong 3 ngày
5,0
5,0

I am ready to create scraper code in Python using BS4 and Selenium to bypass captchas, ensuring daily data extraction from Lazada. I will ensure high-quality, timely delivery, clear communication throughout the process, and continued support if required after implementation. Note: The script will handle CSV/JSON inputs, scrape product details and reviews, and output clean files into dated folder and Wil include logging for errors and adhere to Lazada’s request limits. Please chat me to start on this.
₹7.500 INR trong 3 ngày
3,7
3,7

Hello, I’ll create a Python scraper using BS4 and Selenium to bypass captchas, ensuring daily data extraction from Lazada. The script will handle CSV/JSON inputs, scrape product details and reviews, and output clean files into dated folders. I’ll include logging for errors and adhere to Lazada’s request limits. With 5+ years of experience, I’ll ensure the code is modular and easy to update. Let me know if you’d like to see samples of similar projects. Thanks, Adegoke. M
₹5.625 INR trong 3 ngày
3,0
3,0

Hello, I will develop a reliable, well-structured Python scraping solution using BeautifulSoup (BS4) and supplementary libraries (like Requests) to extract product details and customer reviews from Lazada on a daily schedule. The script will ingest your category URLs or product lists and focus on accurately scraping all required fields, including title, price, promos, specs, images, ratings, and full review texts/scores. Crucially, I will implement a robust method to handle and bypass the site's anti-bot mechanisms, like CAPTCHAs, without relying on computationally heavy, full-browser automation. The output will be a clean CSV or JSON file dropped into a dated folder after each run, with a modular design to ensure easy tweaking for future markup changes. 1) What is the highest projected number of total unique product pages the script needs to visit daily? 2) Do you prefer the final output format to be CSV or JSON? 3) Are you providing a proxy list or a third-party CAPTCHA-solving service API key? Thanks, Nivedita
₹12.000 INR trong 7 ngày
3,4
3,4

Hi there, I have analyzed your requirement to scrape Lazada (Products & Reviews). Regarding your specific question: "Is it possible to scrape 3000 URLs in 1 hour?" The Answer: YES, it is possible, but strictly under one condition: We must use Asynchronous Requests or Multi-threading combined with proper header rotation to avoid IP Bans. My Technical Approach (The Safe Way): Since BS4 alone cannot bypass Captchas (it is only a parser), I will build a robust hybrid solution: Request Layer: I will use a specialized library (like cloudscraper or selenium-stealth) to handle the initial connection and bypass Lazada's anti-bot check/captcha. Parsing Layer (BS4): Once the HTML is retrieved safely, I will feed it into BeautifulSoup (BS4) for extremely fast extraction of Title, Price, Specs, and Reviews. Speed Optimization: I will implement Multi-threading to process multiple URLs in parallel, achieving your 3000/hour target efficiently. Deliverables: Clean Python Script (.py) with requirements.txt. CSV/JSON Output logic. Error Logging (Retry logic for failed requests). A video demo showing the script running at speed. I understand Lazada's structure well. Let's discuss the details.
₹27.000 INR trong 7 ngày
2,7
2,7

Hello Srinivas V., I always focus on understanding the full scope of a project before getting started, ensuring that every detail aligns with your goals and expectations. We are an expert team which have many years of experience on JavaScript, Python, Web Scraping, Software Architecture, JSON, Data Extraction, BeautifulSoup, Data Analysis, Selenium, Automation Please come over chat and discuss your requirement in a detailed way. Regards
₹7.000 INR trong 7 ngày
0,0
0,0

Hello, I’m Aditya Prasetya, an experienced full-stack developer with a strong focus on providing scalable and efficient solutions. My expertise in JavaScript, Python, and PHP, along with my diverse project background, makes me a great fit for your Lazada Daily Scraper job. I’ve honed my web scraping skills through work on ERP applications integrated with platforms like e-commerce, AI, and accounting systems. I have proven experience with databases like MySQL, PostgreSQL, and MongoDB, essential for managing the large amounts of data your project requires. My expertise with BeautifulSoup (BS4), as requested, further strengthens my ability to tackle this project effectively. I’m skilled at scraping while bypassing Captcha codes efficiently, ensuring compliance with Lazada’s request rate to avoid bans and Captchas. Beyond my technical capabilities, my approachable, solution-oriented approach makes me easy to collaborate with. If there’s any ambiguity or opportunity for improvement in your brief, I’m open to suggestions and will work closely with you to ensure a perfect match with your requirements. In summary, my skills, experience, and commitment make me an ideal candidate to streamline your competitor research by providing precise, reliable data for your needs.
₹12.500 INR trong 14 ngày
0,0
0,0

Hi, I reviewed your requirements for a daily Lazada scraper and this is well within the kind of Python automation I build. I’d approach this with a clean BS4-first architecture, keeping the parser modular so selectors can be updated easily if Lazada’s markup changes. For pages that require rendering, Selenium can be used selectively rather than globally to keep runs stable and efficient. For captchas and rate limits, the focus would be on request pacing, session handling, headers, and retries within permissible limits, with clear logging when a page is blocked so it can be retried safely. I wouldn’t rely on brittle hacks that break after a few runs. Output would be consistent CSV/JSON files in dated folders, with simple logs indicating success, partial failures, or blocks. I’ll also include a short README and a one-command run setup. Regarding volume, scraping thousands of URLs depends on page weight, throttling, and whether dynamic rendering is needed. I can help you benchmark this realistically and tune the run so it’s reliable day-to-day rather than fast once and broken later. Happy to clarify inputs or suggest improvements if needed before starting.
₹5.000 INR trong 5 ngày
0,0
0,0

GSINFOTECH OPC Pvt. Ltd. – Your Trusted Tech Partner Based in New Delhi, GSINFOTECH OPC Pvt. Ltd. is a professional IT solutions & software development company delivering secure, scalable, and high-performance digital solutions for startups and enterprises. We help businesses convert ideas into powerful, market-ready products. Our Services • Mobile App Development (Android & iOS) • Desktop Software Development (C#, Java, .NET) • Custom Software & Web Application Development • Website Design & Development (WordPress, Joomla, Drupal) • Laravel, React JS & Node JS Development • Game Design & Development • Blockchain Solutions • AI, Automation & Custom Tools • Meta Trading Tools, Bot Scripting & Web Scraping • SEO, Digital Marketing & Branding • Video Editing & Multimedia Production Technologies We Use • React JS, Node JS, MongoDB • Python (Django) • Android Studio (Java/Kotlin), iOS (Swift) • Flutter & React Native Why Choose Us? ✔ Modern, cost-effective & scalable solutions ✔ Experienced & creative development team ✔ Transparent workflow & 100% client satisfaction ✔ Secure, optimized & future-ready technology ✔ On-time delivery & dedicated support ✔ Flexible pricing – negotiation available Let’s build something amazing together! Hire GSINFOTECH OPC Pvt. Ltd. to take your project to the next level.
₹4.000 INR trong 7 ngày
0,0
0,0

I am an excellent fit for your project, having successfully completed similar work in the past. Your need for a clean, professional, user-friendly Python script that uses BeautifulSoup to scrape Lazada product details and customer reviews daily, while bypassing captchas, aligns perfectly with my skills. Ensuring seamless, integrated, and automated data extraction with clear outputs like CSV or JSON is absolutely doable. I specialize in web scraping using Python, BS4, requests, and Selenium for dynamic content. Even though I am new here, I have worked on numerous projects outside of freelancer and developed the skills necessary to complete this work effectively. I’d be glad to discuss your project—at best, we find a strong fit to work together; at minimum, you receive a complimentary consultation. Regards, Keagan.
₹5.750 INR trong 14 ngày
0,0
0,0

I will develop a robust Python-based Lazada scraper using BeautifulSoup for precise HTML parsing, ensuring reliable daily extraction of product data, prices, and ratings. My technical approach includes structured error handling, logging, and scheduler-driven automation for consistent operation, with output in CSV/JSON format. The implementation plan involves two phases: development/testing followed by deployment with monitoring. My budget for this complete solution is 9,980 INR. Could you confirm if you require real-time stock status tracking alongside the product data?
₹9.980 INR trong 4 ngày
0,0
0,0

YELAHANKA, India
Phương thức thanh toán đã xác thực
Thành viên từ thg 3 25, 2018
₹600-1500 INR
₹600-1500 INR
₹600-1500 INR
₹1500-12500 INR
₹1500-12500 INR
$15-25 USD/ giờ
$1500-3000 USD
$10-30 USD
$30-250 USD
€30-250 EUR
$2-8 AUD/ giờ
$150-350 USD
$15-25 USD/ giờ
$15-25 USD/ giờ
$250-750 USD
$30-250 AUD
$8-15 USD/ giờ
₹12500-37500 INR
₹1500-12500 INR
$25-50 USD/ giờ
£250-750 GBP
$250-750 USD
$750-1500 USD
$30-250 AUD
$30-250 CAD