Bộ lọc

Tìm kiếm gần đây của tôi
Lọc theo:
Ngân sách
đến
đến
đến
Loại
Nhiều kỹ năng
Ngôn ngữ
    Tình trạng công việc
    3,443 scrapy công việc được tìm thấy

    ...tiết ngay sau khi bắt đầu dự án). Mục tiêu – Lấy đầy đủ tên sản phẩm, giá hiện tại, giá gốc (nếu có), thuộc tính chính, tên shop, lượt bán, lượt thích, điểm rating và đường dẫn ảnh. – Crawler phải đi hết các trang trong danh mục, không bỏ sót, không trùng dữ liệu. – Kết quả xuất ra CSV / Excel; cột nào ra cột nấy, tiếng Việt không lỗi font. Yêu cầu kỹ thuật – Ưu tiên Python sử dụng Scrapy, Requests + BeautifulSoup hoặc Selenium nếu cần vượt qua cơ chế chống bot của Shopee. – Code sạch, chú thích ngắn gọn, dễ đọc; tôi cần toàn bộ source kèm hướng dẫn chạy trên...

    $379 Average bid
    $379 Giá đặt trung bình
    17 lượt đặt giá
    Project for duongpv7
    Đã kết thúc left

    Hi Dương, công ty Silicon Straits Saigon hiện tại có nhu cầu tìm một bạn Python part time giúp viết các scrapper trên ngôn ngữ Python/Scrapy. Công việc có thể làm remote hoàn toàn tuy nhiên buổi đầu tiên hy vọng bạn có thể ghé văn phòng để thảo luận công việc & hợp đồng lâu dài. Nếu bạn mong muốn tìm hiểu thêm có thể trả lời trực tiếp cho mình qua hệ thống này.

    $500 Average bid
    $500 Giá đặt trung bình
    1 lượt đặt giá
    Multi-Platform Social Data Scrape
    6 ngày left
    Đã xác thực

    ...comment text, number of comments, likes, reposts/shares, the post date and any other readily available metadata (author handle, follower count, post URL, media links, etc.). Accuracy is critical because the data will feed a trend-analysis dashboard later. Please build the workflow in a way that respects rate limits and login requirements: if you intend to use official APIs, private APIs, Selenium, Scrapy, Playwright, or headless browsers, spell that out so I know how sustainable the solution will be. The final hand-off should include: • A clean, well-commented reusable script (Python preferred) • A short README explaining environment setup, keyword input format and how to extend to new regions • The full export in CSV so I can validate before sign-off If an...

    $728 Average bid
    $728 Giá đặt trung bình
    12 lượt đặt giá

    ...reliable associated sources. Specific sources: Euromillones: (since Feb 13, 2004) La Primitiva: (since Oct 17, 1985 – modern version) El Gordo de la Primitiva: (since Oct 31, 1993) Updates automatic at exactly 00:02 the day after each draw, using ethical scraping (BeautifulSoup/Scrapy) with proper user-agent headers to mimic human behavior. Store data in PostgreSQL (structured) or MongoDB (flexible), including all prize categories to enable ROI calculations and backtesting. 2.2. Number Prediction Generate predictions for Euromillones, La Primitiva and/or El Gordo simultaneously using explicit advanced AI models: Machine Learning ensembles (Random Forests) for frequency/statistical

    $1421 Average bid
    $1421 Giá đặt trung bình
    78 lượt đặt giá

    I have three specific school-website links that list all current teachers and administrators. From each page I need a clean scrape of every staff member’s name, role, email address, plus the city/town and the school name, compiled into a single Excel workbook. ...points to keep in mind: • Final deliverable: one Excel file ready for copy-and-paste outreach. • Source material: my three school websites and the driver URLs I will supply. No other sources are required. • Required fields: Name, Role, Email, City/Town, School (or Company for drivers). • Accuracy matters; no duplicate or bounced addresses. If you normally work with Python, BeautifulSoup, Scrapy, or similar web-scraping tools, that’s perfect—as long as the end result is the...

    $25 Average bid
    Bảo đảm
    $25
    30 bài tham dự

    ...scraped, the information should be organised into a clean CSV file—one row per page—with columns for page URL, full body text, image file names, and link destinations. Please download the images themselves as well and bundle them in a separate folder (a simple ZIP is fine); the CSV should reference the exact filenames so everything lines up. I’m happy for you to use Python with BeautifulSoup, Scrapy, Selenium or whichever stack you prefer, as long as the final output meets these acceptance criteria: • Complete CSV containing text, image names, and link URLs for each page • All images successfully downloaded and accessible via the filenames listed in the CSV • No duplicates or missing pages from the target site * Images need to be sort...

    $213 Average bid
    $213 Giá đặt trung bình
    171 lượt đặt giá

    I have a data-analysis pipeline that relies on a ...after award). • Payload: high-resolution image files plus a CSV/JSON map linking each file to product ID, title, price, and category text that you extract during the same run. • Scale: thousands of products per crawl; a resumable approach is essential so partial failures don’t force a full restart. • Frequency: I’ll trigger the crawl weekly, so reusable code is a must. I’m happy with Python—Scrapy, Selenium, Playwright, or a headless solution of your choice—as long as it respects the site’s anti-bot measures and keeps requests polite. Please include a brief outline of how you’ll handle pagination, lazy-loaded images, and rate limiting. Let me know your proposed stac...

    $140 Average bid
    $140 Giá đặt trung bình
    159 lượt đặt giá
    MassageRepublic Phone Data Extraction
    5 ngày left
    Đã xác thực

    ...Excel workbook. Please crawl the entire site, not just a few sections, and return each number alongside the key profile details that make the data usable at a glance—name, profile URL, and any other easily captured identifiers shown next to the number. A clean .xlsx with one row per profile, no duplicates, and clearly labelled columns is the only deliverable I’m expecting. If you prefer Python, Scrapy, Selenium, Beautiful Soup or a comparable stack, go ahead; I’m interested in results, not the specific toolset, as long as the script can be rerun later should the site content change. Before delivery, double-check that: • every row contains a valid phone number and url • no pages on the site were skipped • the sheet opens flawlessly in the late...

    $134 Average bid
    $134 Giá đặt trung bình
    65 lượt đặt giá
    Iberia Avios Flight API Backend -- 2
    5 ngày left
    Đã xác thực

    ...issue and validate JWT tokens for every request beyond the public health-check route. Token refresh, revocation, and a simple role model (“user” vs. “admin”) should be built in from the start. Flight data extraction I do not have official Iberia developer access, so we will need to pull the data ourselves. I’m open to whichever tooling you are most comfortable with — BeautifulSoup, Selenium, Scrapy, or a hybrid approach — as long as the final solution is headless, resilient to minor layout changes, and respectful of Iberia’s rate limits. Only flights that are bookable with Avios need to be captured; no hotel or car-rental data is required. Deliverables • Clean, modular Python code (FastAPI or Flask preferred, but I’...

    $144 Average bid
    $144 Giá đặt trung bình
    53 lượt đặt giá
    Iberia Avios Flight API Backend
    5 ngày left
    Đã xác thực

    ...issue and validate JWT tokens for every request beyond the public health-check route. Token refresh, revocation, and a simple role model (“user” vs. “admin”) should be built in from the start. Flight data extraction I do not have official Iberia developer access, so we will need to pull the data ourselves. I’m open to whichever tooling you are most comfortable with — BeautifulSoup, Selenium, Scrapy, or a hybrid approach — as long as the final solution is headless, resilient to minor layout changes, and respectful of Iberia’s rate limits. Only flights that are bookable with Avios need to be captured; no hotel or car-rental data is required. Deliverables • Clean, modular Python code (FastAPI or Flask preferred, but I’...

    $142 Average bid
    $142 Giá đặt trung bình
    76 lượt đặt giá
    Large-Scale E-commerce Data Scraping
    4 ngày left
    Đã xác thực

    I need a senior-level specialist to harvest product data from several e-commerce sites and deliver it in a single, well-structured CSV file. The task demands production-ready techniques—think Scrapy spiders hardened with rotating proxies, Selenium or Playwright for dynamic content, and solid anti-bot countermeasures. The information I’m after is very specific: product names, prices, pictures, and SKU. Nothing less, nothing more. Your solution must run reliably at scale, cope with frequent layout changes, and leave no trace that could trigger blocks. Python is the preferred stack, but if you have a proven alternative that meets the same bar, I’m open to hearing it. To be considered, include in your proposal: • At least one example of a comparable e-commerce...

    $465 Average bid
    $465 Giá đặt trung bình
    144 lượt đặt giá
    Webscrape Florida Health Education Contacts
    4 ngày left
    Đã xác thực

    I’m expanding our Florida outreach list and need a reliable web-scraped data set of school, college, and university administrators who oversee Nursing or other Healthc...address • State (always Florida) Format & delivery – Send the file in Excel (.xlsx). – First progress drop: within 5 days so I can spot-check. – Final, fully cleaned file: no later than 10 calendar days from project start. Quality matters because this list feeds straight into our marketing campaigns. I’ll spot-verify a sample for accuracy. Feel free to leverage Python, BeautifulSoup, Scrapy, or similar tooling—whatever lets you move quickly while respecting each site’s robots.txt. Let me know if anything needs clarifying before you begin, otherwise I&rsqu...

    $32 / hr Average bid
    $32 / hr Giá đặt trung bình
    202 lượt đặt giá
    Expert Python Data Scraper
    4 ngày left
    Đã xác thực

    ...need a seasoned Python developer to build a robust scraper that collects the required data and writes it straight to JSON—no additional cleaning or processing necessary. Once we begin I’ll provide the target URL(s) and any access details; for now, assume a standard public site with pagination and occasional anti-bot checks. Core expectations • Written in Python 3 using requests/BeautifulSoup or Scrapy; resort to Selenium only if there’s no lighter workaround. • Handles pagination, retries, and polite delays gracefully so the run can complete unattended. • Config file or clear constants for headers, cookies, and start URLs, letting me tweak targets without editing core logic. • Produces a single JSON file (or one file per page if that...

    $143 Average bid
    $143 Giá đặt trung bình
    160 lượt đặt giá

    ...build a reliable, well-structured lead list and I already know exactly what it should contain. The task is to extract contact information—email addresses, phone numbers and full mailing addresses—from three sources: company and organisation websites, their public social-media profiles, and well-known online directories. I expect the data to be gathered with a solid scraping workflow (Python, Scrapy, BeautifulSoup, Selenium or an equivalent stack is fine) and then verified so that bounced emails and dead numbers are kept to an absolute minimum. Deliverables • One CSV or Excel file with separate columns for name, company, job title, email, phone, street address, city, state, ZIP/postcode, country, source URL and date collected. • No duplicates; every...

    $2 / hr Average bid
    $2 / hr Giá đặt trung bình
    13 lượt đặt giá

    Preciso de um especialista em web scraping para coletar in...informações específicas consultando CPF em um site. Campos necessários: - Nome completo - Data de nascimento - Endereço - E-mails - Telefones - Veículo (marca/modelo) - Ano de fabricação - Ocupação - Faixa salarial - Provável empresa Habilidades e Experiência Ideais: - Experiência comprovada em web scraping - Proficiência em ferramentas como Python, Beautiful Soup, Scrapy, ou similares - Capacidade de trabalhar com estruturas de dados complexas - Atenção aos detalhes e precisão na extração de dados - Familiaridade com questões legais e éticas de ...

    $119 Average bid
    $119 Giá đặt trung bình
    23 lượt đặt giá
    E-commerce Text & Image Scraper Needed
    4 ngày left
    Đã xác thực

    I have a data-analysis pipeline that relies on a ...after award). • Payload: high-resolution image files plus a CSV/JSON map linking each file to product ID, title, price, and category text that you extract during the same run. • Scale: thousands of products per crawl; a resumable approach is essential so partial failures don’t force a full restart. • Frequency: I’ll trigger the crawl weekly, so reusable code is a must. I’m happy with Python—Scrapy, Selenium, Playwright, or a headless solution of your choice—as long as it respects the site’s anti-bot measures and keeps requests polite. Please include a brief outline of how you’ll handle pagination, lazy-loaded images, and rate limiting. Let me know your proposed stac...

    $29 Average bid
    $29 Giá đặt trung bình
    59 lượt đặt giá

    ...precise location coordinates directly from Google Maps. The second will crawl a set of websites I will supply and pull out product information, on-page contact details, and any user-generated content that appears alongside those products. Please structure every field into one tidy CSV per source so I can plug the results straight into my BI dashboards. I am comfortable if you lean on Python, Scrapy, BeautifulSoup, Selenium, or similar tools, provided the script is well-commented and can run headless behind rotating proxies without tripping rate limits. Deliverables: • 4 working scripts (Maps + websites) with clear setup instructions • Sample output files proving all requested fields are captured correctly • Output data must have City Name > (Excel fi...

    $61 Average bid
    $61 Giá đặt trung bình
    17 lượt đặt giá

    I need clean, structured prod...stay lean and purpose-built. I already have a clear idea of the attributes I want captured (title, price, SKU, description, availability, image URL). Once we agree on the target sites, you can build a scraper, run it, and hand back the CSV along with the script or notebook so I can reproduce the results later if needed. Please let me know: • Which language or framework you plan to use (Python, Scrapy, BeautifulSoup, Selenium, Playwright, etc.). • How you’ll handle pagination, anti-bot measures, and site structure changes. • An estimated turnaround and any milestones you suggest. Accuracy, deduplication, and clarity in the final CSV will be the acceptance criteria. If this sounds like your bread-and-butter, I’m ready...

    $96 Average bid
    $96 Giá đặt trung bình
    6 lượt đặt giá

    I need a developer to collect data from multip...repeatable solution (script or small app) that I can run on demand Basic documentation: how to run it, how to adjust settings, where outputs go Quality requirements Reliable scraping with error handling and retries Respectful request rate / throttling to avoid overloading sites Clear logging (success/fail, pages processed) Ability to adapt if page structure changes Experience with Python (Scrapy/BeautifulSoup/Selenium/Playwright) or Node.js Proxy / rotating user-agents experience (only if needed) Scheduling/automation (cron, Docker, or cloud run) Deliverables Working scraper + instructions Sample output file(s) Final dataset from agreed sources (initial run) To apply, please include Examples of similar scraping work you...

    $142 Average bid
    $142 Giá đặt trung bình
    174 lượt đặt giá
    Scrape Zillow Agent Data
    1 ngày left
    Đã xác thực

    I have an urgent need for a clean, well-structured dataset containing the listing agent’s first name, last name, mailing address, and phone number for well over 500 active Zillow listings. Speed is critical, but accuracy matters just as much; the final file should be ready for immediate import into my CRM. You are free to use whichever stack you prefer—Python with BeautifulSoup or Scrapy, Selenium, residential proxies, even the unofficial Zillow API—so long as rate-limits are respected and the data is complete. I don’t need property details or price history; the focus is strictly on the agent contact fields. Deliverables • CSV or XLSX with a separate column for each required field • A short read-me explaining the script or method so I can reru...

    $24 Average bid
    $24 Giá đặt trung bình
    69 lượt đặt giá

    ...visible textual content I specify, and returning it in a machine-readable format. I’m flexible on the final file type; CSV, Excel, or JSON all work as long as the fields are clearly labeled and easy for me to manipulate later. A small sample first will help confirm we’re on the same page before you run the full extraction. Please use whatever stack you prefer—Python with BeautifulSoup or Scrapy, JavaScript with Puppeteer, or a tool that suits the task best—just be sure to respect and provide the code so I can rerun the process when the site updates. Deliverables: • Re-usable script or notebook with clear comments • Complete dataset containing all extracted text, delivered in my chosen format • Brief read-me explaining setup, ...

    $66 Average bid
    $66 Giá đặt trung bình
    13 lượt đặt giá

    ...public websites * Parse HTML, JSON, CSV, and PDF files * Clean and normalize messy real-world data * Write clear, maintainable utility scripts * Deliver working code (not just prototypes) --- ### Required Skills * Strong Python fundamentals * Real experience with web scraping * Data parsing and data cleaning * Comfortable working independently and async --- ### Nice to Have * BeautifulSoup, Scrapy, Playwright, or Selenium * pandas / numpy * Experience scraping government or legacy websites * Experience handling PDFs (text extraction, OCR) --- ### How We Evaluate * This role includes a **paid trial task (1–3 days)** * We care about **output and correctness**, not resumes * Clean, working code matters more than clever abstractions --- ### Important * Please includ...

    $11 / hr Average bid
    $11 / hr Giá đặt trung bình
    104 lượt đặt giá
    Cross-Site Job Scraper Build
    Đã kết thúc left

    I need a reliable scraping solution that collects every open position from ten job-board and company-career sites in one specific country. I already have the full URL list and will share it right after kickoff. Scope • Write and schedule a separate scr...postings, basic keyword search in the frontend, and an export button for CSV or Excel, but these are optional. Deliverables 1. Source code for all scrapers and the data pipeline. 2. Database schema or JSON structure. 3. Front-end webview ready to run locally. 4. README covering installation, configuration, and update routine. I’m happy to discuss your preferred stack—Python with BeautifulSoup/Scrapy or Node with Cheerio/Puppeteer are both fine—as long as the final result is stable and well documented....

    $98 Average bid
    $98 Giá đặt trung bình
    63 lượt đặt giá
    Automated Job & CV Scraper
    Đã kết thúc left

    ...LinkedIn, Indeed and HelloWork. • Captures, at minimum, the job title, full description, company name and location. • Stores everything in a structured database I can easily query or export. • Retrieves complete CVs from LinkedIn and, when possible, other social platforms, then links each profile to the same database scheme. Feel free to choose the most stable stack you trust—Python with Scrapy or Selenium, Node with Puppeteer, direct GraphQL or REST endpoints, etc.—as long as it runs unattended, copes gracefully with rate limits / captchas, and offers a simple way for me to schedule or trigger updates. Acceptance will be based on: 1. A repeatable script or service I can host (Docker image or cloud function are fine). 2. A concise setup guid...

    $1279 Average bid
    $1279 Giá đặt trung bình
    152 lượt đặt giá

    ...build a reliable, well-structured lead list and I already know exactly what it should contain. The task is to extract contact information—email addresses, phone numbers and full mailing addresses—from three sources: company and organisation websites, their public social-media profiles, and well-known online directories. I expect the data to be gathered with a solid scraping workflow (Python, Scrapy, BeautifulSoup, Selenium or an equivalent stack is fine) and then verified so that bounced emails and dead numbers are kept to an absolute minimum. Deliverables • One CSV or Excel file with separate columns for name, company, job title, email, phone, street address, city, state, ZIP/postcode, country, source URL and date collected. • No duplicates; every...

    $2 / hr Average bid
    $2 / hr Giá đặt trung bình
    19 lượt đặt giá
    Maritime Job Board Scraping
    Đã kết thúc left

    ...following fields: • Job title and full description • Company name plus location (city, state/region, country) • Employment type and any salary or rate information available Your scraper should store results in a clean, normalized CSV (or optionally a relational DB if you prefer) and be easy for me to rerun on demand. I’m comfortable with Python, so a script leveraging requests/BeautifulSoup, Scrapy, or Playwright makes sense, but if another stack delivers better reliability feel free to suggest it. Key expectations • Site recommendations presented first for my approval before you start coding • Respect , add configurable request delays, and build basic anti-block measures (user-agent rotation, retries) • Clear documentation ex...

    $275 Average bid
    $275 Giá đặt trung bình
    63 lượt đặt giá
    Scrape 15K Australian Records
    Đã kết thúc left

    I ...first line of address, state, city, postcode • Format: every column saved as plain text (no numeric or date formatting) Delivery schedule • First 5,000 fully cleaned rows required within the first 6 hours • Remainder on a rolling basis until the full 15,000 are complete I will supply a surname list to guide the searches. A straightforward Python (requests / BeautifulSoup or Selenium) or Scrapy workflow is fine as long as the final output arrives in a single Excel file (.xlsx) that opens error-free in Microsoft Excel. Accuracy matters more than speed—random spot checks will be run. Any duplicates, blanks, or malformed addresses will be sent back for correction. Once the first 5,000 pass review, I’ll green-light the rest of the scrape so we ca...

    $23 Average bid
    $23 Giá đặt trung bình
    47 lượt đặt giá
    Data Scraping Specialist
    Đã kết thúc left

    Description: - We are looking for an experienced Data Scraping / Web Scraping expert. - We will share the industry name, and the...- Suggest suitable websites/sources to scrape - Suggest countries/regions that can be covered - Share estimated data volume & approach - After approval, the freelancer will scrape and deliver clean, structured data. Data Required (example): - Company name - Location - Contact details (email/phone/website – if available) Requirements: - Proven experience in data scraping - Knowledge of Python, Scrapy, Selenium, APIs, etc. - Ability to scrape multi-country data (based on feasibility) Deliverables: - Data in Excel / CSV / Google Sheets - Basic info of sources used To Apply, share: - Similar scraping work - Tools you use - Your approach after ...

    $67 Average bid
    $67 Giá đặt trung bình
    16 lượt đặt giá

    I need all publicly available customer-facing email addresses extracted from a list of e-commerce websites that I will supply once the project begins. Please crawl only the domains I provide, respect where possible, and avoid triggering any rate limits or security blocks—rotating proxies or headless browsing with tools such as Python, Scrapy, BeautifulSoup, Selenium, or similar is fine as long as the result is reliable. Deliverable • One clean, de-duplicated CSV file containing the harvested email addresses, ready for direct import into my CRM. Acceptance criteria • Every email must originate from the target e-commerce domains. • No duplicates, placeholders, or obviously invalid addresses. • File encodes as UTF-8 and opens without warnings in Exc...

    $16 Average bid
    $16 Giá đặt trung bình
    45 lượt đặt giá

    ...associated images—then converts and calculates the raw values exactly as we define before pushing them straight into WooCommerce. My customers must only ever see the WooCommerce front end, so the sync has to feel native and instant. The portal changes frequently, so please code the extractor so that selectors and credentials can be updated without touching the core logic. I am open to Python (Scrapy, BeautifulSoup, Selenium), PHP or Node as long as the finished solution talks cleanly to the WooCommerce REST API and leaves no manual steps. Deliverables • Scraper that logs in and captures product details, stock, prices and images in real time or on a schedule we agree on • Conversion layer that performs the unit/price calculations before data enters WooCommerce ...

    $1305 Average bid
    $1305 Giá đặt trung bình
    232 lượt đặt giá

    I need a developer to collect data from multip...repeatable solution (script or small app) that I can run on demand Basic documentation: how to run it, how to adjust settings, where outputs go Quality requirements Reliable scraping with error handling and retries Respectful request rate / throttling to avoid overloading sites Clear logging (success/fail, pages processed) Ability to adapt if page structure changes Experience with Python (Scrapy/BeautifulSoup/Selenium/Playwright) or Node.js Proxy / rotating user-agents experience (only if needed) Scheduling/automation (cron, Docker, or cloud run) Deliverables Working scraper + instructions Sample output file(s) Final dataset from agreed sources (initial run) To apply, please include Examples of similar scraping work you...

    $166 Average bid
    $166 Giá đặt trung bình
    166 lượt đặt giá
    UK Vinyl Shops Data Scrape
    Đã kết thúc left

    ...directory • → Record Stores tab • → search term “record shops” For each shop, capture these fields exactly: – Business name – Email address – Phone number All three data sets should be merged into one unified file; no source labels or separate sheets are required. Please scrape or crawl the sites directly—automated methods such as Python, BeautifulSoup, Scrapy, or similar tools are fine so long as the final output arrives de-duplicated and ready to open in any spreadsheet application that supports CSV. Accuracy matters more than speed, so feel free to build in basic checks (e.g., email format validation, obvious duplicate removal). Once complete, send the single CSV plus a brief note on how you gather...

    $134 Average bid
    $134 Giá đặt trung bình
    134 lượt đặt giá
    Website Product Data Scraping
    Đã kết thúc left

    I need an automated scraping solution that reliably collects product data from targeted websites and delivers it in a clean, structured file I can plug straight into my workflow. You’re free to use Python (BeautifulSoup, Scrapy, Selenium, Playwright) or a simple cloud instance, and the output lands in CSV or JSON.

    $88 Average bid
    $88 Giá đặt trung bình
    91 lượt đặt giá
    NZ Blinds & Awnings Data Scrape
    Đã kết thúc left

    ...directories you can legally access. For each product, capture at minimum the product name, its full description or spec blurb, and the page URL so I can verify the entry later. If additional details such as model numbers or imagery links are readily available while you scrape, feel free to include them as extra columns, but the name and description are non-negotiable. A Python-based workflow using Scrapy, BeautifulSoup, or a comparable toolset is fine by me so long as the end result arrives as a single Excel workbook, neatly separated by sheet or filterable fields. Please ensure your methods comply with site terms of service and New Zealand data-privacy requirements. I will consider the job complete when: • Every known NZ supplier type (storefront, manufacturer, d...

    $121 Average bid
    $121 Giá đặt trung bình
    52 lượt đặt giá

    I need the brochure catalogue of a JavaScript-heavy e-commerce site captured and delivered as a clean CSV. My focus is on accurate prices and every available variant, pulled from each category the site offers. Python is the language of choice and I’m flexible on tooling—Scrapy + Playwright, straight Playwright, Selenium, or another robust approach—provided the code is modular, well-documented, and easy for me to rerun when the store layout shifts. If you already have proxy rotation or rate-limit handling baked into your pipeline, that will be an advantage. What has to happen • Crawl through every category filter so no product slips through the cracks. • Render dynamic content fully to capture price and variant data, along with URL, SKU, net price an...

    $99 Average bid
    $99 Giá đặt trung bình
    30 lượt đặt giá

    ...SOMEONE WHOS PROFILE PICTURE MATCHES WHO THEY ACTUALLY ARE. A VIDEO CALL TO DISCUSS WILL BE NEEDED TO DISCUSS DETAILS PRIOR TO HIRING. **THIS IS A LONG TERM PROJECT THAT WILL BE HEAVILY FRONT LOADED HOPEFULLY MAKING YOUR LIFE EASIER OVER THE DURATION OF THE PROJECT. WITH THIS I PLAN ON PROVIDING A MONTHLY STIPEND THAT WILL EVEN OUT PAY OVER THAT TERM. Your toolkit is up to you, but Python with Scrapy or Selenium, a tidy Pandas workflow, and solid MySQL / PostgreSQL skills fit naturally with what we already run on AWS. Clean code, deduping, error logging, and clear documentation are non-negotiable; everything has to slip straight into the pipelines that feed our mobile app. ** I HAVE EXISTING SCRIPTS FROM THE PREVIOUS FREELANCER THAT WILL HELP YOU EXPEDITE THE WORK YOU DO. De...

    $487 Average bid
    $487 Giá đặt trung bình
    206 lượt đặt giá
    NRL Data Scraping Pipeline
    Đã kết thúc left

    ...then continues to run weekly. Scrapy is strongly preferred because I want to spin the whole thing up inside a GitHub Codespaces Docker container and keep deployment friction to a minimum. Scope of data • Players page first: full player-level stats are the initial priority. • Draw page next: I specifically need match dates, kick-off times and the team & player stats embedded in each fixture. • Ladder and any other public stats pages can follow once the core player and draw feeds are solid. Data model & output All raw HTML should be parsed into tidy, flat tables (CSV or Parquet). Please create sensible surrogate keys so that tables for players, matches, teams and ladder positions can be joined cleanly in a downstream warehouse. Deliverables &bull...

    $113 Average bid
    $113 Giá đặt trung bình
    67 lượt đặt giá
    Python Scrape Horse Racing Data
    Đã kết thúc left

    ...horse’s career starts, total earnings and current trainer. What I’m missing is the data itself. Two different public websites publish this information; I’m happy for you to pull from whichever source (or a mix of both) gives the most complete and accurate results. Your task is to automate the extraction—ideally with a clean, well-commented Python script that uses requests/BeautifulSoup, Selenium, Scrapy or any other library you prefer—and then populate my spreadsheet with the fetched records. When the job is done I need: • the updated Excel file, fully filled out and spot-checked for accuracy • the script and quick usage notes so I can rerun it later if the sites update That’s it. If you can turn this around quickly and the n...

    $17 Average bid
    $17 Giá đặt trung bình
    28 lượt đặt giá
    One-Time Product Data Scrape
    Đã kết thúc left

    I need a single, clean pull of all product information from an e-commerce site. The scope is limited to product names, full descriptions, and the corresponding images; no price or contact data is required. A fresh scrape is needed only once, so scheduling or cron work is unnecessary. You may use Python with BeautifulSoup, Scrapy, Selenium, or any comparable stack—what matters is that the final dataset is complete and easy for me to consume. Deliverables • CSV or XLSX listing every product with its name and full description • A folder (or ZIP) containing every product image, with filenames mapped to the rows in the data file • A brief README outlining the scrape process and any setup steps I would need to reproduce it locally Acceptance criteria ...

    $31 Average bid
    $31 Giá đặt trung bình
    91 lượt đặt giá

    ... The page also includes the serving city, state, and ZIP code. I need all these elements pulled out, cleaned (no stray markup or line breaks), and placed into a single, well-structured Excel workbook that’s ready for immediate analysis and this data is used to create a dashboard analyzing negative and positive reviews. You are free to choose the scraping approach—Python with BeautifulSoup or Scrapy, a headless browser like Playwright, or any tool you trust—as long as the end result is an accurate Excel file with clear columns for Review_Text, Review_Date (YYYY-MM-DD), and Zip_Code. Deliverable: • One .xlsx file containing every review currently live on the site, fully deduplicated and formatted. * One dashboard for easy analysis. *sentiment analysis...

    $34 Average bid
    $34 Giá đặt trung bình
    48 lượt đặt giá
    Simple Google Scraper Tool -- 2
    Đã kết thúc left

    ...social-media link that appears on the page. • Storage: write everything to our MySQL database. • Data quality: run only basic validation—remove obvious duplicates, trim whitespace, keep field formats consistent. • Export: one-click download to XLS or CSV. • Deployment: install on our Linux server and document the steps so I can redeploy if needed. Preferred stack Python with BeautifulSoup, Scrapy, or Selenium is fine, as long as it’s cleanly modular. Use pandas or a similar library for the export. Deliverables 1. Source code and (or equivalent). 2. SQL script to create/update the necessary tables. 3. README covering setup, cron scheduling, and how to adjust throttling. 4. Working front-end files matching the mock-up. Keep it s...

    $102 Average bid
    $102 Giá đặt trung bình
    72 lượt đặt giá

    ...compact, AI-assisted web-scraping module that plugs straight into my existing stack and pulls live product information—specifically price and availability—from several retailer websites. The scraper should detect layout changes automatically, respect where permitted, and expose a simple JSON or CSV feed so I can drop the data into my pricing engine without extra parsing. Python with Scrapy, Playwright, or a similar headless-browser approach is ideal, but I’m open to alternative tools if you can show they handle anti-bot measures reliably. Deliverables • Clean, well-commented source code for the scraper • A lightweight API or CLI wrapper that returns price and availability in structured form • Basic setup guide and a quick demo run showi...

    $19 / hr Average bid
    $19 / hr Giá đặt trung bình
    78 lượt đặt giá
    Google Maps Data Extraction
    Đã kết thúc left

    ...extract phone number from all those businesses • Business name • Full street address (including city, state and ZIP) • Primary phone number The scrape must span all states and territories so the end result reflects a true national snapshot. An Excel or CSV file is fine for delivery; please keep one row per location and label columns clearly. If you intend to automate with Python, Selenium, Scrapy, the Google Maps API or a similar approach, note that accuracy and duplicate handling matter more to me than the tool you choose. Captcha-handling and rate-limiting should be built in so the run completes cleanly without blocks. Acceptance criteria 1. At least 95 % of returned rows contain the four core fields above. 2. No obvious duplicates (same name, phone...

    $16 Average bid
    $16 Giá đặt trung bình
    16 lượt đặt giá
    GeM Portal Data Scraper
    Đã kết thúc left

    ...Deliverables The Executable/Script: A Python script (preferred) or a standalone desktop tool. Excel Output: A clean .xlsx file with headers for all the data points mentioned above. Documentation: A brief "How-to" guide on running the tool and updating search parameters (e.g., how to change the Category URL or Keywords). 5. Preferred Freelancer Skills Expertise in Python (Selenium, BeautifulSoup, or Scrapy). Experience with E-commerce scraping and bypassing bot detection. Previous experience with Indian Government Portals (GeM, CPP, or Tenders) is a major plus. Ability to deliver a tool that does not get our IP blocked (rate-limiting features). Important Note on Compliance Since GeM is a government portal, ensure your developer follows ethical scraping practices (...

    $66 Average bid
    $66 Giá đặt trung bình
    17 lượt đặt giá
    Website Text Scraping to CSV
    Đã kết thúc left

    ...of websites from which I need specific text captured—product descriptions, article titles, meta-data, and similar fields—and placed into a clean, well-structured CSV file. Whenever automated collection isn’t possible, the same information will have to be entered manually, so accuracy in both scraping and data entry is essential. I don’t mind which stack you prefer—Python with BeautifulSoup or Scrapy, browser automation with Selenium, or a different approach—so long as the final CSV follows the column order I provide and you can replicate the steps for future runs. A short note explaining your method and any scripts you write will be part of the hand-off. Turnaround is flexible as long as we agree on it before each batch, and there will be...

    $402 Average bid
    $402 Giá đặt trung bình
    199 lượt đặt giá

    ...business you can find in Mumbai, Pune, Nashik, and Surat. Your sources should be Google Maps plus the listings on Justdial and Yellow Pages. For each company, capture exactly three fields—business name, full address, and phone number—keeping the phone exactly “as listed” on the source page (no re-formatting into international or local styles). I do not mind which stack you prefer—Python with Scrapy or BeautifulSoup, browser automation with Selenium, or a different approach—so long as the final spreadsheet is comprehensive, de-duplicated across the three sources, and ready for me to open in Excel without further cleaning. Before delivery, please spot-check for obvious errors (e.g., mismatched phone numbers, partial addresses) and remove...

    $17 Average bid
    $17 Giá đặt trung bình
    6 lượt đặt giá
    Real-Time Scraping & AI Agent
    Đã kết thúc left

    ...into an AI agent, and immediately turns the results into usable insights and automated replies. Here’s the flow I’m aiming for: the scraper pulls fresh information from the sources I’ll share with you, filters and structures it on the fly, then hands it to the agent. The agent should run on-the-spot data analysis and generate context-aware responses without human intervention. Think Python with Scrapy or Selenium for the collection layer, fast in-memory handling (Redis, Kafka, or similar) to keep everything real-time, and an LLM framework such as LangChain or a custom GPT wrapper for the reasoning layer. If you have a better tech stack in mind, I’m open to it as long as it remains fast and easy to scale. Deliverables I need to sign off on: • A wor...

    $75 Average bid
    $75 Giá đặt trung bình
    63 lượt đặt giá
    Amazon Book Data Scraper
    Đã kết thúc left

    ...product fields (title, price, images, description, etc.) Requirements: Proven experience scraping Amazon & similar websites. Ability to bypass anti-bot systems safely. High accuracy & clean data. Automation for bulk scraping preferred. Experience with WordPress product import is a plus. Final backup file in Excel/CSV also required. Please share: Similar past projects Your approach & tools (Python, Scrapy, API, etc.) Timeline Total cost Looking forward to working together....

    $85 Average bid
    $85 Giá đặt trung bình
    43 lượt đặt giá
    Daily car.gr Data Scraper
    Đã kết thúc left

    I need a reliable, fully-automated pipeline that pulls fresh information from every listing on once per day, stores it in a query-friendly repository, and highlights what changed since the previous run. Core requirements • Daily...cron-ready execution script and README. 4. Sample CSV/JSON export demonstrating daily deltas and summary stats. Acceptance criteria • A 24-hour test run proves 100 % listing coverage with no duplicate rows. • Second run correctly labels adds/removals and updates analytic tables. • Installation from a clean server takes under 15 minutes using only the supplied documentation. If you’ve built Scrapy spiders, headless-browser collectors, or data pipelines on AWS/Lambda/GCP before, I’m keen to see your approach an...

    $197 Average bid
    $197 Giá đặt trung bình
    72 lượt đặt giá

    Các bài viết top scrapy cộng đồng