Import.io Scrape/Crawl Contest

  • Status: Closed
  • Prize: $140
  • Entries received: 2
  • Winner: benideas

Contest Brief

We are looking for an [login to view URL] EXPERT who can provide us with a data mining solution (we do not want someone to provide data; we want the solution) in the form of an [login to view URL] desktop application. The website in this contest includes scenarios such as infinite-scroll product pages with autoload and heavy JavaScript. The winner will provide a solution that lets us run the program at any given time and pull all of the required products and information. This will require extensive use of manual regexp overrides, XPath overrides, and URL & Element templates. If you are not intimately familiar with these functions and [login to view URL], do not bother with a submission.

Website to be mined: HTTP://[login to view URL]

Details: Create an extractor and crawler to pull all information on every product on SamsClub.com. Include a connector to search for specific items. The information to scrape includes item titles, item numbers, model numbers, item price, item shipping cost to zip code 90210, item description (HTML and static format), item specifications (HTML and static format), all item links, item categories and their links, and all item images (as links).
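
As a rough illustration of the fields listed above, one record per product might be sketched as follows. This is only a sketch in Python: the class name and every field name are hypothetical, chosen for this example, and are not part of the contest or of any [login to view URL] feature.

```python
from dataclasses import dataclass, field, asdict
from typing import List, Optional


@dataclass
class ProductRecord:
    """One scraped product; all names here are hypothetical."""
    title: str
    item_number: str
    model_number: Optional[str] = None
    price: Optional[str] = None
    # Shipping cost quoted for zip code 90210, as the brief requests.
    shipping_cost_90210: Optional[str] = None
    # Description and specifications in both HTML and static (plain text) form.
    description_html: str = ""
    description_text: str = ""
    specifications_html: str = ""
    specifications_text: str = ""
    # Link collections default to empty lists, one list per record.
    item_links: List[str] = field(default_factory=list)
    category_links: List[str] = field(default_factory=list)
    image_links: List[str] = field(default_factory=list)


# Example usage with placeholder values (not real product data).
example = ProductRecord(title="Example item", item_number="000")
row = asdict(example)  # plain dict, convenient for CSV or JSON export
```

Keeping prices and shipping costs as strings avoids guessing at currency formatting before the real pages have been inspected.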


Requirements:
-You must use the [login to view URL] desktop software and provide login credentials for your entry to be considered. We will not accept entries that use any other software or methods.
-You must provide your [login to view URL] login information so that we can access your extractors and crawlers to judge the entries.
-You must provide the information requested in the contest details outlined above.
-Your crawler/extractors MUST navigate the infinite-scroll autoloading pages.
-Your crawler/extractors must pull the correct information for every item. This will only work with the manual overrides noted above, because the location of the requested information changes between product groups and category groups.
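
The reason manual overrides are needed can be illustrated outside the required tool. The following is a minimal Python sketch, assuming two invented markup variants for the item number; the helper name, the patterns, and the sample HTML are all hypothetical and would have to be adapted to the real pages. It does not replace the [login to view URL] desktop software that entries must use.

```python
import re
from typing import Iterable, Optional


def extract_first(html: str, patterns: Iterable[str]) -> Optional[str]:
    """Try each regex in order and return the first captured group found."""
    for pattern in patterns:
        match = re.search(pattern, html, flags=re.IGNORECASE | re.DOTALL)
        if match:
            return match.group(1).strip()
    return None  # no pattern matched this page layout


# Hypothetical patterns: each one covers a different (invented) page layout.
ITEM_NUMBER_PATTERNS = [
    r'Item\s*#\s*:?\s*(\d+)',
    r'data-item-number="(\d+)"',
]

# Two invented snippets standing in for different product groups.
layout_a = '<span class="num">Item #: 12345</span>'
layout_b = '<div data-item-number="678"></div>'

number_a = extract_first(layout_a, ITEM_NUMBER_PATTERNS)
number_b = extract_first(layout_b, ITEM_NUMBER_PATTERNS)
```

Ordering the patterns from most to least specific keeps a loose pattern from capturing the wrong number on pages where a stricter one would have matched.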

***WINNING*** The winner of this contest will be the one who provides all of the requested information in the quickest, most detailed, and most organized manner. This includes keeping the number of crawlers/extractors as close to 1 as possible while still being able to navigate across all items and categories with a single crawler/extractor. Combination crawler/extractors may be used for crawling links and extracting data from those links, but crawling should start at a high level and navigate accurately without duplicating products.
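
The requirement to start at a high level and avoid duplicate products can be sketched as a breadth-first traversal with a set of already-seen links. This is an assumption-laden Python illustration, not how the [login to view URL] crawler works internally: the function name, the callbacks, and the toy link graph are all hypothetical, and no network access is involved.

```python
from collections import deque
from typing import Callable, Iterable, List


def crawl_unique(
    start: str,
    get_links: Callable[[str], Iterable[str]],
    is_product: Callable[[str], bool],
) -> List[str]:
    """Walk from a top-level page and return each product URL exactly once."""
    seen = {start}
    queue = deque([start])
    products: List[str] = []
    while queue:
        url = queue.popleft()
        if is_product(url):
            products.append(url)  # product pages are leaves in this sketch
            continue
        for link in get_links(url):
            if link not in seen:  # skip links reached via another category
                seen.add(link)
                queue.append(link)
    return products


# Toy link graph: product /p/2 is listed under two categories.
SITE = {
    "/": ["/cat/a", "/cat/b"],
    "/cat/a": ["/p/1", "/p/2"],
    "/cat/b": ["/p/2", "/p/3"],
}

found = crawl_unique(
    "/",
    get_links=lambda url: SITE.get(url, []),
    is_product=lambda url: url.startswith("/p/"),
)
```

Marking a link as seen when it is queued, rather than when it is visited, is what keeps a product listed under several categories from being collected twice.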

Public Clarification Board

  • benideas
    • 3 years ago

    Hi,
    Thanks for choosing us. We need to discuss the work with you further before we start. Please contact us.
    Thanks
    Benideas

  • Harun1986
    • 3 years ago

    Sir, I am ready to show you a sample. Thank you.
