PHP scraper for two sites

Cancelled. Posted on Oct 23, 2010. Paid on delivery.

I need a robust scraper written in PHP that will scrape book data from the following two example sites: [url removed, login to view] [url removed, login to view]

The scripts must be dynamic and able to scrape all of the different universities. The URLs of the universities are already available in a MySQL database. The scraper should collect the following fields and insert them into a MySQL database table:

Term, Department, Course, Section, Title, Author, Publisher, ISBN, NewPrice, UsedPrice

After following the links above, you will see that manually browsing to all of the possible books requires using the pull-down menus. The script must also be able to stop and restart where it left off, because there is a lot of data and server errors may occur. A scheduling feature may be needed to help with this. Please feel free to ask any questions before bidding.
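The stop-and-restart requirement could be met by recording each unit of work in a progress table and marking it done only after its books are stored. The following is a minimal sketch of that idea, written in Python with an in-memory SQLite database purely for illustration (the project itself calls for PHP and MySQL). The table `scrape_progress`, the `unit` key (for example a term/department/course/section path) and the `scrape_unit` callback are all hypothetical names, not part of the existing script.

```python
import sqlite3

def init_db(conn):
    # One row per unit of work; "done" flips to 1 only after a successful scrape.
    conn.execute(
        """CREATE TABLE IF NOT EXISTS scrape_progress (
               university_id INTEGER NOT NULL,
               unit TEXT NOT NULL,
               done INTEGER NOT NULL DEFAULT 0,
               PRIMARY KEY (university_id, unit)
           )"""
    )

def enqueue(conn, university_id, units):
    # INSERT OR IGNORE keeps existing rows, so re-enqueueing never resets progress.
    conn.executemany(
        "INSERT OR IGNORE INTO scrape_progress (university_id, unit) VALUES (?, ?)",
        [(university_id, u) for u in units],
    )
    conn.commit()

def pending(conn):
    # Everything not yet marked done, in a stable order.
    rows = conn.execute(
        "SELECT university_id, unit FROM scrape_progress WHERE done = 0 "
        "ORDER BY university_id, unit"
    )
    return rows.fetchall()

def run(conn, scrape_unit):
    """Process pending units, committing after each one.

    If scrape_unit raises (e.g. on a server error), the exception propagates
    and the failed unit stays pending for the next run.
    """
    processed = []
    for university_id, unit in pending(conn):
        scrape_unit(university_id, unit)
        conn.execute(
            "UPDATE scrape_progress SET done = 1 WHERE university_id = ? AND unit = ?",
            (university_id, unit),
        )
        conn.commit()
        processed.append((university_id, unit))
    return processed
```

Calling `run` again after a crash picks up only the rows still marked pending, so a cron job that simply re-invokes the script would resume where it left off.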

## Deliverables

1) All deliverables will be considered "work made for hire" under U.S. Copyright law. Employer will receive exclusive and complete copyrights to all work purchased. (No GPL, GNU, 3rd party components, etc. unless all copyright ramifications are explained AND AGREED TO by the employer on the site per the worker's Worker Legal Agreement).

2) Complete and fully-functional working program(s) in executable form as well as complete source code of all work done.

3) Deliverables must be in ready-to-run condition, as follows (depending on the nature of the deliverables):

a) For web sites or other server-side deliverables intended to only ever exist in one place in the Employer's environment--Deliverables must be installed by the Worker in ready-to-run condition in the Employer's environment.

b) For all others including desktop software or software the employer intends to distribute: A software installation package that will install the software in ready-to-run condition on the platform(s) specified in this project.

* * *

This broadcast message was sent to all bidders on Thursday Oct 28, 2010 1:34:20 AM:

There has been a fundamental change in this project, and if you are interested, I would like you to reconsider and hopefully lower your bid accordingly. I am about to attach a zip file to the project containing an existing script that previously worked to scrape the information from the sites. The sites have not changed much since it last worked. You would only have to edit this script to fix it, add a few features so that it handles restarts and crashes better, and add a scheduling feature for choosing which of the universities are scraped first. I will also include a SQL file of the universities database table and the book table. Please review the code and advise. Thank you.
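One possible reading of "pick which of the universities are scraped first" is a numeric priority stored alongside each university. The sketch below shows that ordering in Python for illustration only (the project calls for PHP); the `priority` field is an assumption and may not exist in the supplied universities table.

```python
def scrape_order(universities):
    """Return university ids, lowest priority number first, ties broken by id."""
    ordered = sorted(universities, key=lambda u: (u["priority"], u["id"]))
    return [u["id"] for u in ordered]

# Hypothetical rows, shaped like records read from the universities table.
rows = [
    {"id": 3, "priority": 2},
    {"id": 1, "priority": 5},
    {"id": 2, "priority": 2},
]
print(scrape_order(rows))  # [2, 3, 1]
```

In MySQL the same ordering would normally be expressed with an `ORDER BY` clause on the query that loads the universities.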

## Platform

Linux, LiteSpeed, Apache, PHP 5, MySQL 4

Amazon Web Services, Engineering, PHP, Project Management, Software Architecture, Software Testing, Web Hosting, Website Management, Website Testing

Project ID: #3811111

About the project

9 proposals. Remote project. Active Oct 28, 2010.

9 freelancers are bidding an average of $289 for this job

marchent

See private message.

$170 USD in 10 days
(168 reviews)
6.3

softwarevamp

See private message.

$212.5 USD in 10 days
(29 reviews)
5.2

Matija

See private message.

$850 USD in 10 days
(40 reviews)
4.5

hassana19

See private message.

$174.25 USD in 10 days
(21 reviews)
4.3

ezgontechno

See private message.

$255 USD in 10 days
(4 reviews)
4.2

ltaylor82

See private message.

$212.5 USD in 10 days
(1 review)
0.7

yayaNasr

See private message.

$85 USD in 10 days
(4 reviews)
0.7

vw7695618vw

See private message.

$425 USD in 10 days
(2 reviews)
0.0

erpoojasharma

See private message.

$212.5 USD in 10 days
(1 review)
0.0