PHP scraper for two sites
$30-5000 USD
Paid on delivery
I need a robust scraper written in PHP that scrapes book data from the following two example sites: [url removed, login to view] [url removed, login to view]

The scripts must be dynamic and able to scrape all of the different universities. The URLs for the universities are already available in a MySQL database. For each book, the scraper should collect the following fields and insert them into a MySQL database table: Term, Department, Course, Section, Title, Author, Publisher, ISBN, NewPrice, UsedPrice.

As you will see after following the links above, manually browsing to all of the possible books requires using the pull-down menus, so the script has to walk through those menus itself. The script must also be able to stop and restart where it left off, because there is a lot of data and server errors may occur. A scheduling feature may be needed to help with this. Please feel free to ask any questions before bidding.
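To make the stop-and-restart requirement concrete, here is a minimal sketch of one way it could work, not a required design. It assumes each unit of work is one university/term/department/course/section combination, and it records finished units in a JSON checkpoint file. The function names (`loadCheckpoint`, `saveCheckpoint`, `runScrape`) and the file-based storage are assumptions for illustration only; the same state could just as well live in a MySQL table.

```php
<?php
// Hypothetical helpers for illustration; none of these names come from the existing script.

// Read the set of finished work units (key => true) from the checkpoint file.
function loadCheckpoint($path)
{
    if (!is_file($path)) {
        return array();
    }
    $data = json_decode(file_get_contents($path), true);
    return is_array($data) ? $data : array();
}

// Write to a temporary file and rename it, so a crash in the middle of a
// write does not leave a half-written checkpoint behind.
function saveCheckpoint($path, array $done)
{
    $tmp = $path . '.tmp';
    file_put_contents($tmp, json_encode($done));
    rename($tmp, $path);
}

// Process every unit that is not yet in the checkpoint. $scrapeUnit is any
// callable that scrapes one unit and stores its books; if it throws, the
// units finished so far remain recorded and are skipped on the next run.
function runScrape(array $units, $path, $scrapeUnit)
{
    $done = loadCheckpoint($path);
    $processed = array();
    foreach ($units as $unit) {
        $key = implode('|', $unit);
        if (isset($done[$key])) {
            continue; // finished in an earlier run
        }
        call_user_func($scrapeUnit, $unit);
        $done[$key] = true;
        saveCheckpoint($path, $done);
        $processed[] = $key;
    }
    return $processed;
}
```

With this approach, re-running the script after a server error simply calls `runScrape` again with the same list of units; anything already recorded in the checkpoint is skipped and work continues from the first unfinished unit.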
## Deliverables
1) All deliverables will be considered "work made for hire" under U.S. Copyright law. Employer will receive exclusive and complete copyrights to all work purchased. (No GPL, GNU, 3rd party components, etc. unless all copyright ramifications are explained AND AGREED TO by the employer on the site per the worker's Worker Legal Agreement).
2) Complete and fully-functional working program(s) in executable form as well as complete source code of all work done.
3) Deliverables must be in ready-to-run condition, as follows (depending on the nature of the deliverables):
a) For web sites or other server-side deliverables intended to only ever exist in one place in the Employer's environment--Deliverables must be installed by the Worker in ready-to-run condition in the Employer's environment.
b) For all others including desktop software or software the employer intends to distribute: A software installation package that will install the software in ready-to-run condition on the platform(s) specified in this project.
* * *
This broadcast message was sent to all bidders on Thursday Oct 28, 2010 1:34:20 AM:
There has been a fundamental change in this project, and if you are still interested, I would like you to reconsider and hopefully lower your bid accordingly. I am about to attach a zip file to the project that contains an existing script which previously worked for scraping the information from the sites. The sites have not changed much since it last worked. You would only have to edit this script to fix it and add a few features: better handling of restarts and crashes, and a scheduling feature for choosing which of the universities are scraped first. I will also include an SQL file of the universities database table and the book table. Please review the code and advise. Thank you.
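As a rough illustration of the scheduling feature, the sketch below orders university rows so that the most urgent ones are scraped first. The `priority` and `last_scraped` columns are hypothetical and are not taken from the attached universities table; the function name `orderUniversities` is likewise an assumption. It needs PHP 5.3 or later because of the anonymous function.

```php
<?php
// Hypothetical sketch: assumes each university row has a numeric "priority"
// (lower number = scrape sooner) and a "last_scraped" timestamp string in
// "YYYY-MM-DD HH:MM:SS" format. Neither column is confirmed by the project files.
function orderUniversities(array $rows)
{
    usort($rows, function ($a, $b) {
        $pa = (int) $a['priority'];
        $pb = (int) $b['priority'];
        if ($pa !== $pb) {
            return $pa < $pb ? -1 : 1;
        }
        // Same priority: the university scraped longest ago goes first.
        // Timestamps in this format sort chronologically as plain strings.
        return strcmp($a['last_scraped'], $b['last_scraped']);
    });
    return $rows;
}
```

The same ordering could be pushed into the query itself with an `ORDER BY` on the two columns; doing it in PHP is shown here only to keep the example self-contained.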
## Platform
Linux, LiteSpeed, Apache, PHP 5, MySQL 4
Project ID: #3811111