Đang Thực Hiện

125188 Crawler/Price grabber

What we need as a company is a script written in PHP that will work as a spider. We want to be able to provide the script with a url outside ours, then tell the script which links inside that url it should follow, eg. [url removed, login to view] will be the beginning url and the links allowed to be followed should contain "[url removed, login to view]" or "[url removed, login to view]" or something else. (the script should be able to read all links from the page even if there is paging and not all products are shown in a single page.) Then we should tell the script that it should only process a specific type of links, for retrieving information, that contain another text eg."[url removed, login to view]" or something else. Now, in the pages that will be processed we should be able to tell the script what information we need exactly. (We can do this by providing html code that is used by the page) eg. "td tag" Title of product "another tag" So we tell the script to search between the first text and the second one in order to find the title of the products. The information that i need from every page is title of the product, the description, the price, the image of the product and the link to the page itself at the actual shop. We could also tell the script to search only a specific part of the page and not all of it (again by providing it with html code of the page). It could be helpfull to create a subscript that will only fetch the prices, after the first time that the script runs in order to update the prices in our database and not the whole product. The prices collected will be in euros, if this makes any difference, maybe the script can search for the euro sign (not always true though). This will save us bandwith. We want to be able and save each script we create for every shop or multiple scripts for every shop, for later use, along with the subscript for the prices. We also want to be able to tell the script if it should update our database (keep the existing ones and add the new products it finds) or just add all of them to the database. Each script should be attached to a shop and to a category of products eg. Laptops (these are just things that we should be able to control ourselves) So basically for the database part you could only provide us with the table that saves the scripts and the products, the other ones like, categories, shops etc. we can handle, as long as you tell us which variables coming from the script we should use to enter to our database. Also a log file created by the script with the results will be quite usefull. The script should be able to process pages written with various encodings like utf, or iso-8859-7, iso-8859-1 etc. The language that will be used is greek however your script can be written in english. The script should be tested by us in any way the programmer wants before project completion in order to test that it works as described above.

Kĩ năng: Bất kì công việc gì, MySQL, PHP

Xem nhiều hơn: true results, subscript.com, subscript com, something greek, price file, php programmer prices, php id shop, paging tag, paging control, find works programmer, find html programmer, c++ programmer price, category php id, what is a crawler, Price tag, price crawler, price category, iso 7, domain specific language, database crawler, project product description shop, php link crawler script, script multiple pending order, php link grabber mysql, subscript

Về Bên Thuê:
( 0 nhận xét ) Thessaloniki,

ID dự án: #1871354