In Progress

129859 Scraper-Data Extractor

Project name: Data Collection Script

Project Description: We want to extract data from various online yellow page companies ([url removed, login to view]). The data will be extracted and sent to a MySQL database. The script will run searches using keywords from a supplied text file that lists cities, ZIP codes, categories, etc.
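As a rough illustration of the keyword-driven search described above, the sketch below expands a keyword file into one search query per combination of values. The file layout (one `field: value, value` line per field), the field names, and the function names `parse_keyword_file` and `build_queries` are assumptions made for this example; the posting does not specify the file format. It is written in Python purely for illustration, whereas the posting asks for PHP or Perl.

```python
from itertools import product


def parse_keyword_file(text):
    """Parse lines such as 'city: Austin, Dallas' into {'city': ['Austin', 'Dallas']}."""
    fields = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or ":" not in line:
            continue  # skip blank or malformed lines
        name, values = line.split(":", 1)
        fields[name.strip().lower()] = [v.strip() for v in values.split(",") if v.strip()]
    return fields


def build_queries(fields):
    """Return one dict per combination of field values (Cartesian product)."""
    names = list(fields)
    return [dict(zip(names, combo)) for combo in product(*(fields[n] for n in names))]


# Hypothetical sample input; real keyword files may look different.
sample = """
city: Austin, Dallas
zip: 73301
category: plumbers, dentists
"""
queries = build_queries(parse_keyword_file(sample))
print(len(queries))
print(queries[0])
```

Each resulting dict could then be mapped onto the form variables of one target site, so the same keyword file can drive several directories.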

We are looking for elegant methods to perform the best query search that will extract 50,000+ entries a [url removed, login to view] want all data to be stored in a MySQL database located on a Linux server.
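A minimal sketch of the extract-and-store step follows, under several assumptions: the listing markup (`div.listing` with `span.name` and `span.phone`) is hypothetical, the class and function names are invented for this example, and SQLite from the Python standard library stands in for MySQL so the example runs without a server. On MySQL the equivalent duplicate-skipping statement would be `INSERT IGNORE` rather than SQLite's `INSERT OR IGNORE`.

```python
import sqlite3
from html.parser import HTMLParser


class ListingParser(HTMLParser):
    """Collect name/phone pairs from <div class="listing"> blocks (hypothetical markup)."""

    def __init__(self):
        super().__init__()
        self.rows = []
        self._field = None  # field currently being read, if any

    def handle_starttag(self, tag, attrs):
        cls = dict(attrs).get("class", "")
        if tag == "div" and cls == "listing":
            self.rows.append({"name": "", "phone": ""})
        elif tag == "span" and cls in ("name", "phone") and self.rows:
            self._field = cls

    def handle_data(self, data):
        if self._field:
            self.rows[-1][self._field] += data.strip()

    def handle_endtag(self, tag):
        if tag == "span":
            self._field = None


def store(rows, conn):
    """Insert rows, skipping duplicates on (name, phone)."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS listings ("
        "name TEXT, phone TEXT, UNIQUE(name, phone))"
    )
    conn.executemany(
        "INSERT OR IGNORE INTO listings (name, phone) VALUES (:name, :phone)", rows
    )
    conn.commit()


# Made-up sample page; a real run would fetch this HTML from the target site.
html = """
<div class="listing"><span class="name">Acme Plumbing</span><span class="phone">555-0100</span></div>
<div class="listing"><span class="name">Best Dental</span><span class="phone">555-0101</span></div>
"""
parser = ListingParser()
parser.feed(html)
conn = sqlite3.connect(":memory:")
store(parser.rows, conn)
store(parser.rows, conn)  # second call adds nothing: duplicates are ignored
count = conn.execute("SELECT COUNT(*) FROM listings").fetchone()[0]
print(count)
```

The unique constraint is one way to keep repeated searches from inflating the record count, which matters if results are judged on quality records rather than raw rows.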

Project Technical Walk-Thru:

•Using PHP or a combination of Perl/PHP/SOAP, the script should be able to scrape/extract data from a web form.

[url removed, login to view] settings (BROWSER BASED HTML GUI)

I.“Session” management – Each query or extraction on a keyword, category, city, ZIP code, etc. is a “Session” with its own NEW database.

[url removed, login to view] Database – Ability to select which database is used for a particular session (add/edit/delete/update/view).

[url removed, login to view] new domains for data extraction and add each as a “Session”. Settings should include form variables and other related options. The following 3 will be added:

[url removed, login to view]://[url removed, login to view]

[url removed, login to view]://[url removed, login to view]

[url removed, login to view]://[url removed, login to view]

[url removed, login to view]://[url removed, login to view]

[url removed, login to view] a “Session” is stopped or running it will have a green/red light indicating status (mini green or red circle).

[url removed, login to view] settings that will enhance the usability of the software.

[url removed, login to view] – DB – The completed DBs, in addition to being available for download, will be sent by FTP to a different IP.

[url removed, login to view] Back Up – After a “Session” has completed, its database will be available to download as a ZIP file. (Note: the ZIP rpm has been installed, but let us know if anything else needs to be installed on the server.)

•Front-End Yellow Pages Directory such as:

[url removed, login to view]://[url removed, login to view]

[url removed, login to view] a simple Open Source PHP script like the one linked above (this is Joomla based).

You can also use a script of your choice provided it is Open-Source.

[url removed, login to view] software will need an automated way to import the completed database.

d.“Sessions”
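The session requirements listed above (one database per session, a status light, and a ZIP backup once a session has completed) can be sketched as follows. This is only an illustration under stated assumptions: the `Session` fields, the `backup_as_zip` helper, and the placeholder dump file are hypothetical, and the mapping of green to running and red to stopped is assumed, since the posting only says the light indicates status.

```python
import os
import tempfile
import zipfile
from dataclasses import dataclass


@dataclass
class Session:
    """One extraction run tied to its own database (field names are hypothetical)."""
    name: str
    db_path: str
    running: bool = False

    def status_light(self):
        # Assumed mapping: green while running, red while stopped.
        return "green" if self.running else "red"


def backup_as_zip(session, out_dir):
    """Write the session's database file into <out_dir>/<name>.zip and return the path."""
    if session.running:
        raise RuntimeError("stop the session before backing it up")
    zip_path = os.path.join(out_dir, session.name + ".zip")
    with zipfile.ZipFile(zip_path, "w", zipfile.ZIP_DEFLATED) as archive:
        archive.write(session.db_path, arcname=os.path.basename(session.db_path))
    return zip_path


# Placeholder standing in for a real database dump of a completed session.
work_dir = tempfile.mkdtemp()
db_file = os.path.join(work_dir, "austin_plumbers.sql")
with open(db_file, "w") as handle:
    handle.write("-- placeholder dump\n")

session = Session(name="austin_plumbers", db_path=db_file, running=True)
print(session.status_light())
session.running = False
archive_path = backup_as_zip(session, work_dir)
print(zipfile.ZipFile(archive_path).namelist())
```

The resulting archive is the file that would be offered for download and, per the requirements above, also transferred by FTP to the second server.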

Project Details:

•The project is ongoing, so you should be interested in a long-term arrangement.

•Must be available to chat using MSN, Yahoo or Google daily.

Server Specs :

•CentOS + cPanel

•3 GB RAM with 400+ GB storage.

This project is fairly simple. The winner will be chosen based on how many quality records per minute/hour can be extracted and based on communication skills.

Skills: Anything Goes, MySQL, PHP

About the Employer:
( 1 review )

Project ID: #1876027