In Progress

Cache Scraper Script

I'm looking for a web application / script that will scrape a website's cache through Google. It must be able to scrape and edit unlimited pages.

I'd like to input a domain and have the script go to Google, scrape all of the HTML from Google's cached links, and save it in .txt or .html format, placing the content in separate folders that mirror the website's structure.

For example: if the site had content in a folder named /blog ([url removed, login to view]), the software will save the pages in HTML or text format in the corresponding folder. The URLs should be saved exactly as seen in the cache.
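The folder mapping described above can be sketched as follows. This is only an illustration in Python (the posting asks for PHP, so the logic would need porting); the function name `local_path_for`, the `mirror` root folder, and the `index.html` default for folder URLs are assumptions, not requirements from the posting.

```python
from pathlib import PurePosixPath
from urllib.parse import urlsplit


def local_path_for(url: str, root: str = "mirror", default_ext: str = ".html") -> str:
    """Map a page URL to a local file path that mirrors the site's folders."""
    parts = urlsplit(url)
    # Keep the path segments as they appear in the URL, dropping empty
    # segments and "." / ".." so a saved file cannot escape the root folder.
    segments = [s for s in parts.path.split("/") if s not in ("", ".", "..")]
    if parts.path.endswith("/") or not segments:
        # Folder-style URL (or the bare domain): store it as an index file.
        segments.append("index" + default_ext)
    elif "." not in segments[-1]:
        # Extensionless page such as /blog/first-post: add the default extension.
        segments[-1] += default_ext
    return str(PurePosixPath(root, parts.netloc, *segments))
```

Under these assumptions, a page at `http://example.com/blog/first-post` would be written to `mirror/example.com/blog/first-post.html`, while an asset such as `/img/logo.png` keeps its original file name.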

All of this data should ideally also be saved into a database.
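A minimal sketch of the database step, assuming one row per scraped URL. Python's built-in SQLite module stands in here for the MySQL database the posting asks for, and the table name `pages`, its columns, and the helper names are all hypothetical.

```python
import sqlite3


def open_store(db_path: str = ":memory:") -> sqlite3.Connection:
    """Open the database and create the (hypothetical) pages table if needed."""
    conn = sqlite3.connect(db_path)
    conn.execute(
        "CREATE TABLE IF NOT EXISTS pages ("
        "url TEXT PRIMARY KEY, "
        "local_path TEXT NOT NULL, "
        "html TEXT NOT NULL)"
    )
    return conn


def save_page(conn: sqlite3.Connection, url: str, local_path: str, html: str) -> None:
    """Insert a scraped page, replacing any earlier copy stored for the same URL."""
    with conn:  # commits on success, rolls back on error
        conn.execute(
            "INSERT OR REPLACE INTO pages (url, local_path, html) VALUES (?, ?, ?)",
            (url, local_path, html),
        )
```

Keying the table on the URL means re-running the scraper updates existing rows instead of creating duplicates.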

It also needs to be able to extract any HTML or code from each page and save it under the original file name and in its respective folder.

This is because some websites have images, JavaScript, and other code that will not work after the hosting is transferred. For example, Google adds its own HTML at the top of these cached pages, and most sites will have their own contact forms, etc., that must be removed.
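Removing the markup that Google adds could be sketched roughly as below, in Python for illustration only. The exact markup of the cache banner is not given in the posting, so the start and end markers are passed in as parameters; the function name `strip_between` and the marker strings in the usage line are hypothetical placeholders.

```python
def strip_between(html: str, start_marker: str, end_marker: str) -> str:
    """Remove the first span running from start_marker through end_marker.

    The markers are assumptions supplied by the caller; if either one is
    missing, the page is returned unchanged rather than guessed at.
    """
    start = html.find(start_marker)
    if start == -1:
        return html
    end = html.find(end_marker, start + len(start_marker))
    if end == -1:
        return html
    return html[:start] + html[end + len(end_marker):]
```

For instance, `strip_between(page, "<!--BANNER-START-->", "<!--BANNER-END-->")` would drop a banner wrapped in those two placeholder comments and leave the rest of the page untouched. The same helper could be reused for contact forms once their surrounding markup is known.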

There also needs to be proxy support so the scraper doesn't get banned.

I would want the application / script to be written in PHP / MySQL and related languages as needed.
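One simple way to provide proxy support is to rotate through a list of proxies in round-robin order. The sketch below uses Python's standard library for illustration; the class name `ProxyRotator` and the proxy addresses shown in the usage note are hypothetical, and a real deployment would also need error handling and request delays.

```python
import itertools
import urllib.request


class ProxyRotator:
    """Hand out proxies from a fixed list in round-robin order."""

    def __init__(self, proxies):
        if not proxies:
            raise ValueError("at least one proxy is required")
        self._cycle = itertools.cycle(list(proxies))

    def next_proxy(self) -> str:
        """Return the next proxy URL in the rotation."""
        return next(self._cycle)

    def build_opener(self):
        """Return (proxy, opener) where the opener routes traffic via that proxy."""
        proxy = self.next_proxy()
        handler = urllib.request.ProxyHandler({"http": proxy, "https": proxy})
        return proxy, urllib.request.build_opener(handler)
```

A caller would create `ProxyRotator(["http://10.0.0.1:8080", "http://10.0.0.2:8080"])` once and call `build_opener()` before each request, so that consecutive requests leave through different proxies.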

Upon completion, I would like you to provide a screencast demonstration video of the software in action and provide full rights to the software.

If changes and updates need to be made in the future, I want to be assured that this person or team is willing and interested in working at a reasonable and fair rate.

I want to be able to create a long-term relationship, as I have many other projects for the right team or person.

Please provide any examples of similar projects.

To be considered, please include the words "I can help you scrape" in your reply. Those who do not will not be considered.

If you are this person or team, please apply.

Please ask any questions you may have.

Skills: jQuery / Prototype, MySQL, PHP, Software Architecture

See more: you proxy google, script php proxy web, script for google forms, i want create blog in go, can a domain name be transferred, architecture updates, script words, google web scraper, screen scraper, scraper, scraper software, scrape for links, html java script needed, file hosting script, edit 343, Cast, cache, cache c, mysql scrape website, screen scraper php, php proxy video, extract html code script, scraper mysql php, blog script html, data scrape input screen

About the Employer:
( 0 reviews ) Fort Lauderdale, United States

Project ID: #1620915

2 freelancers are bidding an average of $175 for this job


Please check PM.

$200 USD in 15 days
(49 reviews)

Hi, "I can help you scrape". Pls check the PMB.

$150 USD in 4 days
(0 reviews)