Đã hoàn thành

151019 Image scraper

Image scraper

Please see the attached file for example URLs.

Given a list of urls the following information needs to be extracted from the page to our server:

Thumbnail image of Page 1 and if it doesn't exist, then the Abstract image

Rename the image according to the following format from data scraped on the same page as the image: Filing Date, Inventor, patent title.

When an existing file already exists with the same name, the script will automatically append an increasing numerical value to the end of the title ( _1, _2 )

Punctuation like periods and commas to be removed from the file name.

The newer pages have a slightly different layout but the script needs to work with both layouts since the urls can not be sorted according to dates. Please see the examples attached.

Preference to coders that can provide a working demo.

Kĩ năng: Bất kì công việc gì, ASP, Perl, PHP, Python, Thiết lập Bản thảo

Xem nhiều hơn: scraper, numerical abstract, inventor, automatically image, date scraper, page abstract, name image, page scraper script, image rename, image list, please inventor, perl image, php script append data, image scraper php, perl file format, data image scraper, php script image thumbnail, image layout, php file image, script php scraper, patent filing, php thumbnail image, image scraper, page title scraper, end image

Về Bên Thuê:
( 57 nhận xét )

ID dự án: #1897198

Được trao cho:


Updated, please see my pmb

$75 USD trong 3 ngày
(0 Đánh Giá)