I need a PHP scraper script or program designed to scrape content from a particular website.
The script will take a list of URLs as input for the scraping.
All of these pages will have the same layout.
I must be able to define several patterns for this layout to extract the content (for example the website title, a certain image, a link, ...).
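To illustrate what "defining several patterns" could look like, here is a minimal sketch. It is written in Python only for illustration; the same idea maps to `preg_match` in PHP. The pattern names and the HTML attributes they look for (`class="main"`, `id="source"`) are hypothetical and would depend on the actual site's markup.

```python
import re

# Hypothetical patterns: one named regular expression per field to extract.
# The real expressions depend on the layout of the target pages.
PATTERNS = {
    "title": r"<title>(.*?)</title>",
    "image": r'<img[^>]+class="main"[^>]+src="([^"]+)"',
    "link": r'<a[^>]+id="source"[^>]+href="([^"]+)"',
}


def extract_fields(html, patterns=PATTERNS):
    """Return the first capture group of each pattern, or "" if it does not match."""
    fields = {}
    for name, pattern in patterns.items():
        match = re.search(pattern, html, re.IGNORECASE | re.DOTALL)
        fields[name] = match.group(1).strip() if match else ""
    return fields
```

Keeping the patterns in one dictionary means a new field can be added without touching the extraction logic.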
The content should be immediately written to a static file using a template that I define, replacing the placeholders with the scraped content. Images that match the pattern must be downloaded to an images folder.
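The template step could work roughly as sketched below, again in Python purely as an illustration. The `{{name}}` placeholder syntax, the function names, and the default `images` folder name are assumptions; the actual template format is whatever I define.

```python
import os
import re
import urllib.request
from urllib.parse import urlparse


def render_template(template, fields):
    """Replace each {{name}} placeholder with the scraped value of that name."""
    def substitute(match):
        # Placeholders without a scraped value are replaced by an empty string.
        return fields.get(match.group(1), "")
    return re.sub(r"\{\{\s*(\w+)\s*\}\}", substitute, template)


def image_target_path(image_url, folder="images"):
    """Build the local path for an image from the last segment of its URL."""
    filename = os.path.basename(urlparse(image_url).path) or "image"
    return os.path.join(folder, filename)


def download_image(image_url, folder="images"):
    """Download one image into the images folder and return its local path."""
    os.makedirs(folder, exist_ok=True)
    target = image_target_path(image_url, folder)
    urllib.request.urlretrieve(image_url, target)
    return target
```

For example, `render_template("<h1>{{title}}</h1>", {"title": "Demo"})` yields `<h1>Demo</h1>`.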
The solution can be either online as a PHP script or offline as a standalone Windows program.
The pages should all be stored in one folder, using the title of the page as the filename ("[login to view URL]"). You must make sure that files are not overwritten (for example by naming a file "[login to view URL]" if it already exists).
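The no-overwrite rule could be handled along these lines. This is a Python sketch for illustration only (in PHP, `file_exists` would play the same role). The `.html` extension, the `_1`, `_2` suffix scheme, and the way the title is cleaned up are assumptions, not fixed requirements.

```python
import os
import re


def slugify(title):
    """Turn a page title into a safe filename stem (assumed cleanup rule)."""
    slug = re.sub(r"[^A-Za-z0-9]+", "-", title).strip("-").lower()
    return slug or "page"


def unique_path(folder, title, ext=".html"):
    """Return a path in folder based on the title that does not exist yet."""
    base = slugify(title)
    candidate = os.path.join(folder, base + ext)
    counter = 1
    # Append _1, _2, ... until the name is free, so nothing is overwritten.
    while os.path.exists(candidate):
        candidate = os.path.join(folder, f"{base}_{counter}{ext}")
        counter += 1
    return candidate
```

The check and the later write are two separate steps, so this is only safe when a single process writes to the folder at a time.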