I need about 4,000 products scraped from a dropshipper's site. The site runs on Magento and appears to have very clean, standardized code.
I have attached two files:
1. Excel document with 3 tabs
A. Product URLs - all product pages that need to be scraped.
B. Product Page Data - names of all attributes that need to be scraped from each product page.
C. Non-product URLs - For category pages, I just need a list of all products that appear under each category. This will allow me to map their products/categories to the slightly different category names I use on my site.
2. TXT file of page schema
I copied the page source of a random product page and labeled every page element that needs to be scraped. This file matches the raw page code to the field names in tab B (Product Page Data) of the Excel file.
I only need the details scraped and cleaned of unnecessary HTML code. I will handle loading everything into my OpenCart site.
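
To illustrate what I mean by "cleaned of unnecessary HTML code", here is a rough Python sketch of the kind of cleanup I have in mind. It is only an example, not a required implementation: the names `TextExtractor` and `clean_html` are made up for illustration, and the sample fragment is invented rather than taken from the actual site. It uses only the standard library's `html.parser` module.

```python
from html.parser import HTMLParser


class TextExtractor(HTMLParser):
    """Collects visible text and skips the contents of <script> and <style>.

    Hypothetical helper for illustration; not part of any existing scraper.
    """

    SKIPPED_TAGS = {"script", "style"}

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.parts = []
        self.skip_depth = 0

    def handle_starttag(self, tag, attrs):
        # Start ignoring text once we enter a script or style element.
        if tag in self.SKIPPED_TAGS:
            self.skip_depth += 1

    def handle_endtag(self, tag):
        # Resume collecting text after the skipped element closes.
        if tag in self.SKIPPED_TAGS and self.skip_depth > 0:
            self.skip_depth -= 1

    def handle_data(self, data):
        if self.skip_depth == 0:
            self.parts.append(data)


def clean_html(fragment: str) -> str:
    """Return the text of an HTML fragment with tags removed,
    entities decoded, and runs of whitespace collapsed to single spaces."""
    parser = TextExtractor()
    parser.feed(fragment)
    parser.close()
    # Join with spaces so text from adjacent block elements does not run together.
    return " ".join(" ".join(parser.parts).split())


if __name__ == "__main__":
    # Invented sample fragment, not copied from the real product pages.
    sample = "<p>Solid <b>oak</b> table &amp; chairs</p><script>var x = 1;</script>"
    print(clean_html(sample))
```

One caveat with this approach: because text pieces are joined with spaces, a word split by inline markup (for example `<b>Hel</b>lo`) would come out with a space inside it. If some fields such as long descriptions should keep basic formatting like paragraphs or lists, let me know and we can agree on which tags to preserve.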