My distributor has a website with a tabular HTML page listing each product number and its price. Each product number links to another tabular page showing the sizes, colors, and availability of that product. I need someone to write an HTML parser that scrapes this data and produces an RSS feed, and then a script that reads the RSS feed and updates the products in my database with the correct price and stock status.
I already have a script that mines the data and modifies my cart. It works, but it is so poorly coded that it cannot parse a new catalogue that my distributor has. The new parser must therefore be well coded so that new catalogues are easy to parse in the future. The catalogues are all in a very similar tabular format.
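To illustrate what "easy to adapt to new catalogues" could mean, here is a minimal sketch in Python (used only for illustration; it is not the required language). It assumes each catalogue is a plain HTML table with one header row, and that the only thing that changes between catalogues is which column holds which field. The names `TableRowParser`, `parse_catalogue`, `SAMPLE_HTML`, and `SAMPLE_COLUMNS` are hypothetical, and the sample markup is invented, not taken from the distributor's site.

```python
from html.parser import HTMLParser


class TableRowParser(HTMLParser):
    """Collect the text of every <td>/<th> cell, grouped by <tr> row."""

    def __init__(self):
        super().__init__()
        self.rows = []      # finished rows, each a list of cell strings
        self._row = None    # cells of the row currently being read
        self._cell = None   # text fragments of the cell currently being read

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self._row = []
        elif tag in ("td", "th") and self._row is not None:
            self._cell = []

    def handle_data(self, data):
        # Text inside nested tags such as <a> still belongs to the open cell.
        if self._cell is not None:
            self._cell.append(data)

    def handle_endtag(self, tag):
        if tag in ("td", "th") and self._cell is not None:
            self._row.append("".join(self._cell).strip())
            self._cell = None
        elif tag == "tr" and self._row is not None:
            self.rows.append(self._row)
            self._row = None


def parse_catalogue(html, columns, skip_rows=1):
    """Turn table rows into dicts using a per-catalogue column layout.

    `columns` maps a field name to its zero-based column index, so a new
    catalogue only needs a new mapping, not new parsing code.
    """
    parser = TableRowParser()
    parser.feed(html)
    parser.close()
    products = []
    for row in parser.rows[skip_rows:]:
        if len(row) <= max(columns.values()):
            continue  # skip rows too short to hold every mapped column
        products.append({name: row[i] for name, i in columns.items()})
    return products


# Invented sample markup standing in for a catalogue page (assumption).
SAMPLE_HTML = """
<table>
  <tr><th>Product No.</th><th>Price</th></tr>
  <tr><td><a href="detail?id=A100">A100</a></td><td>12.50</td></tr>
  <tr><td><a href="detail?id=B200">B200</a></td><td>8.00</td></tr>
</table>
"""

SAMPLE_COLUMNS = {"product_number": 0, "price": 1}
products = parse_catalogue(SAMPLE_HTML, SAMPLE_COLUMNS)
```

With this layout, supporting a catalogue whose columns are in a different order would only require a different `columns` mapping. Following the per-product links to the size, color, and availability pages is left out of the sketch.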
You can make use of the existing code as a reference.
So there will be two scripts you will be creating:
1) HTML to RSS feed: a script that crawls the online catalogue and parses the product number, price, and size and color availability. It must be written so that it can be adapted to new catalogues.
2) RSS feed to database: a script that reads the RSS feed and updates the database with the parsed data. It also generates a simple report at the end, containing:
i) number of products available in the online catalogue but not available in the database
ii) list of products not available in the database
iii) number of products out of stock
iv) list of products out of stock
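As a rough illustration of the second script's report, here is a minimal Python sketch (again, the language is only for illustration). It makes several assumptions that are not part of this brief: each feed `<item>` carries the product number in `<title>` plus hypothetical `<price>` and `<inStock>` elements, and the database is represented by a plain set of product numbers rather than real CubeCart tables. A real feed would normally put such extension elements in an XML namespace. The names `read_feed`, `build_report`, and `SAMPLE_FEED` are hypothetical.

```python
import xml.etree.ElementTree as ET


def read_feed(rss_text):
    """Return {product_number: {"price": float, "in_stock": bool}}."""
    root = ET.fromstring(rss_text)
    items = {}
    for item in root.iter("item"):
        number = item.findtext("title", default="").strip()
        if not number:
            continue  # ignore items without a product number
        items[number] = {
            "price": float(item.findtext("price", default="0")),
            # <inStock> is an assumed element holding "true" or "false".
            "in_stock": item.findtext("inStock", default="false").strip() == "true",
        }
    return items


def build_report(feed_items, database_numbers):
    """Compute the four report entries (i to iv) listed above."""
    missing = sorted(n for n in feed_items if n not in database_numbers)
    out_of_stock = sorted(n for n, d in feed_items.items() if not d["in_stock"])
    return {
        "missing_count": len(missing),            # i
        "missing": missing,                       # ii
        "out_of_stock_count": len(out_of_stock),  # iii
        "out_of_stock": out_of_stock,             # iv
    }


# Invented sample feed (assumption), not the real feed layout.
SAMPLE_FEED = """<rss version="2.0"><channel><title>Catalogue</title>
<item><title>A100</title><price>12.50</price><inStock>true</inStock></item>
<item><title>B200</title><price>8.00</price><inStock>false</inStock></item>
<item><title>C300</title><price>5.25</price><inStock>true</inStock></item>
</channel></rss>"""

feed_items = read_feed(SAMPLE_FEED)
report = build_report(feed_items, database_numbers={"A100", "B200"})
```

Here "out of stock" is counted over the products in the feed; whether it should instead be counted over the products in the database is something to confirm against the existing script.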
My current script does all of this, so you can use it as a reference. The cart I am using is CubeCart 4.
The programmer must be able to write well-documented, modular, and well-tested code. Experience with CubeCart 4 would be beneficial.
Please consider your bid amount and duration carefully.