I am looking to create a database of venues and performers from 2 different classifieds sites. I imagine this will be a simple web scraping exercise and given there are only 2 different sites, would not be a huge task.
The data points I would like to get from each of the sites are included in the excel attachment provided.
The two different sites are labelled "YP" and "YB" in the attachment. I have provided an example of the data we would like to get for each of the sites. The individual sheets in the excel attachment provide the URL to be used to mine the data – this is located in cell D5 of each sheet.
Note for the "YB" site, there are a number of different states to select. The full instructions are contained in the spreadsheet.
Please provide a sample for each of the two different sites as part of your initial bid.