I have some more scrape work for you if your up for it.
Job 1: Take the BOTW scrape and visit each of their sites searching for the twitter, facebook so on.
Basically I have visited a lot of the sites in the BOTW scrape that you did and I believe we can pad out the data we have by scraping the site themselves. Please could you look at the attached and scrape the 32K sites. No extra column just fill in anything missing.
Job 2: Scrape the technorait feeds to get website descriptions.
In the technorati scrape that you did we do not have any descriptions. could you visit each of the site and scrape the meta description and add it to the data. New field would be description.
Job 3: Scrapes Klout, using the twitter account for description plus websites.
Now on the scrape version that you did we need descriptions plus also the website if we can find them. So what I would like is for you to scrape each of the twitter descriptions. Save the description and if you can find any clear link to what look like a site belong to them then add that to a new column to, so the new columns would be description + website.
Please give me a quote and time frame.