Scraping & Email Validation App
Part 1: Spidering Bot
A: Bot will go to
[url removed, login to view]
B: First example.
Bot will go to Basic Materials
Bot will go to Agriculture Chemicals
Bot will click on Agrium Inc.
Bot will scrape the following information:
Full time Employees
First Name, Last Name, Middle Name and Title, all in separate columns
This will be done for all the categories and all the companies in the database. I will do the spidering; you just have to build the bot with a GUI.
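As a rough illustration of the "separate columns" requirement, here is a minimal Python sketch. It assumes the scraped page yields a full name string and a title string; the function names (split_person, to_csv) and the column names are hypothetical, and real names with suffixes or multi-word surnames would need more careful handling.

```python
import csv
import io

COLUMNS = ["first_name", "middle_name", "last_name", "title"]

def split_person(full_name, title):
    """Split a full name into first / middle / last parts (naive whitespace split)."""
    parts = full_name.split()
    first = parts[0] if parts else ""
    last = parts[-1] if len(parts) > 1 else ""
    middle = " ".join(parts[1:-1])  # everything between first and last
    return {
        "first_name": first,
        "middle_name": middle,
        "last_name": last,
        "title": title,
    }

def to_csv(rows):
    """Write the rows as CSV text with one column per field."""
    buf = io.StringIO()
    writer = csv.DictWriter(buf, fieldnames=COLUMNS, lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows)
    return buf.getvalue()

if __name__ == "__main__":
    rows = [split_person("John Q Doe", "Chief Executive Officer")]
    print(to_csv(rows))
```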
Part 2: Email Validation:
a) Web based: 1 HTML page where the user can enter 3 fields (1: First Name, 2: Last Name, 3: Domain) and click submit. The app will find the right email from Google/Yahoo and/or give the right format of the email based on similar emails found in the search.
b) Import a .csv or .mdb list with 3 columns. Same as above: the app will run all queries for possible emails and store the email in the same sheet.
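The batch import in (b) could be sketched roughly as follows for the .csv case. This is only an outline under assumptions: the column names (first_name, last_name, domain, email) are hypothetical, and find_email stands in for whatever lookup the app ends up using; here it is passed in as a plain function so the sketch stays self-contained.

```python
import csv
import io

def fill_emails(csv_text, find_email):
    """Read rows with first_name, last_name, domain and add an email column."""
    reader = csv.DictReader(io.StringIO(csv_text))
    fieldnames = list(reader.fieldnames or [])
    if "email" not in fieldnames:
        fieldnames.append("email")

    out = io.StringIO()
    writer = csv.DictWriter(out, fieldnames=fieldnames, lineterminator="\n")
    writer.writeheader()
    for row in reader:
        # find_email is a placeholder for the real search step;
        # it should return an address or None.
        found = find_email(row["first_name"], row["last_name"], row["domain"])
        row["email"] = found or ""
        writer.writerow(row)
    return out.getvalue()

if __name__ == "__main__":
    sheet = "first_name,last_name,domain\nJohn,Doe,microsoft.com\n"
    # Stand-in lookup: always guesses first.last@domain.
    guess = lambda f, l, d: f"{f.lower()}.{l.lower()}@{d}"
    print(fill_emails(sheet, guess))
```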
Email Validation Process
Example: John Doe at Microsoft
First Query: john.doeatmicrosoft.com....
Second Query: [url removed, login to view]
Third Query: [url removed, login to view]
Fourth Query: [url removed, login to view]
Fifth Query: [url removed, login to view]
Sixth Query: [url removed, login to view]
Seventh Query: [url removed, login to view]
Eighth Query: [url removed, login to view]
and other variations, and so on
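The query list above could be generated along these lines. Only the first format (first.last) is visible in this brief; the other patterns in the sketch are common guesses added for illustration, not the actual second to eighth queries. The sketch also assumes that "at" in the examples stands for "@" and that both names are non-empty; candidate_emails is a hypothetical name.

```python
def candidate_emails(first, last, domain):
    """Return candidate addresses to query, in order, without duplicates."""
    f, l = first.strip().lower(), last.strip().lower()
    local_parts = [
        f"{f}.{l}",     # john.doe (the format shown in the brief)
        f"{f}{l}",      # johndoe  (assumed variation)
        f"{f[0]}{l}",   # jdoe     (assumed variation)
        f"{f[0]}.{l}",  # j.doe    (assumed variation)
        f"{f}{l[0]}",   # johnd    (assumed variation)
        f"{f}_{l}",     # john_doe (assumed variation)
        f"{l}.{f}",     # doe.john (assumed variation)
        f"{f}",         # john     (assumed variation)
    ]
    # dict.fromkeys keeps the first occurrence of each entry and preserves order.
    return [f"{local}@{domain}" for local in dict.fromkeys(local_parts)]

if __name__ == "__main__":
    for address in candidate_emails("John", "Doe", "microsoft.com"):
        print(address)
```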
If there are no results, then a general query needs to be made for "atmicrosoft.com", excluding results with info, support, investor, relations, products and other such terms. The app must be able to differentiate in order to find a real person's email. Once a format is found (for instance, Mark Pech's email at Microsoft is mpechatmicrosoft), it should be displayed with a message saying alternate email formats have been found (mpechatmicrosoft, bgateatmicrosoft). The format should then be applied to the user we are looking for, giving a result saying that, based on alternate emails at this domain, the email might be [url removed, login to view]
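The fallback step might look something like the sketch below. It assumes the search step has already produced (first name, last name, address) triples for people at the domain, that "at" stands for "@", and that names are non-empty. The pattern table, the function names and the token-based filter for generic mailboxes are illustrative choices, not a specified design.

```python
import re
from collections import Counter

# Terms from the brief that mark a non-personal mailbox.
GENERIC_TERMS = {"info", "support", "investor", "relations", "products"}

# Hypothetical pattern table: name -> function building the local part.
PATTERNS = {
    "first.last": lambda f, l: f"{f}.{l}",
    "firstlast": lambda f, l: f"{f}{l}",
    "flast": lambda f, l: f"{f[0]}{l}",
    "firstl": lambda f, l: f"{f}{l[0]}",
}

def is_personal(address):
    """Reject addresses whose local part contains a generic term as a token."""
    local = address.split("@", 1)[0].lower()
    tokens = re.split(r"[._\-]+", local)
    return not any(token in GENERIC_TERMS for token in tokens)

def infer_pattern(first, last, address):
    """Return the name of the pattern that reproduces this address, if any."""
    f, l = first.lower(), last.lower()
    local = address.split("@", 1)[0].lower()
    for name, build in PATTERNS.items():
        if build(f, l) == local:
            return name
    return None

def guess_email(known, first, last, domain):
    """Apply the most common pattern among known personal addresses."""
    counts = Counter()
    for k_first, k_last, address in known:
        if not is_personal(address):
            continue
        pattern = infer_pattern(k_first, k_last, address)
        if pattern:
            counts[pattern] += 1
    if not counts:
        return None
    best = counts.most_common(1)[0][0]
    return PATTERNS[best](first.lower(), last.lower()) + "@" + domain

if __name__ == "__main__":
    known = [
        ("Mark", "Pech", "mpech@microsoft.com"),
        ("Bill", "Gate", "bgate@microsoft.com"),
        ("Investor", "Relations", "investor.relations@microsoft.com"),
    ]
    print(guess_email(known, "John", "Doe", "microsoft.com"))
```

With the two example addresses from the brief as input, the sketch picks the first-initial-plus-last-name pattern and applies it to the person being looked up.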
I hope this helps explain things further and lets you come up with your plan.