We are looking to develop an application that visits websites found by searching for the terms listed in an input file. For each site, it should crawl the pages, extract any email address it finds (either in a mailto: link or displayed as text), and place it in a CSV file along with the website address, as (url, email).
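As a rough illustration of the extraction step, here is a minimal Python sketch that pulls addresses out of a page's HTML, both from mailto: links and from visible text. The names `EmailExtractor` and `extract_emails` are hypothetical, and the regular expression is a deliberately simple assumption that will not cover every valid address format.

```python
import re
from html.parser import HTMLParser
from urllib.parse import unquote

# Deliberately simple pattern (an assumption): it will miss some valid
# addresses and may accept a few invalid ones.
EMAIL_RE = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")


class EmailExtractor(HTMLParser):
    """Collects addresses from mailto: links and from text content."""

    def __init__(self):
        super().__init__()
        self.emails = set()

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        for name, value in attrs:
            if name == "href" and value and value.lower().startswith("mailto:"):
                # Drop the scheme and any "?subject=..." query part.
                target = unquote(value[len("mailto:"):].split("?", 1)[0])
                self.emails.update(EMAIL_RE.findall(target))

    def handle_data(self, data):
        # Addresses written out in the page text.
        self.emails.update(EMAIL_RE.findall(data))


def extract_emails(html):
    """Return the lower-cased, de-duplicated addresses found in html."""
    parser = EmailExtractor()
    parser.feed(html)
    parser.close()
    return sorted({e.lower() for e in parser.emails})
```

Calling `extract_emails(page_html)` on each crawled page would give the email half of the (url, email) rows. Addresses hidden behind images or scripts would not be found this way.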
This application shall be run directly from my PC as an applet. We will supply the input file of search terms and select the search engines to visit (Google, Yahoo, MSN, Ask, Lycos). We should be able to add further search engines by name and primary search URL if need be.
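One possible way to keep the search engines configurable is a small registry keyed by name. This is only a sketch: the `{query}` placeholder convention, the function names, and the example URL are assumptions for illustration, not the real search URL of any engine listed above.

```python
from urllib.parse import quote_plus

# name -> URL template; "{query}" marks where the encoded term goes.
# (The placeholder convention is an assumption of this sketch.)
SEARCH_ENGINES = {}


def add_engine(name, url_template):
    """Register a search engine by name and primary search URL."""
    if "{query}" not in url_template:
        raise ValueError("url_template must contain a {query} placeholder")
    SEARCH_ENGINES[name] = url_template


def build_search_url(name, term):
    """Return the search URL for one term on a registered engine."""
    return SEARCH_ENGINES[name].replace("{query}", quote_plus(term))


# Hypothetical engine used purely as an example.
add_engine("ExampleSearch", "https://search.example.com/?q={query}")
```

With this in place, `build_search_url("ExampleSearch", "garden centres")` yields `https://search.example.com/?q=garden+centres`, and a new engine is one `add_engine` call.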
Several websites do not have an email address physically available on the site. When this is the case, the application shall crawl [url removed, login to view] or a similar site and try to extract the address from there. If it still fails to get an address, it shall write these results to a third CSV file named xxx-nonemail.csv.
We should be able to set the number of websites to search for each search term, from 1 to 1000.
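The per-term limit could be enforced with a small check like the one below. This is a sketch only; `check_site_limit` and `take_sites` are hypothetical names, and how the limit is entered (form field, settings file, and so on) is left open.

```python
from itertools import islice

MIN_SITES, MAX_SITES = 1, 1000  # range stated in the brief


def check_site_limit(value):
    """Return value as an int, or raise if it is outside 1-1000."""
    n = int(value)
    if not MIN_SITES <= n <= MAX_SITES:
        raise ValueError(f"site limit must be {MIN_SITES}-{MAX_SITES}, got {n}")
    return n


def take_sites(urls, limit):
    """Keep at most `limit` result URLs for one search term."""
    return list(islice(urls, check_site_limit(limit)))
```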
We need the ability to name the CSV files.
Search list (see sample)
Results with email (see sample)
Results without email
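To illustrate how the two result files might be written under a user-chosen name, here is a minimal sketch. The `-nonemail.csv` suffix follows the xxx-nonemail.csv pattern mentioned above; the `-email.csv` suffix, the function name `write_results`, and the use of `None` for "no address found" are assumptions.

```python
import csv
from pathlib import Path


def write_results(base_name, results, out_dir="."):
    """Split (url, email) pairs into two CSV files.

    results: iterable of (url, email); email is None when none was found.
    """
    out = Path(out_dir)
    email_path = out / f"{base_name}-email.csv"        # assumed file name
    nonemail_path = out / f"{base_name}-nonemail.csv"  # pattern from the brief
    with open(email_path, "w", newline="", encoding="utf-8") as ef, \
         open(nonemail_path, "w", newline="", encoding="utf-8") as nf:
        email_writer = csv.writer(ef)
        nonemail_writer = csv.writer(nf)
        for url, email in results:
            if email:
                email_writer.writerow([url, email])
            else:
                nonemail_writer.writerow([url])
    return email_path, nonemail_path
```

For example, `write_results("plumbers", rows)` would produce plumbers-email.csv and plumbers-nonemail.csv in the current folder.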
Additionally, if possible, we would like a simple emailing function that automatically emails these addresses when they are found. If this increases the cost too much, we would prefer not to have this function.