Đang Thực Hiện

159507 Email Extractor

We are looking to develop an application which will visit websites based on a given search terms based in an input file and extract emails from these sites after crawling the site and finding a mailto:, or email address displayed and place this within a CSV file, along with the website address as (URL, email).

This application shall be ran locally from my PC as an applet and ideally be a self installing .exe file and we will input the file (see below) for it to do the search from and the search engines to visit (Google, Yahoo, MSN, Ask, Lycos). We should have the ability to add additional search engines based on name and primary search URL if need be.

The email extractor will need to visit ALL pages of a given website to locate an email address by doing a search on an @ symbol, mailto: within the source.

Several website do not list their contact email and when this is the case it shall crawl [url removed, login to view] or similar site and try extracting the address from here. Failing getting an address is shall input these results to a third CSV file.

All the files need to be downloadable in .CSV or text format, and be named by us.

We should set the number of websites to search for each search term from 1-many (could be over a 1,000,000 but needs to be set our end).

The input file for example if we target the FOREX market could be:

forex trading

currency rates

forex rates


link to: [url removed, login to view]

link to: [url removed, login to view]

foreign exchange market


There are two output files that will look like

File when it finds an email:

[url removed, login to view], email address

[url removed, login to view], email address

[url removed, login to view], email address

In the event the email does not exist on the website it will do a look up using WHOIS database (such as [url removed, login to view]), and if found there then it adds the URL and email to the above file, otherwise a second file is created with just the URLs listed.

We are also looking for the ability to automatically email the leads once found, but since we want to keep the cost within budget this is an additional feature. The above email extractor is by far the primary aim.

Kĩ năng: Bất kì công việc gì, Lập trình C, MySQL, PHP, Thiết lập Bản thảo, Visual Basic

Xem nhiều hơn: xe com, self programming, programming terms, programming symbol, finding market, event programming, event based programming, email over http, email application in php, applet http, php symbol, xe, foreign exchange, finding leads, files email, Email Finding, email extractor, email extract, domain email, php file exchange, file email extract, whois csv output, email website yahoo, text extracting, email domain list

Về Bên Thuê:
( 16 nhận xét ) San Jose, Costa Rica

ID dự án: #1905696