This program automatically and regularly scans the web (using the data of a crawler) to find out whether one's contacts are referenced. For each contact, the list of web pages where the contact appears is stored, with the date each entry was added; it is also possible to list all the pages where other contacts in the database are mentioned.

Environment: a contact manager (Outlook), with its scripting language (VB), accessing a web crawler (Google), and storing results (links) in a database (Outlook Journal).

Brief specs:
- Contacts to be tracked are identified by a value in an Outlook field (which can also define the scan frequency) and by the date of the last scan.
- The string to search for each contact is defined by default (for contact John Doe, it is by default "John Doe"); the string is stored in an Outlook field, so the user can modify it if necessary.
- The program contains a part that regularly scans the web by calling a crawler for each eligible contact. The user can choose the frequency of calls to the crawler. The program provides an easy way to optionally display what the scanner is currently doing, with some statistics; scanning can be stopped immediately.
- The scanner calls a crawler, typically Google; two approaches are possible:
  * Web service API: call the Google web service API. Pro: simple if using the proper library. Cons: limited to Google, limit of 1000 queries/day.
  * Parser/scraping: call the web page of the crawler, parse the results, and follow the link to the next page until all results have been read. Pro: flexible, since not limited to Google. Con: may require writing a parser for the crawler's result pages in VB (easy in Perl, since libraries exist).
- The results of scanning are a list of URLs. These are to be stored in a database. It is strongly suggested to use the Journal provided by Outlook, where each entry is a URL.
- Before storing an entry, check whether it already exists for that contact or for another contact. Associate the URL entry with the contact.
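
The eligibility, default search string, and duplicate-check rules above can be sketched as follows. This is only an illustration of the logic in Python, not the VBA add-in itself: the names `Contact`, `Journal`, `is_eligible`, and `scan` are hypothetical, the frequency is assumed to be a number of days, and the crawler is represented by any function that takes a search string and returns a list of URLs.

```python
from dataclasses import dataclass
from datetime import date, timedelta
from typing import Callable, Dict, Iterable, List, Optional


@dataclass
class Contact:
    # Hypothetical stand-in for an Outlook contact and its custom fields.
    full_name: str
    scan_every_days: Optional[int] = None  # None means "not tracked" (assumption)
    last_scan: Optional[date] = None
    search_string: Optional[str] = None    # user override, empty by default

    def query(self) -> str:
        # Default search string is the contact's name unless the user changed it.
        return self.search_string or self.full_name


def is_eligible(contact: Contact, today: date) -> bool:
    # A contact is scanned if it is tracked and its last scan is old enough.
    if contact.scan_every_days is None:
        return False
    if contact.last_scan is None:
        return True
    return today - contact.last_scan >= timedelta(days=contact.scan_every_days)


class Journal:
    # Hypothetical stand-in for the Outlook Journal: one entry per URL,
    # each entry mapping associated contact names to their entering date.
    def __init__(self) -> None:
        self.entries: Dict[str, Dict[str, date]] = {}

    def store(self, url: str, contact_name: str, today: date) -> bool:
        # Reuse the entry if the URL is already known (for any contact).
        linked = self.entries.setdefault(url, {})
        if contact_name in linked:
            return False  # already stored for this contact: nothing to do
        linked[contact_name] = today  # new association, with its entering date
        return True


def scan(contacts: Iterable[Contact], journal: Journal,
         search: Callable[[str], List[str]], today: date) -> int:
    # Run one scanning pass; return the number of new URL/contact associations.
    added = 0
    for contact in contacts:
        if not is_eligible(contact, today):
            continue
        for url in search(contact.query()):
            if journal.store(url, contact.full_name, today):
                added += 1
        contact.last_scan = today
    return added
```

Keeping one entry per URL and attaching contacts to it means that a page mentioning several contacts is stored once, which also makes it straightforward to list the other contacts mentioned on the same page.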
1) Complete and fully-functional working program(s) in executable form as well as complete source code of all work done.
2) Installation package that will install the software (in ready-to-run condition) on the platform(s) specified in this bid request.
3) Complete ownership and distribution copyrights to all work purchased.
The program should be an Outlook 2000/XP add-in, on Windows 98 or XP.
The database would use the Outlook model: contacts in a contact file, URLs stored in a Journal file.
Language: VBA for Outlook; if using Perl for parsing, provide the Perl library.