Build a system that extracts specific information (location, phone number, and name) from a list of input HTML files. The program will process the HTML of different websites. Pages that contain the information we need should have a fairly consistent layout within the same website, but the location and layout of that information will change drastically from site to site. Because of this, you will need to intelligently determine what an address is based on keywords such as city names, state names, and country names.
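As a rough illustration of the keyword-based address detection described above, here is a minimal Python sketch. It is not a required design. The keyword sets, the `TextBlocks` class, the `address_score` and `find_addresses` functions, and the `min_score` threshold are all hypothetical names and values chosen for the example; a real system would load much larger city, state, and country lists.

```python
import re
from html.parser import HTMLParser

# Hypothetical keyword lists for illustration only.
CITIES = {"springfield", "portland", "austin"}
STATE_NAMES = {"illinois", "oregon", "texas"}
STATE_ABBRS = {"IL", "OR", "TX"}  # matched case-sensitively to avoid the word "or"
COUNTRIES = {"usa", "canada"}


class TextBlocks(HTMLParser):
    """Collect visible text, one block per run of text between tags."""

    def __init__(self):
        super().__init__()
        self.blocks = []
        self._skip = 0  # depth inside <script>/<style>

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip:
            self._skip -= 1

    def handle_data(self, data):
        text = " ".join(data.split())  # collapse whitespace
        if text and not self._skip:
            self.blocks.append(text)


def address_score(text):
    """Count how many keyword categories (city, state, country) appear."""
    words = set(re.findall(r"[A-Za-z]+", text.lower()))
    abbrs = set(re.findall(r"\b[A-Z]{2}\b", text))
    has_city = bool(words & CITIES)
    has_state = bool(words & STATE_NAMES) or bool(abbrs & STATE_ABBRS)
    has_country = bool(words & COUNTRIES)
    return has_city + has_state + has_country


def find_addresses(html, min_score=2):
    """Return text blocks that match at least min_score keyword categories."""
    parser = TextBlocks()
    parser.feed(html)
    return [block for block in parser.blocks if address_score(block) >= min_score]
```

For example, calling `find_addresses` on a page containing `<p>742 Evergreen Terrace, Springfield, IL, USA</p>` and `<p>We ship to Austin or anywhere else.</p>` would keep only the first block, since the second matches a city name alone. Requiring two or more categories is one simple way to reduce false positives from a single stray keyword.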
To find names, use a list of first names and last names to build an understanding of where the names are located on the page. You should also look for identifiers that precede the name, such as "Name: ". Use this same logic to find the phone number(s) on the page.
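The two name signals described above (a preceding identifier and a first/last name list) could be combined roughly as follows, with the same label idea applied to phone numbers. This is only a sketch under assumptions: the name lists are tiny placeholders, the "Phone", "Tel", and "Telephone" labels and the phone-number pattern are guesses rather than requirements, and `find_names` and `find_phones` are hypothetical helper names. The input is assumed to be plain text already extracted from the HTML, with one field per line.

```python
import re

# Hypothetical name lists; a real system would load full lists from files.
FIRST_NAMES = {"jane", "john", "maria"}
LAST_NAMES = {"doe", "smith", "garcia"}

# A label followed by two or more capitalized words on the same line.
NAME_LABEL = re.compile(r"\bName:\s*([A-Z][a-z]+(?: [A-Z][a-z]+)+)")

# Assumed phone labels, followed by digits with common separators.
PHONE_LABEL = re.compile(r"\b(?:Phone|Tel|Telephone):\s*(\+?[\d ().-]{7,20}\d)")


def find_names(text):
    """Prefer labeled names; otherwise fall back to the name lists."""
    labeled = NAME_LABEL.findall(text)
    if labeled:
        return labeled
    words = re.findall(r"[A-Za-z]+", text)
    # Adjacent word pairs where the first is a known first name
    # and the second is a known last name.
    return [
        f"{first} {last}"
        for first, last in zip(words, words[1:])
        if first.lower() in FIRST_NAMES and last.lower() in LAST_NAMES
    ]


def find_phones(text):
    """Return phone numbers that follow a recognized label."""
    return PHONE_LABEL.findall(text)
```

Given the text `"Name: Jane Doe\nPhone: (555) 123-4567"`, `find_names` would return the labeled name and `find_phones` the labeled number. On text with no label, such as `"Contact John Smith today"`, `find_names` falls back to the list lookup. Because layout is fairly consistent within a site, whichever signal succeeds on one page could then be reused for the rest of that site's pages.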