This project is to build a scraper that saves football results and goal times from a specific website to a MySQL database. The target website is well structured, with all of one day's results on a single page.
The MySQL database consists of just two tables:
1. Matches: id, date, country, competition, home_team, away_team, full_time_score, half_time_score
2. Goals: match_id, team, minute
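As a rough illustration of the two tables listed above, the sketch below creates them through a generic Python DB-API connection. The column types, lengths, constraints, and the `create_schema` helper are assumptions, not part of the brief. It uses the standard-library `sqlite3` module only so that it runs without a database server; the statements would likely need adjusting for MySQL (for example, adding `AUTO_INCREMENT` to the `id` column).

```python
import sqlite3

# Column types and lengths are assumptions; the brief only names the columns.
CREATE_MATCHES = """
CREATE TABLE Matches (
    id INTEGER PRIMARY KEY,
    date DATE NOT NULL,
    country VARCHAR(64) NOT NULL,
    competition VARCHAR(128) NOT NULL,
    home_team VARCHAR(128) NOT NULL,
    away_team VARCHAR(128) NOT NULL,
    full_time_score VARCHAR(8) NOT NULL,
    half_time_score VARCHAR(8) NOT NULL
)
"""

CREATE_GOALS = """
CREATE TABLE Goals (
    match_id INTEGER NOT NULL,
    team VARCHAR(128) NOT NULL,
    minute INTEGER NOT NULL,
    FOREIGN KEY (match_id) REFERENCES Matches (id)
)
"""


def create_schema(conn):
    """Create both tables on an open DB-API connection (hypothetical helper)."""
    cur = conn.cursor()
    cur.execute(CREATE_MATCHES)
    cur.execute(CREATE_GOALS)
    conn.commit()


if __name__ == "__main__":
    # In-memory SQLite database, used here purely as a stand-in for MySQL.
    connection = sqlite3.connect(":memory:")
    create_schema(connection)
```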
The website displays only the final score of each match, so half-time scores must be derived from the goal times. Similarly, for matches that go to extra time and a penalty shoot-out, the score saved must be the score after normal time.
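The derivation described above could be sketched as a single function that counts goals up to a cut-off minute. The function name `score_at` and the team names in the example are hypothetical. The sketch also assumes that goal minutes are stored as integers, with first-half stoppage-time goals recorded as minute 45 and second-half stoppage-time goals as minute 90; if the website reports them differently (for example as "45+2"), they would need normalising first.

```python
def score_at(goals, home_team, away_team, cutoff_minute):
    """Return (home_goals, away_goals) counting goals up to cutoff_minute.

    goals is an iterable of (team, minute) pairs, mirroring the Goals table.
    """
    goals = list(goals)  # allow generators, since the data is scanned twice
    home = sum(1 for team, minute in goals
               if team == home_team and minute <= cutoff_minute)
    away = sum(1 for team, minute in goals
               if team == away_team and minute <= cutoff_minute)
    return home, away


if __name__ == "__main__":
    # Made-up match that went to extra time (the last goal is after minute 90).
    example_goals = [("Team A", 12), ("Team B", 45), ("Team A", 67), ("Team B", 105)]
    half_time = score_at(example_goals, "Team A", "Team B", 45)
    normal_time = score_at(example_goals, "Team A", "Team B", 90)
    print(half_time, normal_time)
```

Using one function with a cut-off of 45 for half time and 90 for normal time keeps the two rules consistent, and goals from extra time are excluded simply because their minute exceeds 90.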
The intended usage of the scraper is one large initial run to harvest information from 2004 onwards, followed by periodic runs (e.g. weekly) to keep the database up to date.
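Because the website shows one day's results per page, both kinds of run reduce to iterating over a range of dates. A minimal sketch follows, in which the helper name `dates_to_scrape` and the exact start date of 1 January 2004 are assumptions; a periodic run would instead start from the day after the latest date already stored in the database.

```python
from datetime import date, timedelta


def dates_to_scrape(start, end):
    """Yield every date from start to end, inclusive (hypothetical helper)."""
    current = start
    while current <= end:
        yield current
        current += timedelta(days=1)


if __name__ == "__main__":
    # Initial run: everything from the assumed start date up to today.
    initial_run = dates_to_scrape(date(2004, 1, 1), date.today())

    # Periodic run: only the days since the last stored match date.
    last_stored = date.today() - timedelta(days=7)  # placeholder value
    weekly_run = list(dates_to_scrape(last_stored + timedelta(days=1), date.today()))
    print(len(weekly_run))
```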
The scraper is to be written in well-documented Python, and a quick turnaround is expected.