OBJECTIVE - copy old data from web site to spreadsheets. The data describes up to 5,000 training organisations and their academic courses. Some have few Courses, some have hundreds of courses. Some operate from one campus, some from many campuses.
a./ to sort and clean old data and insert new data
b./ to build a series of new databases supporting different applications
The data is contained at [login to view URL]
To see the data organisation and estimate the amount of data to be scraped:
Select SEARCH CRITERIA tab
Select INSTITUTION SEARCH
Select STATE - use any option
Ignore other drop-down fields
Select START SEARCH
The Result in this example is 21 pages, and each ROW also has a URL to drill down to more information.
For example, Select any Institution Name to see the Institution Details tab, Contact tab and the Course Offered tab.
Then, Select the Course Offered tab to see a list of academic courses. Select any Course on the list to see its details, and other Tabs.
The unique organising key is these tables is anything that says CRICOS CODE - the CRICOS PROVIDER CODE and the CRICOS COURSE CODE. All the data is referenced to these codes.
LEGAL: The data itself is not copyright. It is freely available from each of the Training Organisations.
TIMING: no rush for this project, completion within 30-days from starting.
1./ the text information
2./ the HYPERLINKS associated with each data ROW
3./ the Google MAP HYPERLINKS associated with each location address.
4./ the Web Addresses
5./ the Email addresses
Each CRICOS CODE enables the person viewing to see ALL the information for its Training Organisations and also show ALL of its associated Courses.
Over to you to think about the best way to present this data in spreadsheets. For example, thousands of rows with dozens of columns in one huge spreadsheet or the use of smaller spreadsheets for each STATE or some way you know to be convenient and well ordered and easy to handle.
- Google Sheets
I would like to see a sample scrape to prove that the result can be achieved.
Please write back with questions you may have.
63 freelancer chào giá trung bình$151 cho công việc này
Hi, I am Python script developer with 10 years of experience. I can scrape required website by python script/bot with your instructions very short time. Can we discuss please? Thanks.