I need a web scraper written for the following URL:
[url removed, login to view]
All of the information needed is available on the main page. The number of rows will vary.
The output should be a pipe (|) delimited file with the following column mappings:
origin_city --> data located in the "Origin City" column
origin_state --> data located in the "ST" column between "Origin City" and "Destination City"
ship_date --> the date from the "Pickup" column changed to the YYYY-MM-DD format
destination_city --> data located in the "Destination City" column
destination_state --> data located in the "ST" column between "Destination City" and "Payment"
receive_date --> leave blank
trailer_type --> data in the "TRLR" column
load_size --> data in the "F/L" column; if "F" then put "FULL", if "L" then put "PARTIAL"
weight --> data in the "WEIGHT" column, if 0 or blank leave blank
length --> data in the "Length" column, if 0 or blank leave blank
width --> leave blank
height --> leave blank
trip_miles --> data in the "MILES" column
pay_rate --> data in the "Payment" column, if blank then leave it blank
contact_phone --> data located in the "PHONE" column
contact_name --> leave blank
tarp_required --> leave blank
comment --> leave blank
load_number --> leave blank
commodity --> leave blank
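To illustrate the field rules above, here is a rough sketch of the three conversions (ship_date, load_size, and the "0 or blank" rule for weight and length). It is only an example under assumptions: the sub names are made up, the "Pickup" date is assumed to look like M/D/YYYY, and unrecognized "F/L" codes are assumed to be left blank. It uses strict and warnings so it runs with core Perl only; the deliverable itself would use Modern::Perl.

```perl
use strict;
use warnings;

# Assumption: the "Pickup" column looks like M/D/YYYY (e.g. 3/7/2024).
# The real page may use a different format; adjust the pattern if so.
sub to_iso_date {
    my ($raw) = @_;
    return '' unless defined $raw;
    if ($raw =~ m{^\s*(\d{1,2})/(\d{1,2})/(\d{4})\s*$}) {
        return sprintf '%04d-%02d-%02d', $3, $1, $2;
    }
    return '';    # empty or unrecognized date: leave the field blank
}

# "F" becomes FULL and "L" becomes PARTIAL.
# Assumption: any other value is left blank.
sub to_load_size {
    my ($raw) = @_;
    my %map = (F => 'FULL', L => 'PARTIAL');
    my $key = uc(defined $raw ? $raw : '');
    $key =~ s/^\s+|\s+$//g;
    return exists $map{$key} ? $map{$key} : '';
}

# For weight and length: a zero or empty value becomes an empty string,
# anything else is passed through with surrounding whitespace trimmed.
sub blank_if_zero {
    my ($raw) = @_;
    return '' unless defined $raw;
    (my $value = $raw) =~ s/^\s+|\s+$//g;
    return '' if $value eq '';
    (my $digits = $value) =~ s/,//g;    # tolerate thousands separators
    return '' if $digits =~ /^\d+(?:\.\d+)?$/ && $digits == 0;
    return $value;
}
```

For example, to_iso_date('3/7/2024') gives 2024-03-07, to_load_size('L') gives PARTIAL, and blank_if_zero('0') gives an empty string.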
The first line of the output should contain all of the column headers.
Any field that contains no data should be left blank.
Please do not use words like "null" or "blank" in blank columns.
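As a rough illustration of the output rules above, the sketch below builds the header line and one data line, with empty strings for any missing field. The sub names and the sample row are hypothetical, and replacing a literal pipe inside a value with a space is an assumption (the rules above do not cover that case). It uses strict and warnings to stay core-only; the deliverable itself would use Modern::Perl.

```perl
use strict;
use warnings;

# Output columns, in the order listed in the column mappings.
my @COLUMNS = qw(
    origin_city origin_state ship_date destination_city destination_state
    receive_date trailer_type load_size weight length width height
    trip_miles pay_rate contact_phone contact_name tarp_required comment
    load_number commodity
);

# First line of the file: all column headers, pipe delimited.
sub header_line {
    return join '|', @COLUMNS;
}

# Build one output line from a hash of field => value. Missing or undefined
# fields become empty strings, never words like "null" or "blank".
sub format_row {
    my ($row) = @_;
    return join '|', map {
        my $v = defined $row->{$_} ? $row->{$_} : '';
        $v =~ s/^\s+|\s+$//g;
        $v =~ s/\|/ /g;    # assumption: a pipe inside a value becomes a space
        $v;
    } @COLUMNS;
}

# Hypothetical row, for illustration only (not real data from the site).
my %sample = (
    origin_city       => 'Dallas',
    origin_state      => 'TX',
    ship_date         => '2024-03-07',
    destination_city  => 'Tulsa',
    destination_state => 'OK',
    load_size         => 'FULL',
);

print header_line(), "\n";
print format_row(\%sample), "\n";
```

Every line has the same number of delimiters regardless of how many fields are empty, so the file can be split on the pipe character without special cases.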
Below is a sample output of the first 5 columns using sample data:
The deliverable will be a Perl .pl file that must run on
Ubuntu Linux and must use Modern::Perl. The Perl .pl file
should be called '[url removed, login to view]' and the output file should be
called '[url removed, login to view]'
It will be scheduled in cron to run unattended every 15 minutes.
We suggest WWW::Mechanize but you are free to use other Perl libraries.
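For bidders going the WWW::Mechanize route, fetching the page might look roughly like the sketch below. It is only an outline: the sub name, the timeout, and the user-agent string are made-up values, and the URL is taken from the command line. Parsing the table is not shown. It uses strict and warnings here; the deliverable itself would use Modern::Perl.

```perl
use strict;
use warnings;
use WWW::Mechanize;

# Fetch a page and return its decoded HTML. With autocheck on, a failed
# request dies, so an unattended cron run stops with an error on stderr
# instead of carrying on with an empty page.
sub fetch_page {
    my ($url) = @_;
    my $mech = WWW::Mechanize->new(
        autocheck => 1,
        timeout   => 60,                    # assumption: 60 seconds is enough
        agent     => 'load-scraper/0.1',    # hypothetical user-agent string
    );
    $mech->get($url);
    return $mech->content;
}

# Only fetch when a URL is supplied as the first argument.
if (@ARGV) {
    my $html = fetch_page($ARGV[0]);
    printf "fetched %d characters\n", length $html;
}
```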
Please specify which language/OS/tools you will be using in your bid.
Also, please include the word "raccoon" in your bid so I know that
you read this description.