I need a web scraper written for the .xlsx file in the following directory:
[login to view URL]
The latest .xlsx file within that directory will need to be downloaded.
The name of the file is subject to change and will need to be identified by the latest .xlsx extension.
If a row is blank, skip that row.
If the ship_date column has a past date the data does not need scraped, only scrape the data with the current or future dates in the ship_date column.
The output should be a pipe (|) delimited file with the following column mappings:
origin_city --> data located in column "C", if the column contains a comma and data after the comma only add data BEFORE the comma
origin_state --> data located in column "D"
ship_date --> the date from column "A" changed to the YYYY-MM-DD format, if the date is a past date do not scrape that data
destination_city --> data located in column "F", if the column contains a comma and data after the comma only add data BEFORE the comma
destination_state --> data located in column "G"
receive_date --> leave blank
trailer_type --> the abbreviation located in column "B"
load_size --> data located in column "I"
weight --> data located in column "K"
length --> data located in column "J"
width --> leave blank
height --> leave blank
trip_miles --> leave blank
pay_rate --> data located in column "L"
contact_phone --> data located in column "O"
contact_name --> leave blank
tarp_required --> leave blank
comment --> data located in column "P" and column "Q" add the text "Load#" before data in column "Q"
load_number --> leave blank
commodity --> data located in column "M"
The first line of the output should contain all of the column headers.
Any field that contain no data should be left blank.
Please do not use words like "null" or "blank" in blank columns.
Below is a sample output of the first 5 columns using sample data:
The deliverable will be a Perl .pl file that must run on
Ubuntu Linux and must use Modern::Perl. The Perl .pl file
should be called '[login to view URL]' and the output file should be
called '[login to view URL]'
It will be scheduled in cron to run unattended every 15 minutes.
Please specific what language/OS/modules you plan to use.
Also, please include the word "raccoon" in your bid so I know that
you read this description.
Được trao cho:
12 freelancer đang chào giá trung bình $135 cho công việc này
Hi "raccoon", I plan to use Perl LWP/Mechanize for this project. You can run the script thru crontab then. I have developed many web scraping scripts before. You can check my job history for relevant experience