This project is for the completion and improvement to the following script that extracts patent data.
The script was made available by a coder who is no longer available due to illness.
The script is for a 24/7 data extract of the US patent office. The script will run from my host, Extracted data will be inserted into Mysql? DB
The chosen coder must complete this script and insure it mets the following requirements.
1. Option 1 - Script must use random proxies to hide IP address (Coder must provide clear instructions on how to do this and where to get proxies.)
? ? Option 2 - Coder must provide alternative method to prevent blocking or banning
2. Script must query and extract patents in 10 year ranges
Example (<[url removed, login to view]>)
ISD/1/1/1976->1/1/1986 (extract all 758161 patents)
ISD/1/1/1987->12/31/1997 (extract all 1173799 patents)
? ? ?
3. Script must extract and load the following fields into my database.
? ? patent number
? ? patent title
? ? patent inventors
? ? patent issue date
? ? patent assignees
? ? Appl_ No (If available)
? ? Filed (If available)
? ? patent abstract
Since the basic functions of the script has been written this should be a easy project for an experienced coder.
To insure all requirements are understood - Please start all bids or comments with '4U2READ'
Failure to do this will mean auto disqualification from consideration.