Please note: Shortly after I first posted this, the website I mention went down. It is now back up. If for any reason you can't get to [url removed, login to view], I've attached a copy of the site.
This project comprises three parts:
1. Help finish up the extractXML standard as outlined at extractXML.org. Please check out that site before making your bid.
There is already a sample [url removed, login to view] file. What remains is to create a tightly constructed XML schema for this file and to provide a detailed explanation of each element of the schema and how it works. I will want to add a few elements to the schema that are relevant to our site.
2. Create PHP functions to read and parse this file. Create a class and a set of functions/methods in PHP which will do the following: Given a specific URL, look in the closest directory for an [url removed, login to view] file. If none is found, keep moving up the directory hierarchy, checking each level for an [url removed, login to view] file, until reaching the root folder of the domain. The requests for this file need to show up in people's logfiles as "Fusker! extraction bot, extractXML compliant" or similar. If an [url removed, login to view] file is found, it must be checked to make sure it is well formed and validated against the extractXML standard. If it passes, it must then be parsed. The functions should return all of the information in a website's [url removed, login to view] file to our other scripts in an easy-to-manage way. For example, after declaring $website = new extractXML; it should be possible to ask whether $website->allowsExtraction() or $website->requiresLinkback().
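To make part 2 more concrete, here is a minimal sketch of the kind of code I have in mind, not a finished design. Several things in it are assumptions for illustration only: the file name is passed in as a parameter because the real name comes from the standard, the element names allowExtraction and requireLinkback and the "yes" value are hypothetical until the schema from part 1 exists, and the code assumes PHP 7.4 or later with the DOM extension. The method names follow the example above, with the spelling requiresLinkback.

```php
<?php
declare(strict_types=1);

/**
 * List the URLs to try, from the page's own directory up to the domain root.
 * $fileName is a placeholder; the real name is defined by the standard.
 */
function candidatePolicyUrls(string $pageUrl, string $fileName): array
{
    $parts = parse_url($pageUrl);
    if ($parts === false || !isset($parts['scheme'], $parts['host'])) {
        return [];
    }
    $base = $parts['scheme'] . '://' . $parts['host']
          . (isset($parts['port']) ? ':' . $parts['port'] : '');

    $path = $parts['path'] ?? '/';
    // Drop the last path segment (the page itself) to get its directory.
    $dir = substr($path, 0, (int) strrpos($path, '/'));

    $urls = [];
    while (true) {
        $urls[] = $base . $dir . '/' . $fileName;
        if ($dir === '') {
            break; // the domain root has been added; stop here
        }
        $dir = substr($dir, 0, (int) strrpos($dir, '/'));
    }
    return $urls;
}

/** Fetch a URL so that the request is logged under the bot's user agent. */
function fetchWithBotUserAgent(string $url): ?string
{
    $context = stream_context_create([
        'http' => [
            'user_agent' => 'Fusker! extraction bot, extractXML compliant',
            'timeout'    => 5,
        ],
    ]);
    $body = @file_get_contents($url, false, $context);
    return $body === false ? null : $body;
}

/** Wraps one parsed file. Element names below are hypothetical. */
final class extractXML
{
    private DOMDocument $dom;

    public function __construct(string $xml, ?string $schemaPath = null)
    {
        $previous = libxml_use_internal_errors(true);
        $dom = new DOMDocument();
        // loadXML() fails on malformed input; schemaValidate() checks the XSD.
        $ok = $xml !== '' && $dom->loadXML($xml);
        if ($ok && $schemaPath !== null) {
            $ok = $dom->schemaValidate($schemaPath);
        }
        libxml_clear_errors();
        libxml_use_internal_errors($previous);
        if (!$ok) {
            throw new InvalidArgumentException('File is malformed or does not match the schema.');
        }
        $this->dom = $dom;
    }

    /** True when the first element with this tag name contains "yes". */
    private function flag(string $tag): bool
    {
        $nodes = $this->dom->getElementsByTagName($tag);
        if ($nodes->length === 0) {
            return false; // a missing element is treated as "no" in this sketch
        }
        return strtolower(trim($nodes->item(0)->textContent)) === 'yes';
    }

    public function allowsExtraction(): bool
    {
        return $this->flag('allowExtraction');
    }

    public function requiresLinkback(): bool
    {
        return $this->flag('requireLinkback');
    }
}
```

A caller would loop over candidatePolicyUrls(), pass each URL to fetchWithBotUserAgent(), and hand the first non-null body to new extractXML(...). Treating a missing element as "no" is only one possible default; the schema documentation from part 1 should settle that.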
3. Create online forms for webmasters to generate extractXML files
Create two online forms, one simple and one advanced, where webmasters can fill out their information and have a well-formed [url removed, login to view] file generated for them, which they can save and use on their website. These files should be saved to a MySQL database so they can be retrieved by their creators.
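As a rough sketch of the generation step in part 3, the form handler could build the file with PHP's DOM extension, so that special characters in webmaster input are escaped and the output is always well formed. The root element name, the field names, and the table and column names here are all hypothetical placeholders, and the code assumes the form handler has already restricted field names to a fixed list of valid element names.

```php
<?php
declare(strict_types=1);

/**
 * Turn validated form fields into an XML document string.
 * Keys become element names, so they must come from a fixed whitelist.
 */
function buildPolicyXml(array $fields): string
{
    $dom = new DOMDocument('1.0', 'UTF-8');
    $dom->formatOutput = true;
    $root = $dom->appendChild($dom->createElement('extractXML')); // hypothetical root name
    foreach ($fields as $name => $value) {
        $element = $dom->createElement((string) $name);
        // createTextNode() escapes &, < and > when the document is serialized.
        $element->appendChild($dom->createTextNode((string) $value));
        $root->appendChild($element);
    }
    return (string) $dom->saveXML();
}

/** Store a generated file for its creator (table and columns are hypothetical). */
function savePolicy(PDO $pdo, int $ownerId, string $xml): void
{
    $stmt = $pdo->prepare(
        'INSERT INTO extract_files (owner_id, xml_body) VALUES (:owner, :xml)'
    );
    $stmt->execute([':owner' => $ownerId, ':xml' => $xml]);
}
```

The simple and advanced forms could share these two functions and differ only in which fields they collect.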
1. You are free, even encouraged, to use open source/GPL code to help with this project, so long as all sources are clearly documented.
2. Extensive experience working with XML is a requirement for this project, as are good PHP skills.
3. I think this standard should be presented to the World Wide Web Consortium. A bonus will be given for successfully presenting this standard to the W3C.
1) Complete and fully functional working scripts and XML files, delivered as complete source code of all work done. All code thoroughly documented according to jdoc guidelines.
2) All code functioning on our server, working with PHP and MySQL.
3) Exclusive and complete copyrights to work created for part three of this project.
Must work on our UNIX/FreeBSD server using PHP, MySQL, and whatever XML tools/libraries are needed.