You will find an archive of Usenet posting here:
[login to view URL]
There are about 2GB of messages.
The goal of this project is to take these messages and convert them into puffball format. The full description of puffball format can be found here:
[login to view URL]
The content of each message will go in the "content" field, and the username will be formed from the user's email address by replacing the at sign with a dot. Each message will be "signed" with a key generated just for that user. We will provide you with the functions (in Javascript) to sign the content. The most complicated field is "parents", which needs to reference all of the messages that the user is replying to (it is possible that a user has replied to more than one message). However, the way that the archives are structured should make it easy to locate the messages being replied to, given the header information. You should confirm this!
You can write the function that parses the archives and creates the puffs in Python, PHP, Perl, JavaScript (with node), or as a linux shell script. We will want the code you create, as well as the puffs it creates.
Hi there
I can write a PHP script which can take your Usenet archives and convert them to puffball format. I can deliver the output and the script to you. Looking forward to work with you.
Thanks
Rinsad
Hi,
I known i'm not le lowest bid, but i think i can make you a very decent offer.
I watch closely the archives and i see the format evolves around time which means more complex parser that should adapt.
My solution would be perl tech.
Concerning messages the respond to more than one other, i'm quite sure it's not easy.
We can discuss on some sample of how to select parents.
Sincerely yours.
Eric
Hello,
My name is Elias Hamaz and I am a Perl programmer. I can write a script that converts the Usenet Archive to Puffball format.
I am interested in knowing a bit about the project. What are the aims and objectives of your project, and how will the script be used?
Regards,
Elias Hamaz