Đã Hủy

Wikipedia Parser

Schedule of conditions for the technical implementation of the WikiMedia Parser in C#

Introduction

This is a small program parser to make a static dump (HTML output) of Wikipedia which is based on WikiMedia ([url removed, login to view])

The program is designed to extract WikiMedia tags (including the template [url removed, login to view]:Template_messages/All) from text to transform onto html output. The Html must comply with the W3C's HTML specifications.

The parser must be written in C# language. The main class should have an easy to use method for getting the text parsed.

I would like to use the API something like this:

String OrignalText = “wiki text”;

WikimediaParser parser = new WikimediaParser();

String textParsed = [url removed, login to view]( OrignalText);

You can take as a starting point this site:

[url removed, login to view]

Job types

• .NET C#

• Regular expression

• HTML/CSS

Resume

Wikitext language or wiki markup is a markup language that offers a simplified alternative to HTML and is used to write pages in wiki websites.

Wikitext is text in this language.

There is no commonly accepted standard wikitext language. The grammar, structure, features, keywords and so on are dependent on the particular wiki software used on the particular website. For example, all wikitext markup languages have a simple way of hyperlinking to other pages within the site, but there are several different syntax conventions for these links.

Some wiki programs allow extensive optional use of HTML tags within wikitext, others a smaller subset, and still others no HTML at all. Other wiki programs allow the restrictions on HTML to be set by the particular site.

MediaWiki's wikitext allows you to freely mix wiki format and HTML, but it provides a simple, readable syntax that allows users to not even know HTML

Project

Wiki markup

I would like to translate all wiki markup that is on this page:

[url removed, login to view]:How_to_edit_a_page

Wiki template

Wiki markup templates on this page:

[url removed, login to view]:Template_messages/All

I don’t need “User talk namespace”.

Log Message:

I want to use Logger4Net to log each error and accurate debug message when debug message is enabled.

Flexible code

I want flexible code to add future Wiki Markup or Wiki Template. The code must be commented very clearly.

Platform

The API must be run on Windows and with the .NET Framework 1.1 or more. The API must be written with C# language.

Budget

We pay only at the end of the project. Any method payment is accepted ( Paypal, wire, etc…)

Data I/O

• Input

It must be string, text file or xml file.

• Output

The output must be complying with HTML specifications.

Methods

I need 2 methods, you can implement this Interface.

Public interface IWikiParser

{

String Parse( string wikitext);

String Parse( string wikitext, int length);

}

For the second method, be carefully don’t split between two html tags.

Test

You can test all Wikipedia articles with this database dump:

[url removed, login to view]

I give you also smaller files for testing the parser.

Release

The API must be on production release the mid January. But I would like to see every x days a working parser to check the quality of the dump.

Kỹ năng: .NET, XML

Xem thêm: wikipedia parsing, logger4net, wikitext html, wiki xml net, wiki markup html, write xml code website, wiki websites, wikipedia websites

Về Bên Thuê:
( 0 nhận xét ) Boulogne-billancourt, France

Mã Dự Án: #37619

8 freelancer đang chào giá trung bình $1006 cho công việc này

webexpertz

Dear Sir, I am interested in your project. I request you to check your PM. Thanks. Regards, Webexpertz

$1200 USD trong 30 ngày
(8 Đánh Giá)
7.0
websoftinfo

Our bid is for really very high quality work for your Wikipedia Parserthat will be made to be upgradable in case you need some upgrades in future. We will always be available for upgrades. Our bid includes six week fre Thêm

$1500 USD trong 30 ngày
(7 Đánh Giá)
5.9
bruzli2005

I'm very experienced in parsing data. I can deliver this for you on time. Serious Bid.

$800 USD trong 14 ngày
(9 Đánh Giá)
3.2
provatitechno

Dear Sir! We are an efficient and dedicated team of professionals. We offer our large experience and professionalism to make all qualitatively. We provide post-developing support until all Your requirements are complet Thêm

$1000 USD trong 35 ngày
(0 Đánh Giá)
0.0
lambdagroup

Please see PMB. Thanks.

$300 USD trong 15 ngày
(0 Đánh Giá)
4.4
paker

i am a professional in web design graphics design logo design,java ,java script ,php,translation so give me the job and consider it done

$1500 USD trong 10 ngày
(0 Đánh Giá)
0.0
niaterra

Hi, We could easily do this job for you. Please visit our site at niaterradesign.com to get a detailed quote for your needs. It's worth looking into.

$1000 USD trong 10 ngày
(0 Đánh Giá)
0.0
akka

Very interesting project.

$750 USD trong 30 ngày
(0 Đánh Giá)
0.0