In Progress

118201 UTF-8 screen scraper & xml pac

REQUIRED TECHNOLOGIES/EXPERIENCE:

*XML

*XHTML

*Screen scraping

*XML data design

*XML schema

*XML & XHTML parsing and serializing

*UTF-8 and ISO-8859 character encoding and serialization

Screen scraping and data repackaging guru needed to extract HTML from a multi-page, formatted Portuguese-language forum and repackage it as UTF-8 XML files containing XHTML data.
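To make the task concrete, here is a minimal sketch of the repackaging step, using only the Python standard library. Everything specific in it is an assumption for illustration: the forum's real markup is not described in this posting, so the `<div class="post">` selector, the `thread` and `post` element names, and the `n` attribute are hypothetical, and the source pages are assumed to be ISO-8859-1. A real job would more likely lean on an established scraping toolkit, as the requirements below ask.

```python
from html.parser import HTMLParser
import xml.etree.ElementTree as ET

XHTML_NS = "http://www.w3.org/1999/xhtml"


class PostExtractor(HTMLParser):
    """Collects the text of every <div class="post"> (hypothetical markup)."""

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.posts = []    # finished post texts, in page order
        self._depth = 0    # <div> nesting depth inside the current post
        self._chunks = []  # text pieces of the current post

    def handle_starttag(self, tag, attrs):
        if tag != "div":
            return
        if self._depth:
            self._depth += 1  # nested <div> inside a post
        elif "post" in (dict(attrs).get("class") or "").split():
            self._depth = 1
            self._chunks = []

    def handle_endtag(self, tag):
        if tag == "div" and self._depth:
            self._depth -= 1
            if self._depth == 0:
                # Collapse runs of whitespace left over from the HTML layout.
                self.posts.append(" ".join("".join(self._chunks).split()))

    def handle_data(self, data):
        if self._depth:
            self._chunks.append(data)


def repackage(raw_html: bytes, source_encoding: str = "iso-8859-1") -> bytes:
    """Decode one forum page and return a UTF-8 XML document as bytes."""
    parser = PostExtractor()
    parser.feed(raw_html.decode(source_encoding))
    parser.close()

    ET.register_namespace("xhtml", XHTML_NS)
    thread = ET.Element("thread")  # hypothetical wrapper element
    for number, text in enumerate(parser.posts, start=1):
        post = ET.SubElement(thread, "post", {"n": str(number)})
        paragraph = ET.SubElement(post, f"{{{XHTML_NS}}}p")
        paragraph.text = text  # ElementTree escapes &, < and > on output
    return ET.tostring(thread, encoding="utf-8", xml_declaration=True)
```

Calling `repackage(page_bytes)` once per forum page would yield one XML document per page. Merging pages into a single thread file is left out of this sketch.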

REQUIREMENTS:

=============

1) Experience with screen scraping and with both HTML and XML parsing tools. Alternative 1: the person has significant experience using robot software that can crawl a site's HTML, parse it, and repackage the data into XML. Alternative 2: the person has significant experience using standard open source tools that can accomplish this task. I AM NOT INTERESTED IN DEVELOPERS CREATING THEIR OWN TOOL FROM SCRATCH.

2) The person has significant experience with UTF-8 and Latin (ISO-8859) character encoding, and with entity escaping in XML and text.
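The encoding requirement in item 2 can be illustrated with a short sketch. It assumes the source text is ISO-8859-1 and contains HTML named entities such as `&ccedil;`, which are not defined in plain XML. The function name is hypothetical. The idea is to decode the bytes, turn every HTML entity into its real character, re-escape only the characters XML treats as markup, and then encode as UTF-8.

```python
import html
from xml.sax.saxutils import escape


def latin1_html_text_to_utf8_xml(raw: bytes, source_encoding: str = "iso-8859-1") -> bytes:
    """Convert Latin-encoded HTML text into UTF-8 bytes safe for XML text content."""
    text = raw.decode(source_encoding)  # bytes -> str
    text = html.unescape(text)          # &ccedil; -> ç, &amp; -> &
    text = escape(text)                 # escapes only &, < and >
    return text.encode("utf-8")         # accented letters become UTF-8 bytes
```

Unescaping before escaping avoids double-escaped output such as `&amp;ccedil;`, a common symptom when HTML text is copied straight into XML.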

PREFERENCE:

=============

Brazilians, or at least someone who speaks European Portuguese. All of the content is in Portuguese.

Skills: Anything Goes, HTML, XML


About the Employer:
( 0 reviews ) NYC, United States

Project ID: #1864369