Đang Thực Hiện

Extract Sentences from HTML using the R framework

Qualifications

- You need to be very fit with the R framework

- Have understanding of text mining

- The project has to be done with R (not PHP or another programming language)

Project´s goal

- Read static HTML files

- Extract Meta title of HTML

- Remove HTML, just keep plain text

- Search in plain text for given keywords/searchwords

- Extract the sentence where the keyword occurs

- Extract the sentence before

- Extract the sentence after

- Build text out of this 3 sentences

Exampel of output

- Title of HTML file

- Introtext

- Keyword #1 with 3 sentences (before, the one with the keyword and after)

- Keyword #2 with 3 sentences (before, the one with the keyword and after)

- Keyword #3 with 3 sentences (before, the one with the keyword and after)

Kỹ năng: Kiến trúc phần mềm

Xem thêm: extract sentences, html framework, r architecture, programming html, html programming software, html programming file, framework programming, software framework, r software, r s, programming r, html programming, extract, c r, static html framework, extract php, need sentences, php framework html, output using, static html project, plain html, extract text php files, text mining project, keywords html, html plain

Về Bên Thuê:
( 0 nhận xét ) Vienna, Austria

Mã Dự Án: #1717962

Đã trao cho:

danilonqueiroz

Hi, I have over 2 years of experience in R language and Machine Learning, development and implementation of academic projects on clustering analysis and data mining.

$275 USD trong 5 ngày
(0 Đánh Giá)
0.0