Text Corpus software

$300-1000 USD

Closed

Posted more than 18 years ago

Paid on delivery

The goal is to build a database of lexical words from a corpus of texts divided into text types, and to rate each word by the probability of its occurrence within each text type. This involves storing text files, counting each lexical word within the overall corpus and within each text type (novel, newspaper article, etc.), and then either stating which text type the word is most likely to occur in or, alternatively, giving the probability of its occurrence in each text type. The corpus will initially be about 10 million words but will grow to about 100 million, so only very fast programs will be useful. Results need to be represented numerically and, if possible, graphically. Programmers need to be highly numerate, preferably with a reasonable (intermediate-level) knowledge of statistics.

Phase 2: the user inputs a text and the text is 'typed' according to the probability of each word in the text occurring in a particular text type.

Phase 3: other classifications will eventually be important, e.g. age, gender and social background of the author of the text. We will eventually need to profile each corpus author and thereby rate the profile of the author of the user's text on the basis of the corpus.

Ideally the program would be both web-based and runnable on a user's PC. You will need an FTP address where I can upload the corpus of texts.
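To make the counting and 'typing' steps described above concrete, here is a minimal Python sketch. It is only an illustration under several assumptions that are not part of the brief: the class name `CorpusStats`, the regex tokenizer, add-one smoothing, and equal weighting of text types are all hypothetical choices, and no filtering of non-lexical (function) words is attempted. It keeps everything in memory, so it says nothing about how a 100-million-word corpus should be stored.

```python
import math
import re
from collections import Counter, defaultdict

# Assumed tokenizer: lowercase alphabetic tokens. A real system would also
# need to drop function words so that only lexical words are counted.
TOKEN_RE = re.compile(r"[a-z]+(?:'[a-z]+)?")


def tokenize(text):
    return TOKEN_RE.findall(text.lower())


class CorpusStats:
    """Per-text-type word counts (hypothetical helper, not a finished design)."""

    def __init__(self):
        self.counts = defaultdict(Counter)  # text type -> word -> count
        self.totals = Counter()             # text type -> total tokens
        self.vocab = set()                  # every word seen in the corpus

    def add_text(self, text_type, text):
        tokens = tokenize(text)
        self.counts[text_type].update(tokens)
        self.totals[text_type] += len(tokens)
        self.vocab.update(tokens)

    def corpus_count(self, word):
        """Count of a word in the overall corpus."""
        return sum(c[word] for c in self.counts.values())

    def word_given_type(self, word, text_type, alpha=1.0):
        """Smoothed relative frequency of a word within one text type."""
        num = self.counts[text_type][word] + alpha
        den = self.totals[text_type] + alpha * len(self.vocab)
        return num / den

    def type_given_word(self, word):
        """Probability of each text type for a word, assuming equal priors."""
        rates = {t: self.word_given_type(word, t) for t in self.totals}
        z = sum(rates.values())
        return {t: r / z for t, r in rates.items()}

    def classify(self, text):
        """Phase 2 sketch: sum log-probabilities of the words per text type."""
        tokens = tokenize(text)
        scores = {
            t: sum(math.log(self.word_given_type(w, t)) for w in tokens)
            for t in self.totals
        }
        return max(scores, key=scores.get), scores


if __name__ == "__main__":
    stats = CorpusStats()
    stats.add_text("novel", "She sighed and gazed at the moon")
    stats.add_text("news", "The minister announced the budget today")
    print(stats.type_given_word("moon"))
    print(stats.classify("the budget announced")[0])
```

Normalising the per-type relative frequencies, rather than the raw counts, keeps a large text type from dominating simply because it contains more words. Whether equal priors are appropriate depends on how the corpus is balanced.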

Project ID: 22451

About the project

5 proposals

Remote project

Active 17 years ago

5 freelancers are bidding an average of $560 USD for this job

@nidle

Dear Sirs, we are extremely interested in the project you have proposed. We are an IT company specializing in web technologies and programming. Our specialists are ready to start working on the required software straight away. Our managers will keep you informed of the project's progress until the work is completed. We are available via any means of communication. At the moment our analysts have a number of questions that need to be clarified before we can outline a plan and make a detailed proposal. Please contact us via PMB for further discussion, after which we will provide you with an accurate calculation. Looking forward to your reply, Nidle Inc.

$1,000 USD in 45 days

4.6

(7 reviews)

1.6

@gawab

I will be glad to work with you.

$300 USD in 10 days

0.5

(1 review)

0.0

@donghuayi

I have a master's degree in computational linguistics from the University of Southern California. I have done several research projects just like yours; not only were we able to tag words with syntactic features, but we could also do semantic, 2 n relations, 3 n relations probability. Dr. Kevin Knight, who is the most famous professor of statistical linguistics, was my advisor. Your project fits right into my bucket; please let me know when you are ready to proceed further.

$500 USD in 10 days

0.0

(0 reviews)

0.0

About the client

Shrewsbury, United Kingdom

0.0

Member since Jul 24, 2005

Client verification
