Đã Đóng

Looking for someone who is good in Data Science and vector representation

GloVe that relies on a different algorithmic principle but also yields vector representations of words.

Within the text file inside the archive, each line contains a word followed by a space and then a series of floating point numbers (also space-separated). The floating point numbers for a word (300 in total) constitute the word vector representation in a 300-dimensional word vector space.

Please write code to achieve the following tasks and report the results. Do

not use any libraries for the nearest neighbor computation, but instead write

your own code for this. You may use any programming language (it is easy

to store and manipulate a 300-dimensional array of floating point numbers

in almost any programming language).

--> Task 1

Determine the 5 nearest neighbours of your first name in terms of the cosine

similarity measure, along with the respective cosine similarity scores. For

each neighbour, list the word/name, not the vector.

Note that you may need to lower-case your name to find it (e.g. “nicole”

instead of “Nicole”). If (and only if) your first name is genuinely not covered

by the word vector data, then report this fact and use the first name of a

celebrity instead.

--> Task 2

Write code to create a vector representation for an entire sentence simply

by taking the average of all word vectors for words in that sentence. This

involves 1) tokenizing a sentence, i.e., splitting it into words, for which you

may use a very na¨ıve and imperfect method. Then 2) look up the word

vectors for those tokens. Make sure to apply lower-casing if necessary. You

may ignore tokens that are not covered by the vocabulary of the word vectors.

Finally, 3) take the average, i.e. compute the component-wise sum of the

word vectors, and then divide each component by the number of words in

the sentence that were covered by the data.

Next, choose a random sentence S0 and compute the vector representation

of that sentence using the above method. List the nearest neighbour words to

that sentence vector (i.e., determine which words in the data have a similar

vector representation to the vector for the sentence).

--> Task 3

Choose two other sentences S1 and S2 such that S1 is similar in meaning

to S0, and S2 is dissimilar in meaning to S0. Create the sentence vectors

using the method from Task 2, and report the cosine similarities between the

vectors for S0 and S1, and between the vectors for S0 and S2.

Explain whether the obtained cosine similarity scores are reasonable and

give a brief explanation of why or why not.

Kĩ năng: Thuật toán, Khoa học dữ liệu, Machine Learning, Python, Vector

Xem nhiều hơn: good book for data structures and algorithms in java, freelancing in data science, employer looking for data entry company in dubai, business looking for data entry in newmarket ontario, binary tree representation in data structure, jobs in data science, we are in usa looking for data entry services in indian company, looking for data entry project in uk, looking for data entry operator in dwarka new delhi, looking for data entry offices in el paso texas, looking for data entry jobs in abu dhabi, looking for a good set up apointmant worker in usa, data science in r, looking good data entry, good looking powerpoint data tables, looking offshore data encoding site, good data division set, looking good welcome pack design, good data entry speed, looking good writer

Về Bên Thuê:
( 10 nhận xét ) Piscataway, United States

ID dự án: #16766414

6 freelancer đang chào giá trung bình $172 cho công việc này


---Very Nice Job. Professional Data science & AI & Machine Learning expert. Best result in time----- [login to view URL] I read your description very carefully. I am very interesting for your project because I have rich ex Thêm

$250 USD trong 3 ngày
(72 Nhận xét)

Have worked with vector representations like word2vec, glove, fasttext etc. before for NLP project. Thanks for the complete description, would be able to do this task within 3 days. Looking forward to hearing back from Thêm

$180 USD trong 3 ngày
(48 Nhận xét)

Hello there, my name is Daniel and I would love to help you out with this project. I am very familiar with machine learning algorithms and NLP, so I have worked in the past with word vectors plenty of times before. I h Thêm

$210 USD trong 5 ngày
(26 Nhận xét)

Greetings of the day! I am the best fit to your requested requirement. I can help you in Data Science and vector representation. My Expertise & Experience: I am skilled in Algorithm, Data Science, Machine Le Thêm

$250 USD trong 3 ngày
(33 Nhận xét)
$30 USD trong 0 ngày
(1 Nhận xét)

Hi, I am good at machine learning and data science and I am proficient in Python. I have a research degree in m/c learning from IIT Madras. Please consider and have a good day. Regards

$111 USD trong 3 ngày
(0 Nhận xét)