Create a TF-IDF matrix and a training data set
$30-250 USD
Closed
Posted more than 8 years ago
Paid on delivery
1) Create a data set of 10 emails: 5 about sad news and 5 congratulatory notes.
2) For each email, create a document vector containing no more than 12 important terms.
3) Compute the TF-IDF value for each term.
4) Write a program that takes 12 terms as test input and computes the cosine similarity between the test input and each of the 10 document (email) vectors. If the document with the highest similarity is a “sad” email, the program outputs “sad”; otherwise it outputs “congratulatory”.
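The four steps above could be sketched roughly as follows in Python, using only the standard library. This is a minimal illustration, not a finished solution: the ten term lists in EMAILS are made-up placeholder data, the function names are hypothetical, and the weighting used (relative term frequency times log(N / document frequency)) is one common TF-IDF variant among several.

```python
import math
from collections import Counter

# Hypothetical toy data: 10 emails, each already reduced to at most 12 terms.
EMAILS = [
    ("sad", ["sorry", "loss", "condolences", "passed", "away", "family", "grief"]),
    ("sad", ["sad", "news", "funeral", "passed", "away", "sympathy"]),
    ("sad", ["regret", "inform", "accident", "hospital", "sorry", "prayers"]),
    ("sad", ["layoffs", "sad", "news", "regret", "closing", "sorry"]),
    ("sad", ["condolences", "sympathy", "loss", "mourning", "thoughts", "prayers"]),
    ("congratulatory", ["congratulations", "promotion", "well", "deserved", "proud", "success"]),
    ("congratulatory", ["congratulations", "wedding", "happy", "wishes", "joy", "celebrate"]),
    ("congratulatory", ["congratulations", "graduation", "proud", "achievement", "future", "bright"]),
    ("congratulatory", ["happy", "news", "baby", "congratulations", "joy", "family"]),
    ("congratulatory", ["award", "winner", "congratulations", "celebrate", "achievement", "success"]),
]


def idf_table(docs):
    """Inverse document frequency, log(N / df), for every term in the corpus."""
    n = len(docs)
    df = Counter(term for doc in docs for term in set(doc))
    return {term: math.log(n / count) for term, count in df.items()}


def tfidf_vector(terms, idf):
    """Sparse TF-IDF vector as a dict; terms unseen in the corpus are ignored."""
    tf = Counter(terms)
    total = len(terms)
    return {t: (c / total) * idf[t] for t, c in tf.items() if t in idf}


def cosine(a, b):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def classify(test_terms, emails=EMAILS):
    """Return the label rule from step 4 for the most similar email."""
    docs = [terms for _, terms in emails]
    idf = idf_table(docs)
    vectors = [tfidf_vector(doc, idf) for doc in docs]
    query = tfidf_vector(test_terms, idf)
    scores = [cosine(query, vec) for vec in vectors]
    # If every score is 0, the first email wins the tie.
    best = max(range(len(scores)), key=scores.__getitem__)
    return "sad" if emails[best][0] == "sad" else "congratulatory"


SAD_QUERY = ["sorry", "loss", "condolences", "funeral", "grief", "sympathy",
             "passed", "away", "mourning", "prayers", "regret", "sad"]
HAPPY_QUERY = ["congratulations", "promotion", "proud", "celebrate", "joy", "wedding",
               "happy", "wishes", "success", "award", "achievement", "graduation"]

print(classify(SAD_QUERY))    # sad
print(classify(HAPPY_QUERY))  # congratulatory
```

Storing each vector as a dict keeps the sketch short; with a larger vocabulary, a dense matrix with one column per term would be the more usual layout for a "TF-IDF matrix".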
Hi,
I read your description very carefully, and you have explained clearly what you want. I am familiar with Information Retrieval, and this task falls under the vector space model in that field. I can assure you that I can deliver the solution within the specified budget and time. I will use Python or R (let me know if you have a preference) for this problem. I am looking forward to working with you.
Cheers,
I believe I can do the above task, and it should also be possible in MATLAB. I am an expert in MATLAB and would be glad to take on this project. Thanks for posting.