Hi,
We'd like to find someone who can either give us a proven Psaudo-code, or better off - send us a working code in C# that does the following:
1. The code gets a document (preferably HTML, but not a must).
2. The code finds the best matching document, from a list of pre-defined documents (possibly that the calculation had already ran through).
3. It returns a list of N documents ordered by best match first.
For example -
If I read this article on Engadget - [login to view URL]
Which is about the Toshiba Thrive android tablet, then the "best match" documents would be the ones that are also talking about the same topic, and after them in order documents that talk about generic/other android tablets, then after that possibly documents about either tablets in general (iPad?) or Android in general and so on and so forth.
Possible places to look at are opencalais, td-idf, etc...
There are many algorithms out there, so we need something that will do the job well.
The test of the algorithm should be by taking ANY document, and finding the best matched from a pre defined list of other documents.
Once this is done, and we have an algorithm we can try, we will need to open another bid, to implement the algorithm on an existing C# based system (Like SharePoint) with many medical documents on it (but the algorithm must not use the fact that it's medical data, at least, not at first).
I have done similar tasks in past and I have an idea that I would like to try out. Could you send me some sample documents, so I could send results back to you?
$350 USD in 10 days
5.0 (1 review)
1.8
1.8
13 freelancers are bidding on average $412 USD for this job
Dear sir,
I am strong in programming especially in algorithm implementation. I am strong in information retrieval and NLP. I am familiar with Boolean model, Vector model and text similarity measurement such as Jaccard coefficient, Cosine and other ones. I have studied many papers and I have implemented many relevant algorithms. I can do the project with high quality.
Wait for your response
Thank you
BR
I am eligible for this project because I was working on Text Classification from Last one Years.
I know all of the aspects of these kinds of problem and I have worked lot on TF-IDF, SVM and Document Clustering.
Hello,
10+ years exp full time freelancer here, can deliver a quality & professional work in the timeframe posted. Please contact me via PM for any question, Thanks.