Em Andamento

Algorithm to match short text strings (equivalent product names from disparate sources)

We have an existing MS SQL database which is populated with products retrieved from various sources (portals).

The various portals usually have slightly different names for the products.

E.G

Portal 1:

PortalProduct 1: CocaCola

PortalProduct 2: Manchester United

PortalProduct 3: Maplin Electronics

Portal 2:

PortalProduct 1: Coca-Cola

PortalProduct 2: Man Utd

PortalProduct 3: Maplin Elec Ltd.

- We need to correlate these PortalProducts to a central list of Products.

- One Product can have 1-to-many PortalProducts

- The database is populated with around 10,000 PortalProducts

- The database is not populated with any Products. Products need to be added for each new PortalProduct that doesn't match an existing Product.

This must be implemented as a C# console application so that a) we can see the results when the correlate is run and b) so that we can ultimately run it on a task-schedule.

The developer must develop a text matching algorithm to correlate PortalProduct names with Product names (and create a product where there is no match). The emphasis is on the accuracy of the text matching algorithm. Speed is less important than [url removed, login to view] developer should have a mathematical mind and preferably has experience in developing mathematical text matching algorithms.

Examples of mathematical text matching algorithms that could be used (and potentially refined and combined) to meet our requirements are:

String metric

Locality-sensitive hashing

Needleman–Wunsch algorithm

Smith–Waterman algorithm

Levenshtein distance

Concept Search

Approximate matching with addition of regular expressions ability

Regular expressions for non-fuzzy (exact) matching

Metaphone

Soundex

Agrep

Plagiarism detection

[url removed, login to view]

Ideally the developer will already understand and have used some of these before and able to pick the best algorithm(s) for this application.

Habilidades: Algoritmo, Programação C#, Matemática

Ver mais: t sql programming, text to string, text string search, text search algorithms, text search algorithm, text matching algorithm, text algorithms, strings in c programming, string search algorithm c, string search algorithm, string matching in c, string matching algorithms, string matching algorithm, string match algorithm, string match, string algorithms, string algorithm, sharp-electronics, search text string, search string examples, search algorithms in c, search algorithm in c, search algorithm examples, regular expressions list, regular expressions in c

Acerca do Empregador:
( 2 comentários ) Altunizade, Turkey

ID do Projeto: #10964138

16 freelancers are bidding on average £182 for this job

malballah

Dear hiring manager, I am a full stack software developer with 12+ years experience with .net framework, 6+ years experience with asp.net mvc, 8+ years experience with sql server, and 12+ years total development experi Mais

£250 GBP in 15 dias
(36 Comentários)
6.8
abhijitbuet

easy task for me :)

£133 GBP em 1 dia
(117 Comentários)
6.1
solutionEngrn

Hi there, You have a Challenge, We Create the Solution Solution Engineering is a Team of quality and solution oriented engineering professionals across various engineering disciplines that have tackled challenges lik Mais

£244 GBP in 4 dias
(2 Comentários)
5.3
£252 GBP in 3 dias
(23 Comentários)
5.8
sflogics

Hi, We are small team of 4 developers with 5+ years of .Net experience and have happy old clients. One from UK from about last 4 years in Bedford. We never had excuses; even not asked to clients to pay extra money. Mais

£200 GBP in 3 dias
(15 Comentários)
4.9
SharjeelSohail

hi. im a software development engineer with excellent algorithm design skills. i can do this work for you

£100 GBP in 5 dias
(60 Comentários)
5.0
£150 GBP in 5 dias
(16 Comentários)
5.0
AbanoubLabib92

I'm C# developer. I worked in Microsoft in AppFactory program for developing Windows 8 and Windows Phone Apps. I got here many Desktop and WPF projects. You can check my profile and if you accepted, just let me know to Mais

£150 GBP in 15 dias
(11 Comentários)
4.7
hildebrandofc

Will provide documentation and solid, robust, easy-to-understand, fully-commented code. Will provide support and maintenance for three months after approval. Thanks.

£222 GBP in 15 dias
(1 Comentário)
4.2
Alethor

Hi! I would like to help you with this project. I have some questions, so, if you are interested, please let's talk BEFORE closing anything. One everything is set and understood, then we will be able to work toge Mais

£150 GBP in 3 dias
(20 Comentários)
3.9
pakaimsolutions

Hello, I hope you doing very well! I have gone through the description and would like to provide a quality solution using my 5+ years of professional experience in required skills. (.NET/ASP.NET - MVC, EF, SQL, C#). Mais

£122 GBP in 3 dias
(11 Comentários)
4.0
joy151280

Hi! Thanks for posting this requirement. I am experts with design and development works. I have developed several websites in ASP.net vb. I have done research in .net I have also made several research tool in vb.net. I Mais

£177 GBP in 3 dias
(2 Comentários)
3.1
aptriangle

First, when using an algorithm, instead of a list that contains phrase pairs considered equivalent, there's no way to be completely certain that a match is correct. Any match would have a confidence level. Your first Mais

£150 GBP in 5 dias
(2 Comentários)
2.0
sammy2889

Our core competency lies in complete end-end development cum management of a new website development project, and we are seeking opportunities to build Websites/Applications from the ground up for you or your business. Mais

£200 GBP in 5 dias
(0 Comentários)
0.0
fifthjourney

I have used Copyleaks and Fuzzystring in a plagiarism checking software for library for a college. Im sure it will be more than enough for this purpose.

£135 GBP in 5 dias
(0 Comentários)
0.0
Mazookr

I wrote something similar for the BBC and MACAT.com. Probably do it in less than 5 days, but good to have some leeway, I have used those exact algos and much more such as ML(wont be needed) I work as a Tech lead Mais

£277 GBP in 5 dias
(0 Comentários)
0.0