Encerrado

Development of WebCrawler / bot for job offers

The outcome of the development project hast o be a WebCrawler for websites (mainly company sites) containing job offers which allows the extraction of content and stores it in a database.

Extraction of content

The content of the websites containing the job offers (job description) has to be extracted in a text format and exported in a mysql database whereas the format should be “onlinedate, offlinedate, id, category, url, jobtitle, jobdescription, e-mail (if existing), phone (if existing), contactperson (if existing)”. The software also has to be able to crawl job portals like [url removed, login to view], [url removed, login to view], [url removed, login to view],...

Example Pages:

[url removed, login to view]://[url removed, login to view]://[url removed, login to view]

[url removed, login to view]

[url removed, login to view]

Note:

It has to be considered that on some of the company websites you need a search engine in order to get to the jobs.

Validity of jobs

The software also has to verify if jobs are taken offline by a company and are not online anymore. The jobs than has to be marked offline in the database with the corresponding date the job has been taken offline (date of crawling process where the software has identified that the job is not online anymore).

Definition / Configuration of URLs

The configuration of the software must allow, that the URLs of the websites which have to crawled including SUB-URLs can be defined. The URLs are mainly company webpages whereas the software has to identify on which pages job offers are published.

Please do not bid on the project if you are not familiar with the technology of web crawlers.

Configuration of Keywords

Besides the URLs also keywords can be defined whereas a website will only be extracted if predefined keywords are on the site. Keywords can be grouped in a category so that a job which has been found can be put in a certain job category (e.g. consulting, banking,…).

Progress and statistics

A small progress and statistic module of the software always has to show the progress of a crawling process as well as the result (websites crawled, new jobs found, jobs updated, jobs taken offline, websites a job was found). The data has to be provided for each new crawling process.

Error Messages

In order to identify problems with a crawling process an error log has to be written which allows the identification of the problems which have occurred during a crawling process.

Dublicate Check

The software has to identify if job offers are posted on different sites. For example on a company home page as well as on a job portal. This information has to be stored in the database.

Performance

The WebCrawler must ensure a fast crawling process.

Documentation

The software has to be documented in a proper way (functions, methos, etc.) so that a third party is able to understand it.

Payment

The payment amount will be put in an escrow agreement. After the software has been finished, delivered and quality assured the escrow payment will be released.

Habilidades: .NET, Programação C, Processamento de dados, Java, Visual Basic

Ver mais: www monster jobs . com, www monster jobs, www monster com, www monster, www job search com, www job search, www escrow com is it a proper company, written jobs, where you search order for software development, well published, websites search development, websites problems, websites for online jobs, websites development company, website for online development with html, web site development process, website development online company, website development offers, website development jobs online, web search jobs online, web project documentation format, web portal development software, web page for software development company, web page development sites, web page development online

Acerca do Empregador:
( 4 comentários ) Hamburg, Germany

ID do Projeto: #437267

17 freelancers estão ofertando em média $1182 para este trabalho

interpb

Dear friend please kind view PMB for more details thanks

$1000 USD in 15 dias
(53 Comentários)
6.0
aruhat

Hi, Thanks for giving an opportunity to place a [url removed, login to view] kindly go through PM. Regards, Bhavik

$1500 USD in 20 dias
(8 Comentários)
6.0
rsdsoft

I done a lot of similar projects. Please check PMB for demo.

$1350 USD in 30 dias
(7 Comentários)
5.9
bgdnnet

fake bid for PMB

$750 USD em 1 dia
(25 Comentários)
4.9
AstinSoftech

Hi, Greeting from Astinsoftech.!!! We are ISO 9001:2000 Certified Software Development Company I have studied your requirements in detail and based on our assessment I am very confident that we will execute t Mais

$1100 USD in 15 dias
(3 Comentários)
4.4
dziban

We are twelve team members who have professionally mastered the skills of Web Design and Development. We have provided excellent quality service within our clients’ budget to their utmost satisfaction. We offers an Mais

$750 USD in 35 dias
(2 Comentários)
3.5
ManiksSoftware

We possess extensive experience of developing numerous high-end websites and are highly organized and adept at meeting tight deadlines that are so common in this industry. Please see PMB for more details.

$1500 USD in 30 dias
(1 Comentário)
3.7
mirod

Please, check PMB

$1500 USD in 0 dias
(2 Comentários)
2.9
silv3rm00n

can do [url removed, login to view]

$1200 USD in 40 dias
(2 Comentários)
2.0
nrupayshah

In your project i have to work on java for better security purpose of database so i bidding $ 1050.

$1050 USD in 15 dias
(1 Comentário)
0.0
hardik81

Hi, I am system analyst and have 5+ years of experience in .NET technology. I worked on Windows application, Windows service and Web application. I also worked on WCF and WF. I am really intrested in this. I hope, Mais

$1000 USD in 30 dias
(0 Comentários)
0.0
sayanDotNet1

Please check PMB

$751 USD in 30 dias
(0 Comentários)
0.0
nsoftsolutions

Plz check your PMB for further details..

$1250 USD in 25 dias
(0 Comentários)
0.0
kkiran33

Hi,Please check PM. Thanks.

$1200 USD in 30 dias
(0 Comentários)
0.0
thucnh

Hi, I have experience of working on WebCrawler bot. I have been working for a WebCrawer boot which download data from hundreds of site. Please see my PM. Thanks, Thuc

$1500 USD in 20 dias
(0 Comentários)
0.0
blueskybdz

HI,Check PM please.

$1200 USD in 20 dias
(0 Comentários)
0.0
Ashwinipatil

Hello, We are Software Development Company from India provides IT services, having a great experience in development and designing of complex solutions. We have well trained and fully experienced developers and prog Mais

$1500 USD in 35 dias
(0 Comentários)
0.0