Data collection script or program for Linux

$30-250 USD

Cancelled
Posted over 15 years ago

Paid on delivery
I need to harvest data from a web site on a weekly basis and need a program to do the work. I currently do it with a program I wrote that runs under Windows, but I need something more robust so that I can run it more often, and it must run on Linux (Slackware 2.6.2x).

The project is simple: download some HTML files, extract the data in them, and write it to a standard delimited ASCII file. The program will query a web site to get the HTML pages that are available. There are about 100 per calendar date, and the program must be able to download the files starting with the current date and going to an end date. This means downloading between 35,000 and 100,000 HTML files each time the program runs. The program will get an HTML file, parse it, and extract the data contained in it, then get the next one, and so on.

The HTML files keep the same format and are easy to extract, as they are simple lists. ALL fields must be extracted. The lists have header information, so the header information must be repeated on each record of data created to ensure that the information stays together. For example, a file will give a name and then a list of all the clients for that name. All fields from all HTML files that are downloaded should be included on each record. The variations between the 3 types of HTML files in this project are minor, so each record line may have about 30 fields, some used and some unused. A field is simply left blank if it is unused.

The final output should be sent to standard output so that it can be redirected to a file, etc. The extracted data must be in standard ASCII format, with each field delimited by a character to be specified on the command line. Example: getfiles <start_date> <end_date> <delimiter> > [login to view URL] The default delimiter, if none is specified on the command line, should be the pipe symbol | (ASCII decimal 124).

I do not need a program with a fancy user interface. It needs to be simple and functional. It will run at the command prompt with parameters, just like all standard Linux commands and scripts. The program can be written in a Linux scripting language or PHP; either is OK with me. Attached is a ZIP file containing samples of the HTML files as well as the web site information.

The provider must submit a final script with all necessary files, and also the source code, if any. The provider must have tested the program and must submit the results for one run between two dates spanning one year (for example, the provider can run the program on February 1st, 2009 with an end date of February 1st, 2010). The data collected must be submitted to show that the program works.

This is a simple project, but please only bid if you have done this type of work before and are sure you can deliver the work. I do not want to waste your time or mine. Payment for this project will be made via escrow only.
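
For bidders who want to check their reading of the requirements, the command-line behaviour described above could be sketched roughly as follows. This is only an illustration in Python (the posting itself asks for a Linux script or PHP), and several things in it are assumptions that do not come from the posting: the YYYY-MM-DD date format, the field names in `HEADER_FIELDS` and `DETAIL_FIELDS`, and the `fetch_and_parse` placeholder, which stands in for the download and HTML extraction because the site address and page layout are only in the attached ZIP.

```python
"""Sketch of the getfiles command line: date range in, delimited records out."""
from datetime import date, timedelta

DEFAULT_DELIMITER = "|"  # ASCII decimal 124, the default named in the posting

# Hypothetical column layout; the real one would come from the sample HTML files.
HEADER_FIELDS = ["name"]
DETAIL_FIELDS = ["client", "amount"]


def parse_args(argv):
    """Return (start, end, delimiter) for: getfiles <start_date> <end_date> [delimiter]."""
    if len(argv) not in (2, 3):
        raise SystemExit("usage: getfiles <start_date> <end_date> [delimiter]")
    # Assumption: dates are given as YYYY-MM-DD; the posting names no format.
    start = date.fromisoformat(argv[0])
    end = date.fromisoformat(argv[1])
    delimiter = argv[2] if len(argv) == 3 else DEFAULT_DELIMITER
    return start, end, delimiter


def dates_between(start, end):
    """Yield every calendar date from start to end, inclusive."""
    day = start
    while day <= end:
        yield day
        day += timedelta(days=1)


def flatten(header, rows, delimiter):
    """Repeat the header fields on every detail row; unused fields stay blank."""
    lines = []
    for row in rows:
        values = [header.get(field, "") for field in HEADER_FIELDS]
        values += [row.get(field, "") for field in DETAIL_FIELDS]
        lines.append(delimiter.join(values))
    return lines


def fetch_and_parse(day):
    """Placeholder for the download and extraction of one day's pages.

    A real version would return a list of (header, rows) pairs for that date.
    """
    return []


def main(argv):
    """Write one delimited line per record to standard output."""
    start, end, delimiter = parse_args(argv)
    for day in dates_between(start, end):
        for header, rows in fetch_and_parse(day):
            for line in flatten(header, rows, delimiter):
                print(line)
```

Calling `main(sys.argv[1:])` at the bottom of the file (after `import sys`) would turn this into a script whose output can be redirected to a file. Because `flatten` copies the header values onto every row, each output line is self-contained, which is what keeps a name together with each of its clients.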
Project ID: 378128

About the project

11 bids
Remote project
Active 15 years ago

11 freelancers are bidding an average of $175 USD for this job

We can help with your project; please check PMB to see our related experience.
$250 USD in 3 days
4.8 (219 reviews)
7.7

Hello, please look at the PMB. Regards, Sergey
$200 USD in 1 day
4.9 (19 reviews)
6.3

I've already completed a few such projects on GAF, so I'll be glad to help you too.
$50 USD in 1 day
5.0 (26 reviews)
6.0

Please check PM. Thanks.
$160 USD in 4 days
5.0 (22 reviews)
5.8

I can do this job for you. See PM for details.
$120 USD in 2 days
4.9 (103 reviews)
5.7

Hi, this can be done using web scraping. Thanks.
$250 USD in 0 days
5.0 (6 reviews)
2.6

Please check PMB, thanks.
$80 USD in 0 days
4.0 (1 review)
2.6

I can do this job for you.
$240 USD in 2 days
0.0 (0 reviews)
0.0

Sounds like the job Perl was designed for. (Sounds like fun, actually.)
$180 USD in 5 days
0.0 (0 reviews)
0.0

I'll do it for you.
$150 USD in 5 days
0.0 (0 reviews)
3.9

About the client

Montreal, Canada
5.0
25
Payment method verified
Member since Oct. 23, 2008
