Scrape Data from 5 Different Websites (Need to Learn Scraping Data in PHP and Python)

$10-30 AUD

Closed
Posted about 8 years ago

Paid on delivery
Basically, my father has to go to 5 different websites to watch his online videos. I want to be able to grab any information from any part of a website and, more importantly, grab an array of all the items, then run small loops to extract the first, second, third, fourth and fifth elements from the bigger array and add them to arrays that, for the moment, are saved on the computer as an HTML file for accessing later. I want someone to teach me the fine art of web scraping so I can put together one webpage based on what I scrape from these websites.

This is a small thing, but I need it both for this and for an upcoming website that could be worth a bit if I get it scraped in time. It is better that I learn scraping in Python and PHP so I can apply it to my own websites. In theory I can use PHP to enter data into a MySQL database; that part is easy once you have the data. I can even learn to hack my own WordPress theme with that data, but I need to get the data before I can do any of that. If someone knows WordPress plugin integration, that would also help me with my projects.
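The "array of items, small loops, HTML file" idea above could be sketched roughly as follows. This is only an illustration under assumptions: the function names, the dictionary keys (`href`, `title`, `text`) and the output file name are made up for the example and do not come from the post.

```python
import html
from pathlib import Path


def build_links_page(items, page_title="Scraped programs"):
    """Return a small HTML page with one link per scraped item.

    `items` is assumed to be a list of dicts with the hypothetical
    keys "href", "title" and "text".
    """
    rows = []
    for item in items:
        # Escape every value so stray quotes or ampersands cannot break the markup.
        href = html.escape(item["href"], quote=True)
        title = html.escape(item["title"], quote=True)
        text = html.escape(item["text"])
        rows.append(f'<li><a href="{href}" title="{title}">{text}</a></li>')
    body = "\n".join(rows)
    return (
        "<!DOCTYPE html>\n"
        '<html><head><meta charset="utf-8">'
        f"<title>{html.escape(page_title)}</title></head>\n"
        f"<body>\n<ul>\n{body}\n</ul>\n</body></html>\n"
    )


def save_links_page(items, path):
    """Write the generated page to `path` as UTF-8."""
    Path(path).write_text(build_links_page(items), encoding="utf-8")


if __name__ == "__main__":
    scraped = [
        {"href": "https://example.com/programs/a", "title": "Series 1 Ep 1", "text": "Program A"},
        {"href": "https://example.com/programs/b", "title": "Series 2 Ep 3", "text": "Program B"},
    ]
    # "First, second, third ..." items are plain list slices in Python.
    save_links_page(scraped[:5], "programs.html")
```

Keeping the scraped items as a plain list of dicts means the same data could later be written to a database instead of a file without changing the scraping step.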
At the moment, a tutorial on scraping the main site in both Python and PHP would be appreciated. With Python and Beautiful Soup I can get down to

    <div class="section-programs">
    <p class="episode" data-keywords="abc3">
    <a style="color:#262626" href="/programs/yoohoo-and-friends/ZX6514A015S00" title="Series 1 Ep 41 Stoney Island">YooHoo And Friends</a>
    <span style="color:#6f6f6f"> - 15 episodes</span>

with the following code:

    from BeautifulSoup import BeautifulSoup
    import requests
    url = "[login to view URL]"
    r = [login to view URL] (url)
    soup = BeautifulSoup([login to view URL])
    # paragraph number for looping
    paragraph_number = len([login to view URL]('p', attrs={"class":"episode"}))
    # current paragraph
    current_paragraph = [login to view URL]('p', attrs={"class":"episode"})[0]

In PHP I haven't successfully pulled anything except the main site [login to view URL]. This is one of 5 or so sites I need to scrape, and the basic information is similar to the code above. I need the abc3 in data-keywords as the channel, and the href with a base URL added to it. I also need the a text (YooHoo And Friends) with the title data appended to it, giving "YooHoo And Friends Series 1 Ep 41"; I believe I can change "Ep" to "Episode" and the like myself, but I need to grab them first. And I need to grab the span tag, specifically the "15 episodes": this sets the loop count for that link, that is, how many hidden episodes of that program there are.

Then, for every link grabbed from the first list, if the span shows more than 1 episode, the linked page is parsed too and the resulting links go into a "got links" list. Links found on those pages are treated the same way: each href is joined with the base URL and compared with the links already collected, and added if it isn't in there yet. Finally, the programs page has some images I need to grab, such as channel images. Then a big list is made in the form <a href="abc links" title="abc program titles">program name series episode etc</a>. All links are then put into a file, and then on to the next site.

That is basically what I need from someone so I can scrape pages. Getting the main page I can do in both PHP and Python. In Python I can get down to an array of paragraphs with all the info I need; in PHP I can't even get the first element of the items I need. So I would appreciate anyone who can teach me PHP and Python scraping. David Beams
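The extraction steps described above (channel from data-keywords, href joined to a base URL, link text plus title, episode count from the span, and skipping links already collected) might look roughly like this in Python. It is a hedged sketch, not a finished scraper: it uses the standard library's `html.parser` rather than Beautiful Soup so that it runs without extra installs, `BASE_URL` is a placeholder because the real address is hidden in the post, and the names `EpisodeParser`, `link_text` and `add_new_links` are invented for illustration.

```python
import re
from html.parser import HTMLParser
from urllib.parse import urljoin

# Placeholder base URL: the real site address is hidden in the post.
BASE_URL = "https://example.com"

# The fragment quoted in the post, used here as test input.
SAMPLE = '''<div class="section-programs">
<p class="episode" data-keywords="abc3">
<a style="color:#262626" href="/programs/yoohoo-and-friends/ZX6514A015S00" title="Series 1 Ep 41 Stoney Island">YooHoo And Friends</a>
<span style="color:#6f6f6f"> - 15 episodes</span>'''


class EpisodeParser(HTMLParser):
    """Collect one dict per <p class="episode"> block."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.programs = []
        self._current = None  # dict for the <p class="episode"> being read
        self._field = None    # "name" inside <a>, "count" inside <span>

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "p":
            if "episode" in (attrs.get("class") or "").split():
                self._current = {"channel": attrs.get("data-keywords") or "",
                                 "url": "", "title": "", "name": "", "episodes": 1}
                self.programs.append(self._current)
            else:
                self._current = None  # some other paragraph: ignore its contents
        elif self._current is None:
            return
        elif tag == "a":
            # Relative hrefs become absolute by joining them with the base URL.
            self._current["url"] = urljoin(self.base_url, attrs.get("href") or "")
            self._current["title"] = attrs.get("title") or ""
            self._field = "name"
        elif tag == "span":
            self._field = "count"

    def handle_data(self, data):
        if self._current is None or self._field is None:
            return
        if self._field == "name":
            self._current["name"] += data
        else:
            match = re.search(r"(\d+)\s+episodes?", data)
            if match:
                self._current["episodes"] = int(match.group(1))

    def handle_endtag(self, tag):
        if tag == "a" and self._current is not None:
            self._current["name"] = self._current["name"].strip()
        if tag in ("a", "span"):
            self._field = None
        elif tag == "p":
            self._current = None


def link_text(program):
    """Program name plus title, with "Ep" spelled out as "Episode"."""
    return f'{program["name"]} {program["title"]}'.replace(" Ep ", " Episode ")


def add_new_links(got_links, candidates):
    """Append candidate URLs not already collected, keeping their order."""
    seen = set(got_links)
    for link in candidates:
        if link not in seen:
            seen.add(link)
            got_links.append(link)
    return got_links


if __name__ == "__main__":
    parser = EpisodeParser(BASE_URL)
    parser.feed(SAMPLE)
    parser.close()
    for program in parser.programs:
        print(program["channel"], program["url"], link_text(program), program["episodes"])
```

Programs whose `episodes` value is greater than 1 would be the ones whose pages are fetched and fed through the same parser, with `add_new_links` guarding against duplicates. Fetching the pages and downloading the channel images are left out here because the post does not show how those pages are structured.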
Project ID: 9721713

About the project

6 proposals
Remote project
Active 8 years ago

6 freelancers are bidding an average of $70 AUD for this job
Hi sir, I am a scraping expert and have done many similar projects; please check my feedback and you will see. Can you tell me more details? Then I will provide demo data for you. Thanks, Kimi
$120 AUD in 3 days
5.0 (383 reviews)
7.7
Hello. I am a web extractor. I can teach you. Thanks. Eugene
$55 AUD in 1 day
4.9 (290 reviews)
6.8
I am an experienced web developer with good hands-on knowledge of the latest relevant technologies for front-end and back-end development. I have experience with similar projects and could be a good resource for your project. I look forward to hearing from you for further discussion. I'll try to complete the work as soon as I can, with perfection.
$25 AUD in 1 day
4.7 (9 reviews)
2.8
I have worked on similar projects outside Freelancer.
$35 AUD in 5 days
5.0 (1 review)
2.7
I have done a lot of data scraping work before, such as dictionary entries and news. Accept my bid and I'll start teaching you immediately. I can give you example code showing how to scrape the movie websites you've mentioned and explain it to you. I'm familiar with PHP and can help you export the data into your desired form (CSV or MySQL). I'm sure you'll be happy to explore with me.
$150 AUD in 2 days
0.0 (0 reviews)
0.0

About the client

Faridabad, India
0.0
0
Member since Sep 30, 2011

Client verification

Other jobs from this client

Amazon AWS Lamba Python Script
$10-30 USD
Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)